Mercury 2

Inception · Released Mar 2026

Mercury 2 is a reasoning model from Inception that uses diffusion-based token generation to produce and refine multiple tokens in parallel rather than sequentially.

Strengths: The parallel token generation approach enables faster inference speeds compared to autoregressive models while maintaining reasoning capability.
Best for: Tasks requiring complex reasoning where inference latency is a primary concern.
Limitations: As a reasoning diffusion model, it may have different quality characteristics or trade-offs compared to traditional sequential reasoning models, and support for integrations may be limited given the novel architecture.

Input / 1M

$0.25

Output / 1M

$0.75

Cached input / 1M

$0.025

Context window

Price history

Snapshots

Effective	Input	Output	Cached in	Note	Source
11 Jun 2026	$0.25	$0.75	$0.025	Imported from OpenRouter	openrouter.ai

Data updated Jun 29, 2026 Report a problem