Mercury 2

Inception · Mid

Compare →

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...

Input / 1M

$0.25

Output / 1M

$0.75

Cached input / 1M

$0.025

Context window

128K

Where it sits

98th-cheapest mid model

by blended $/Mtok among 262 listed mid models

34% below the mid median

blended $/Mtok across 262 mid models

Output costs 3× input

$0.25 in / $0.75 out per 1M

Cached input saves 90%

$0.025 vs $0.25 per 1M fresh

Held flat since launch (Jun 2026)

no blended price change recorded

Computed live from current prices and this model's history — not hand-written, so it stays accurate as prices move.

Price history

Only one price on record so far — the history chart appears once a price changes.

Snapshots

Effective Input Output Cached in Note Source
11 Jun 2026 $0.25 $0.75 $0.025 Imported from OpenRouter openrouter.ai