part of the price index

Price your workload across every model.

Describe it in a sentence or fill in the fields. You'll see the bill for every model, the cheapest equivalent that fits, and where the money goes. Prices pull from the live index.

Workload profile Tweak anything — the board updates live.
share of system tokens served from cache
Estimated spend · recommended model
$4.04 /mo
Support bot over our docs, 5k chats/day, 3 turns each · 150K req/mo · 4.6K tok/req
<$0.0001per request
Cheapest equivalent that fits your needs
I Ling-2.6-flashinclusionAI · Mid · 262K ctx
$4.04 <$0.0001 / call
99% Switch GPT-5 → Ling-2.6-flash, save $768/mo ($9,222/yr)

Every model, priced for this workload

$/mo $/call

Ranked cheapest first · green = cheapest equivalent, your baseline marked

I Ling-2.6-flash Mid · 262K ctx $4.04 <$0.0001/call cheaper −99% Mistral NemoMid · 131K ctx $14.18 <$0.0001/call cheaper −98% Llama 3.1 8B InstructMid · 131K ctx $14.18 <$0.0001/call cheaper −98% I Granite 4.0 MicroMid · 131K ctx $16.59 $0.0001/call cheaper −98% Ministral 3 3B 2512Mid · 131K ctx $25.05 $0.0002/call cheaper −97% L LFM2-24B-A2BMid · 32K ctx $25.20 $0.0002/call cheaper −97% gpt-oss-20bMid · 131K ctx $25.62 $0.0002/call cheaper −97% Llama 3.2 1B InstructMid · 60K ctx $27.56 $0.0002/call cheaper −96% S Llama 3 8B LunarisMid · 8K ctx $27.82 $0.0002/call cheaper −96% A Nova Micro 1.0Mid · 128K ctx $29.40 $0.0002/call cheaper −96% Qwen2.5 7B InstructMid · 32K ctx $30.45 $0.0002/call cheaper −96% T Hy3 previewMid · 262K ctx $30.56 $0.0002/call cheaper −96% C Command R7B (12-2024)Mid · 128K ctx $31.50 $0.0002/call cheaper −96% GPT-5 NanoMid · 400K ctx $33.30 $0.0002/call cheaper −96% gpt-oss-120bMid · 131K ctx $34.02 $0.0002/call cheaper −96% Z GLM 4.7 FlashMid · 202K ctx $34.80 $0.0002/call cheaper −95% X MiMo-V2-FlashMid · 262K ctx $35.55 $0.0002/call cheaper −95% Voxtral Small 24B 2507Mid · 32K ctx $35.55 $0.0002/call cheaper −95% Mistral Small 3Mid · 32K ctx $35.70 $0.0002/call cheaper −95% A Trinity MiniMid · 131K ctx $36.22 $0.0002/call cheaper −95% I Granite 4.1 8BMid · 131K ctx $36.75 $0.0002/call cheaper −95% Gemma 3 4BMid · 131K ctx $36.75 $0.0002/call cheaper −95% X MiMo-V2.5Mid · 1.05M ctx $37.04 $0.0002/call cheaper −95% DeepSeek V4 FlashMid · 1M ctx $37.04 $0.0002/call cheaper −95% Ministral 3 8B 2512Mid · 262K ctx $37.58 $0.0003/call cheaper −95% S Step 3.5 FlashMid · 262K ctx $38.85 $0.0003/call cheaper −95% Gemma 3 12BMid · 131K ctx $39.38 $0.0003/call cheaper −95% Qwen3 30B A3B Instruct 2507Mid · 128K ctx $40.47 $0.0003/call cheaper −95% Gemini 2.5 Flash LiteMid · 1.05M ctx $40.80 $0.0003/call cheaper −95% Gemini 2.5 Flash Lite Preview 09-2025Mid · 1.05M ctx $40.80 $0.0003/call cheaper −95% G MythoMax 13BMid · 4K ctx context too small $40.95 $0.0003/call cheaper −95% N Nemotron 3 Nano 30B A3BMid · 262K ctx $42.00 $0.0003/call cheaper −95% Gemma 3n 4BMid · 32K ctx $44.10 $0.0003/call cheaper −94% gpt-oss-safeguard-20bMid · 131K ctx $45.00 $0.0003/call cheaper −94% GPT-4.1 NanoSmall · 1M ctx · below min tier $48.00 $0.0003/call cheaper −94% M Phi 4Mid · 16K ctx $48.30 $0.0003/call cheaper −94% P Laguna XS.2Mid · 262K ctx $49.50 $0.0003/call cheaper −94% Llama 3.2 3B InstructMid · 80K ctx $49.65 $0.0003/call cheaper −94% Ministral 3 14B 2512Mid · 262K ctx $50.10 $0.0003/call cheaper −94% A Nova Lite 1.0Mid · 300K ctx $50.40 $0.0003/call cheaper −93% I Ling-2.6-1TMid · 262K ctx $51.26 $0.0003/call cheaper −93% I Ring-2.6-1TMid · 262K ctx $51.26 $0.0003/call cheaper −93% Qwen3 8BMid · 40K ctx $52.50 $0.0004/call cheaper −93% Qwen3.5-FlashMid · 1M ctx $54.60 $0.0004/call cheaper −93% Gemma 4 26B A4BMid · 262K ctx $55.12 $0.0004/call cheaper −93% Mistral Small 3.2 24BMid · 128K ctx $57.75 $0.0004/call cheaper −93% Qwen3 Coder 30B A3B InstructMid · 160K ctx $58.28 $0.0004/call cheaper −92% Gemma 3 27BMid · 131K ctx $58.80 $0.0004/call cheaper −92% U Solar Pro 3Mid · 128K ctx $61.20 $0.0004/call cheaper −92% Qwen3 235B A22B Instruct 2507Mid · 262K ctx $61.95 $0.0004/call cheaper −92% B Seed 1.6 FlashMid · 262K ctx $63.00 $0.0004/call cheaper −92% Qwen3 32BMid · 40K ctx $65.10 $0.0004/call cheaper −92% Llama 4 ScoutMid · 1M ctx $66.15 $0.0004/call cheaper −91% R Reka EdgeMid · 16K ctx $68.25 $0.0005/call cheaper −91% Qwen3 235B A22B Thinking 2507Mid · 262K ctx $68.25 $0.0005/call cheaper −91% M Phi 4 Mini InstructMid · 128K ctx $68.78 $0.0005/call cheaper −91% Qwen3.5-9BMid · 262K ctx $70.88 $0.0005/call cheaper −91% SabaMid · 32K ctx $71.10 $0.0005/call cheaper −91% Qwen3 30B A3B Thinking 2507Mid · 131K ctx $71.40 $0.0005/call cheaper −91% B UI-TARS 7BMid · 128K ctx $73.50 $0.0005/call cheaper −90% R Reka Flash 3Mid · 65K ctx $73.50 $0.0005/call cheaper −90% Qwen3 14BMid · 40K ctx $75.60 $0.0005/call cheaper −90% Z GLM 4.5 AirMid · 131K ctx $76.12 $0.0005/call cheaper −90% Qwen3 VL 8B InstructMid · 131K ctx $76.65 $0.0005/call cheaper −90% DeepSeek V3Mid · 128K ctx $77.49 $0.0005/call cheaper −90% DeepSeek R1Frontier · 128K ctx $77.49 $0.0005/call cheaper −90% Mistral Small 4Small · 262K ctx · below min tier $78.75 $0.0005/call cheaper −90% Gemma 4 31BMid · 262K ctx $79.58 $0.0005/call cheaper −90% Llama 3.3 70B InstructMid · 131K ctx $79.80 $0.0005/call cheaper −90% N Nemotron 3 SuperMid · 262K ctx $80.32 $0.0005/call cheaper −90% B Seed-2.0-MiniMid · 262K ctx $84.00 $0.0006/call cheaper −89% Qwen3 VL 32B InstructMid · 131K ctx $87.36 $0.0006/call cheaper −89% I Mercury 2Mid · 128K ctx $88.88 $0.0006/call cheaper −88% GPT-4o-mini (2024-07-18)Mid · 128K ctx $90.00 $0.0006/call cheaper −88% Qwen3 Coder NextMid · 262K ctx $92.10 $0.0006/call cheaper −88% M MiniMax M2.5Mid · 196K ctx $93.75 $0.0006/call cheaper −88% Llama 3 8B InstructMid · 8K ctx $95.55 $0.0006/call cheaper −88% P Laguna M.1Mid · 262K ctx $99.00 $0.0007/call cheaper −87% Qwen3 Coder FlashMid · 1M ctx $99.16 $0.0007/call cheaper −87% Qwen3 30B A3BMid · 40K ctx $102 $0.0007/call cheaper −87% E Rnj 1 InstructMid · 32K ctx $102 $0.0007/call cheaper −87% Qwen3 Next 80B A3B ThinkingMid · 131K ctx $102 $0.0007/call cheaper −87% N Hermes 4 70BMid · 131K ctx $103 $0.0007/call cheaper −87% Qwen-PlusMid · 1M ctx $105 $0.0007/call cheaper −86% M MiniMax M2Mid · 196K ctx $105 $0.0007/call cheaper −86% GPT-5.4 NanoMid · 400K ctx $105 $0.0007/call cheaper −86% Codestral 2508Mid · 256K ctx $107 $0.0007/call cheaper −86% M MiniMax M2.1Mid · 196K ctx $108 $0.0007/call cheaper −86% A Trinity Large ThinkingMid · 262K ctx $108 $0.0007/call cheaper −86% Qwen3 VL 30B A3B InstructMid · 131K ctx $109 $0.0007/call cheaper −86% S Step 3.7 FlashMid · 256K ctx $110 $0.0007/call cheaper −86% X MiMo-V2.5-ProMid · 1.05M ctx $113 $0.0008/call cheaper −85% DeepSeek V4 ProFrontier · 1M ctx $113 $0.0008/call cheaper −85% M MiniMax M2.7Mid · 196K ctx $114 $0.0008/call cheaper −85% Qwen3 Next 80B A3B InstructMid · 262K ctx $114 $0.0008/call cheaper −85% T Hunyuan A13B InstructMid · 131K ctx $118 $0.0008/call cheaper −85% Z GLM 4.6VMid · 131K ctx $119 $0.0008/call cheaper −85% A Olmo 3 32B ThinkMid · 65K ctx $121 $0.0008/call cheaper −84% M MiniMax M2-herMid · 65K ctx $122 $0.0008/call cheaper −84% Llama Guard 4 12BMid · 163K ctx $123 $0.0008/call cheaper −84% GPT-4o-mini Search PreviewMid · 128K ctx $126 $0.0008/call cheaper −84% C Command R (08-2024)Mid · 128K ctx $126 $0.0008/call cheaper −84% Llama 4 MaverickFrontier · 1M ctx $126 $0.0008/call cheaper −84% C Command RMid · 128K ctx $126 $0.0008/call cheaper −84% Gemini 3.1 Flash LiteMid · 1.05M ctx $128 $0.0009/call cheaper −83% Gemini 3.1 Flash Lite PreviewMid · 1.05M ctx $128 $0.0009/call cheaper −83% Qwen3 VL 235B A22B InstructMid · 262K ctx $129 $0.0009/call cheaper −83% T Rocinante 12BMid · 32K ctx $130 $0.0009/call cheaper −83% DeepSeek V3 0324Mid · 163K ctx $135 $0.0009/call cheaper −82% DeepSeek V3.1Mid · 163K ctx $135 $0.0009/call cheaper −82% K KAT-Coder-Pro V2Mid · 256K ctx $137 $0.0009/call cheaper −82% M MiniMax M3Mid · 524K ctx $137 $0.0009/call cheaper −82% Qwen3.6 35B A3BMid · 262K ctx $141 $0.0009/call cheaper −82% Qwen3.5-35B-A3BMid · 262K ctx $141 $0.0009/call cheaper −82% T Cydonia 24B V4.1Mid · 131K ctx $143 $0.0010/call cheaper −81% Qwen3 VL 8B ThinkingMid · 131K ctx $145 $0.0010/call cheaper −81% Qwen3.7 PlusMid · 1M ctx $146 $0.0010/call cheaper −81% DeepSeek V3.1 TerminusMid · 163K ctx $153 $0.0010/call cheaper −80% GPT-5.1-Codex-MiniMid · 400K ctx $154 $0.0010/call cheaper −80% GPT-5 MiniMid · 400K ctx $154 $0.0010/call cheaper −80% DeepSeek V3.2Mid · 128K ctx $162 $0.0011/call cheaper −79% Qwen3 VL 30B A3B ThinkingMid · 131K ctx $164 $0.0011/call cheaper −79% P Perceptron Mk1Mid · 32K ctx $173 $0.0012/call cheaper −78% Qwen3.6 FlashMid · 1M ctx $177 $0.0012/call cheaper −77% Mistral Large 3 2512Mid · 262K ctx $178 $0.0012/call cheaper −77% P INTELLECT-3Mid · 131K ctx $184 $0.0012/call cheaper −76% M MiniMax-01Mid · 1M ctx $184 $0.0012/call cheaper −76% Mistral Medium 3Mid · 131K ctx $184 $0.0012/call cheaper −76% Devstral 2 2512Mid · 262K ctx $184 $0.0012/call cheaper −76% Mistral Medium 3.1Mid · 131K ctx $184 $0.0012/call cheaper −76% Z GLM 4.7Mid · 202K ctx $190 $0.0013/call cheaper −75% Nano Banana (Gemini 2.5 Flash Image)Mid · 32K ctx $191 $0.0013/call cheaper −75% Gemini 2.5 FlashMid · 1M ctx $191 $0.0013/call cheaper −75% DeepSeek V3.2 ExpMid · 163K ctx $192 $0.0013/call cheaper −75% GPT-4.1 MiniSmall · 1M ctx · below min tier $192 $0.0013/call cheaper −75% Z GLM 4.6Mid · 202K ctx $194 $0.0013/call cheaper −75% R1 Distill Qwen 32BMid · 32K ctx $198 $0.0013/call cheaper −74% Qwen Plus 0728Mid · 1M ctx $205 $0.0014/call cheaper −73% Qwen3.5-27BMid · 262K ctx $205 $0.0014/call cheaper −73% Qwen Plus 0728 (thinking)Mid · 1M ctx $205 $0.0014/call cheaper −73% Qwen3 Coder 480B A35BMid · 262K ctx $233 $0.0016/call cheaper −70% Llama 3.2 11B Vision InstructMid · 131K ctx $235 $0.0016/call cheaper −70% Z GLM 4.5VMid · 65K ctx $237 $0.0016/call cheaper −69% N Nemotron 3 UltraMid · 262K ctx $238 $0.0016/call cheaper −69% T Skyfall 36B V2Mid · 32K ctx $244 $0.0016/call cheaper −68% Qwen3.5 Plus 2026-02-15Mid · 1M ctx $246 $0.0016/call cheaper −68% Qwen2.5 72B InstructMid · 32K ctx $248 $0.0017/call cheaper −68% Z GLM 5Mid · 202K ctx $248 $0.0017/call cheaper −68% Mistral Small 3.1 24BMid · 128K ctx $250 $0.0017/call cheaper −68% Gemini 3 FlashMid · 1M ctx $256 $0.0017/call cheaper −67% Gemini 3 Flash PreviewMid · 1.05M ctx $256 $0.0017/call cheaper −67% Z GLM 4.5Mid · 131K ctx $258 $0.0017/call cheaper −67% B Seed-2.0-LiteMid · 262K ctx $262 $0.0018/call cheaper −66% B Seed 1.6Mid · 262K ctx $262 $0.0018/call cheaper −66% N Llama 3.3 Nemotron Super 49B V1.5Mid · 131K ctx $273 $0.0018/call cheaper −65% T UnslopNemo 12BMid · 32K ctx $273 $0.0018/call cheaper −65% Llama 3.1 70B InstructMid · 131K ctx $273 $0.0018/call cheaper −65% Qwen3.5-122B-A10BMid · 262K ctx $273 $0.0018/call cheaper −65% Qwen3.5 Plus 2026-04-20Mid · 1M ctx $284 $0.0019/call cheaper −63% Kimi K2.5Frontier · 256K ctx $296 $0.0020/call cheaper −62% A Aion-2.0Mid · 131K ctx $300 $0.0020/call cheaper −61% Qwen3 VL 235B A22B ThinkingMid · 131K ctx $300 $0.0020/call cheaper −61% Qwen3 MaxFrontier · 256K ctx $302 $0.0020/call cheaper −61% Llama Guard 3 8BMid · 131K ctx $306 $0.0020/call cheaper −60% Qwen3.6 PlusMid · 1M ctx $307 $0.0020/call cheaper −60% Kimi K2.7 CodeMid · 262K ctx $315 $0.0021/call cheaper −59% U ReMM SLERP 13BMid · 6K ctx $318 $0.0021/call cheaper −59% A Nova 2 LiteMid · 1M ctx $320 $0.0021/call cheaper −59% B ERNIE 4.5 VL 424B A47BMid · 123K ctx $330 $0.0022/call cheaper −57% Qwen3 Coder PlusMid · 1M ctx $331 $0.0022/call cheaper −57% Qwen3.6 27BMid · 262K ctx $348 $0.0023/call cheaper −55% Grok Build 0.1Mid · 256K ctx $351 $0.0023/call cheaper −55% R1 0528Mid · 163K ctx $356 $0.0024/call cheaper −54% A Coder LargeMid · 32K ctx $357 $0.0024/call cheaper −54% Llama 3 70B InstructMid · 8K ctx $360 $0.0024/call cheaper −53% Qwen2.5 VL 72B InstructMid · 128K ctx $364 $0.0024/call cheaper −53% M MiniMax M1Mid · 1M ctx $368 $0.0024/call cheaper −52% Qwen3.5 397B A17BMid · 131K ctx $371 $0.0025/call cheaper −52% Qwen3 235B A22BMid · 131K ctx $382 $0.0025/call cheaper −51% GPT-5.4 MiniMid · 400K ctx $385 $0.0026/call cheaper −50% Mistral Large 3Frontier · 262K ctx $394 $0.0026/call cheaper −49% GPT-3.5 TurboMid · 16K ctx $394 $0.0026/call cheaper −49% Grok 4.20 Multi-AgentMid · 2M ctx $415 $0.0028/call cheaper −46% Grok 4.3Frontier · 1M ctx $415 $0.0028/call cheaper −46% Grok 4.20Frontier · 2M ctx $415 $0.0028/call cheaper −46% M WizardLM-2 8x22BMid · 65K ctx $423 $0.0028/call cheaper −45% Kimi K2.6Frontier · 256K ctx $429 $0.0029/call cheaper −44% Gemma 2 27BMid · 8K ctx $444 $0.0030/call cheaper −43% S Llama 3.3 Euryale 70BMid · 131K ctx $449 $0.0030/call cheaper −42% Claude Haiku 4.5Small · 200K ctx · below min tier $460 $0.0031/call cheaper −40% Qwen2.5 Coder 32B InstructMid · 32K ctx $468 $0.0031/call cheaper −39% Nano Banana 2 (Gemini 3.1 Flash Image Preview)Mid · 131K ctx $472 $0.0032/call cheaper −39% Nano Banana 2 (Gemini 3.1 Flash Image)Mid · 65K ctx $472 $0.0032/call cheaper −39% N Hermes 3 70B InstructMid · 131K ctx $478 $0.0032/call cheaper −38% Kimi K2 0711Mid · 131K ctx $480 $0.0032/call cheaper −38% Z GLM 5.2Mid · 1.05M ctx $491 $0.0033/call cheaper −36% GPT Audio MiniMid · 128K ctx $504 $0.0034/call cheaper −35% Z GLM 5 TurboMid · 262K ctx $505 $0.0034/call cheaper −35% Kimi K2 0905Mid · 262K ctx $509 $0.0034/call cheaper −34% Kimi K2 ThinkingMid · 262K ctx $509 $0.0034/call cheaper −34% A Aion-1.0-MiniMid · 131K ctx $514 $0.0034/call cheaper −33% M Weaver (alpha)Mid · 8K ctx $525 $0.0035/call cheaper −32% o4-miniSmall · 200K ctx · below min tier $528 $0.0035/call cheaper −32% o4 Mini HighMid · 200K ctx $528 $0.0035/call cheaper −32% A Virtuoso LargeMid · 131K ctx $536 $0.0036/call cheaper −31% Z GLM 5.1Mid · 202K ctx $544 $0.0036/call cheaper −30% R1 Distill Llama 70BMid · 8K ctx $546 $0.0036/call cheaper −29% M Morph V3 FastMid · 81K ctx $567 $0.0038/call cheaper −27% R1Mid · 64K ctx $572 $0.0038/call cheaper −26% S Llama 3.1 Euryale 70B v2.2Mid · 131K ctx $580 $0.0039/call cheaper −25% A Aion-RP 1.0 (8B)Mid · 32K ctx $588 $0.0039/call cheaper −24% GPT-5 Image MiniMid · 400K ctx $600 $0.0040/call cheaper −22% R Relace Apply 3Mid · 256K ctx $601 $0.0040/call cheaper −22% o3 Mini HighMid · 200K ctx $660 $0.0044/call cheaper −15% M Morph V3 LargeMid · 262K ctx $667 $0.0044/call cheaper −14% A Nova Pro 1.0Mid · 300K ctx $672 $0.0045/call cheaper −13% N Hermes 3 405B InstructMid · 131K ctx $682 $0.0046/call cheaper −12% P SonarMid · 127K ctx $682 $0.0046/call cheaper −12% W Palmyra X5Mid · 1.04M ctx $693 $0.0046/call cheaper −10% Qwen3 Max ThinkingMid · 262K ctx $696 $0.0046/call cheaper −10% Mistral Large 2407Mid · 131K ctx $711 $0.0047/call cheaper −8% Mixtral 8x22B InstructMid · 65K ctx $711 $0.0047/call cheaper −8% S Switchpoint RouterMid · 131K ctx $714 $0.0048/call cheaper −8% GPT-3.5 Turbo (older v0613)Mid · 4K ctx context too small $735 $0.0049/call cheaper −5% Gemini 3.5 FlashMid · 1M ctx $770 $0.0051/call GPT-5Frontier · 1M ctx · baseline $772 $0.0052/call Gemini 2.5 ProFrontier · 1M ctx $772 $0.0052/call GPT-5.1-Codex-MaxMid · 400K ctx $772 $0.0052/call GPT-5 ChatMid · 128K ctx $772 $0.0052/call Gemini 2.5 Pro Preview 06-05Mid · 1.05M ctx $772 $0.0052/call Gemini 2.5 Pro Preview 05-06Mid · 1.05M ctx $772 $0.0052/call GPT-5 CodexMid · 400K ctx $772 $0.0052/call GPT-5.1-CodexMid · 400K ctx $775 $0.0052/call GPT-5.1 ChatMid · 128K ctx $775 $0.0052/call GPT-5.1Mid · 400K ctx $775 $0.0052/call N Hermes 4 405BMid · 131K ctx $788 $0.0052/call dearer +2% R Relace SearchMid · 256K ctx $788 $0.0052/call dearer +2% D Cogito v2.1 671BMid · 128K ctx $853 $0.0057/call dearer +10% o3Frontier · 200K ctx $960 $0.0064/call dearer +24% GPT-4.1Mid · 1M ctx $960 $0.0064/call dearer +24% o4 Mini Deep ResearchMid · 200K ctx $960 $0.0064/call dearer +24% Qwen3.6 Max PreviewMid · 262K ctx $983 $0.0066/call dearer +27% Gemini 3 ProFrontier · 1M ctx $1,026 $0.0068/call dearer +33% Gemini 3.1 ProFrontier · 1M ctx $1,026 $0.0068/call dearer +33% Nano Banana Pro (Gemini 3 Pro Image)Mid · 65K ctx $1,026 $0.0068/call dearer +33% Gemini 3.1 Pro Preview Custom ToolsMid · 1.05M ctx $1,026 $0.0068/call dearer +33% Gemini 3.1 Pro PreviewMid · 1.05M ctx $1,026 $0.0068/call dearer +33% Nano Banana Pro (Gemini 3 Pro Image Preview)Mid · 65K ctx $1,026 $0.0068/call dearer +33% GPT-3.5 Turbo InstructMid · 4K ctx context too small $1,050 $0.0070/call dearer +36% GPT-5.3 ChatMid · 128K ctx $1,082 $0.0072/call dearer +40% GPT-5.2 ChatMid · 128K ctx $1,082 $0.0072/call dearer +40% GPT-5.3-CodexMid · 400K ctx $1,082 $0.0072/call dearer +40% GPT-5.2-CodexMid · 400K ctx $1,082 $0.0072/call dearer +40% GPT-5.2Mid · 400K ctx $1,082 $0.0072/call dearer +40% GPT-5.4Mid · 1.05M ctx $1,283 $0.0086/call dearer +66% A Nova Premier 1.0Mid · 1M ctx $1,331 $0.0089/call dearer +72% Mistral Medium 3.5Mid · 128K ctx $1,339 $0.0089/call dearer +73% Claude Sonnet 4.6Mid · 1M ctx $1,382 $0.0092/call dearer +79% Claude Sonnet 4.5Mid · 1M ctx $1,382 $0.0092/call dearer +79% GPT-4o (2024-11-20)Mid · 128K ctx $1,500 $0.010/call dearer +94% GPT-4o (2024-08-06)Mid · 128K ctx $1,500 $0.010/call dearer +94% A Jamba Large 1.7Mid · 256K ctx $1,680 $0.011/call dearer +117% P Sonar Deep ResearchMid · 128K ctx $1,680 $0.011/call dearer +117% P Sonar Reasoning ProMid · 128K ctx $1,680 $0.011/call dearer +117% Qwen 3.7 MaxFrontier · 1M ctx $1,969 $0.013/call dearer +155% S Llama 3.1 70B Hanami x1Mid · 16K ctx $2,048 $0.014/call dearer +165% I Inflection 3 PiMid · 8K ctx $2,100 $0.014/call dearer +172% GPT AudioMid · 128K ctx $2,100 $0.014/call dearer +172% C Command R PlusFrontier · 128K ctx $2,100 $0.014/call dearer +172% C Command R+ (08-2024)Mid · 128K ctx $2,100 $0.014/call dearer +172% GPT-4o Search PreviewMid · 128K ctx $2,100 $0.014/call dearer +172% I Inflection 3 ProductivityMid · 8K ctx $2,100 $0.014/call dearer +172% GPT-3.5 Turbo 16kMid · 16K ctx $2,100 $0.014/call dearer +172% C Command AMid · 256K ctx $2,100 $0.014/call dearer +172% A Magnum v4 72BMid · 16K ctx $2,153 $0.014/call dearer +179% Claude Opus 4.6Frontier · 1M ctx $2,303 $0.015/call dearer +198% Claude Opus 4.5Frontier · 200K ctx $2,303 $0.015/call dearer +198% Claude Opus 4.8Frontier · 1M ctx $2,303 $0.015/call dearer +198% Claude Opus 4.7Frontier · 1M ctx $2,303 $0.015/call dearer +198% GPT-5.5Frontier · 1M ctx $2,565 $0.017/call dearer +232% GPT-5 ImageMid · 400K ctx $2,625 $0.018/call dearer +240% P Sonar Pro SearchMid · 200K ctx $2,678 $0.018/call dearer +247% P Sonar ProMid · 200K ctx $2,678 $0.018/call dearer +247% A Aion-1.0Mid · 131K ctx $2,940 $0.020/call dearer +281% GPT-5.4 Image 2Mid · 272K ctx $2,948 $0.020/call dearer +282% GPT-4o (2024-05-13)Mid · 128K ctx $3,938 $0.026/call dearer +410% Claude Fable 5Frontier · 1M ctx $4,605 $0.031/call dearer +496% o3 Deep ResearchMid · 200K ctx $4,800 $0.032/call dearer +521% GPT-4 Turbo PreviewMid · 128K ctx $7,875 $0.052/call dearer +919% GPT-5 ProMid · 400K ctx $15,750 $0.105/call dearer +1939% o3-proFrontier · 200K ctx $16,800 $0.112/call dearer +2075% o3 ProMid · 200K ctx $16,800 $0.112/call dearer +2075% GPT-5.2 ProMid · 400K ctx $22,050 $0.147/call dearer +2754% GPT-5.5 ProFrontier · 1M ctx $28,350 $0.189/call dearer +3570% GPT-5.4 ProMid · 1.05M ctx $28,350 $0.189/call dearer +3570% o1-proMid · 200K ctx $126,000 $0.840/call dearer +16211%

Where the money goes

Ling-2.6-flash · $4.04/mo
Input 61% $2.46/mo Output 39% $1.58/mo
Prompt caching at 80% saves $3.84/mo on input.

This workload, priced through history

cheapest model that fit, on each date
Mar 2024$394
99%
today$4.04
See what drove the drops

Cut it further

contextual levers for this workload
Cache the context

You reuse 4K context tokens each call. Cached input bills up to 90% cheaper.

≈ $0.72/mo at 95% hit
Route by difficulty

Send easy calls to a small model and reserve a frontier model for the hard ones. A 70/30 split often halves spend.

Use the Batch API

Non-urgent jobs (eval, backfill, summarization) run ~50% cheaper on async batch endpoints.

≈ −50% on batchable volume
Read the cost-cutting playbook

Assumptions

  • Prices are USD per 1M tokens, at current list rates.
  • Dimensions priced: input, output, and cached input where offered.
  • Cache hit rate applies to your system / reused tokens only.
  • Excludes batch discounts, free tiers, image/audio, and rate-limit effects.
  • “Cheapest equivalent” respects your context-window and minimum-capability constraints.
A cost comparison, not a quality verdict. A cheaper model can change output quality; validate any switch with your own eval before moving production traffic.
Always-current, history-backed list prices — nothing you enter leaves your browser.
Permalink encodes this exact workload
Coming soon · the measure product

Measure your real usage

Paste a production trace or connect your provider data and get your exact cost per call — then the same cheapest-equivalent recommendation, on the calls you actually ship.

Paste a trace Connect your data Exact cost & savings

No spam — one note when it ships. This is the separate measure product, not part of the estimator.

Alert me when this estimate changes

Get an email when a price change shifts this bill — we'll name the model and the new number.