Market events
Every price cut, repricing, and model launch we've tracked — 29 market events and 299 launches, going back to 2023.
2026
-
Fugu Ultra released
Sakana ships Fugu Ultra at $5.00 in / $30.00 out per 1M.
-
Nano Banana Pro (Gemini 3 Pro Image) released
Google ships Nano Banana Pro (Gemini 3 Pro Image) at $2.00 in / $12.00 out per 1M.
-
Nano Banana 2 (Gemini 3.1 Flash Image) released
Google ships Nano Banana 2 (Gemini 3.1 Flash Image) at $0.5 in / $3.00 out per 1M.
-
GLM 5.2 released
Z.ai ships GLM 5.2 at $0.95 in / $3.00 out per 1M.
-
Kimi K2.7 Code released
Moonshot AI ships Kimi K2.7 Code at $0.74 in / $3.50 out per 1M.
-
Claude Fable 5 released
Anthropic ships Claude Fable 5 at $10.00 in / $50.00 out per 1M.
-
Nex-N2-Pro released
Nex AGI ships Nex-N2-Pro at $0.25 in / $1.00 out per 1M.
-
Nemotron 3 Ultra released
NVIDIA ships Nemotron 3 Ultra at $0.5 in / $2.20 out per 1M.
-
Qwen3.7 Plus released
Alibaba ships Qwen3.7 Plus at $0.32 in / $1.28 out per 1M.
-
MiniMax M3 released
MiniMax ships MiniMax M3 at $0.3 in / $1.20 out per 1M.
-
Step 3.7 Flash released
StepFun ships Step 3.7 Flash at $0.2 in / $1.15 out per 1M.
-
Claude Opus 4.8 released
Anthropic ships Claude Opus 4.8 at $5.00 in / $25.00 out per 1M.
-
DeepSeek V4 Pro 75% cut
DeepSeek makes a 75% promotional discount permanent, pricing V4 Pro at $0.435 in / $0.87 out per 1M.
-
Qwen 3.7 Max released
Alibaba ships Qwen 3.7 Max at $2.50 in / $7.50 out per 1M.
-
Grok Build 0.1 released
xAI ships Grok Build 0.1 at $1.00 in / $2.00 out per 1M.
-
Cheap Flash era ends
Gemini 3.5 Flash ships at $1.50/$9 per 1M — triple its predecessor — as Google reprices Flash from budget tier toward Pro territory.
-
Gemini 3.5 Flash released
Google ships Gemini 3.5 Flash at $1.50 in / $9.00 out per 1M.
-
Perceptron Mk1 released
Perceptron ships Perceptron Mk1 at $0.15 in / $1.50 out per 1M.
-
Ring-2.6-1T released
inclusionAI ships Ring-2.6-1T at $0.075 in / $0.625 out per 1M.
-
Gemini 3.1 Flash Lite released
Google ships Gemini 3.1 Flash Lite at $0.25 in / $1.50 out per 1M.
-
Grok 4.3 released
xAI ships Grok 4.3 at $1.25 in / $2.50 out per 1M.
-
Granite 4.1 8B released
IBM ships Granite 4.1 8B at $0.05 in / $0.1 out per 1M.
-
Mistral Medium 3.5 released
Mistral ships Mistral Medium 3.5 at $1.50 in / $7.50 out per 1M.
-
Laguna XS.2 released
Poolside ships Laguna XS.2 at $0.1 in / $0.2 out per 1M.
-
Laguna M.1 released
Poolside ships Laguna M.1 at $0.2 in / $0.4 out per 1M.
-
Qwen3.6 Max Preview released
Alibaba ships Qwen3.6 Max Preview at $1.04 in / $6.24 out per 1M.
-
Qwen3.6 Flash released
Alibaba ships Qwen3.6 Flash at $0.1875 in / $1.12 out per 1M.
-
Qwen3.6 35B A3B released
Alibaba ships Qwen3.6 35B A3B at $0.14 in / $1.00 out per 1M.
-
Qwen3.6 27B released
Alibaba ships Qwen3.6 27B at $0.2885 in / $3.17 out per 1M.
-
Qwen3.5 Plus 2026-04-20 released
Alibaba ships Qwen3.5 Plus 2026-04-20 at $0.3 in / $1.80 out per 1M.
-
GPT-5.5 Pro released
OpenAI ships GPT-5.5 Pro at $30.00 in / $180.00 out per 1M.
-
DeepSeek V4 Pro released
DeepSeek ships DeepSeek V4 Pro at $0.435 in / $0.87 out per 1M.
-
DeepSeek V4 Flash released
DeepSeek ships DeepSeek V4 Flash at $0.14 in / $0.28 out per 1M.
-
GPT-5.5 raises frontier prices
GPT-5.5 launches at $5/$30 per 1M, a sharp step up from GPT-5's commodity pricing — frontier labs begin testing price tolerance.
-
Ling-2.6-1T released
inclusionAI ships Ling-2.6-1T at $0.075 in / $0.625 out per 1M.
-
GPT-5.5 released
OpenAI ships GPT-5.5 at $5.00 in / $30.00 out per 1M.
-
MiMo-V2.5-Pro released
Xiaomi ships MiMo-V2.5-Pro at $0.435 in / $0.87 out per 1M.
-
MiMo-V2.5 released
Xiaomi ships MiMo-V2.5 at $0.105 in / $0.28 out per 1M.
-
Hy3 preview released
Tencent ships Hy3 preview at $0.063 in / $0.21 out per 1M.
-
Ling-2.6-flash released
inclusionAI ships Ling-2.6-flash at $0.01 in / $0.03 out per 1M.
-
GPT-5.4 Image 2 released
OpenAI ships GPT-5.4 Image 2 at $8.00 in / $15.00 out per 1M.
-
Kimi K2.6 released
Moonshot AI ships Kimi K2.6 at $0.95 in / $4.00 out per 1M.
-
Claude Opus 4.7 released
Anthropic ships Claude Opus 4.7 at $5.00 in / $25.00 out per 1M.
-
GLM 5.1 released
Z.ai ships GLM 5.1 at $0.98 in / $3.08 out per 1M.
-
Gemma 4 26B A4B released
Google ships Gemma 4 26B A4B at $0.06 in / $0.33 out per 1M.
-
Qwen3.6 Plus released
Alibaba ships Qwen3.6 Plus at $0.325 in / $1.95 out per 1M.
-
Gemma 4 31B released
Google ships Gemma 4 31B at $0.12 in / $0.35 out per 1M.
-
Trinity Large Thinking released
Arcee AI ships Trinity Large Thinking at $0.25 in / $0.8 out per 1M.
-
GLM 5V Turbo released
Z.ai ships GLM 5V Turbo at $1.20 in / $4.00 out per 1M.
-
Grok 4.20 Multi-Agent released
xAI ships Grok 4.20 Multi-Agent at $1.25 in / $2.50 out per 1M.
-
KAT-Coder-Pro V2 released
Kwaipilot ships KAT-Coder-Pro V2 at $0.3 in / $1.20 out per 1M.
-
Reka Edge released
Rekaai ships Reka Edge at $0.1 in / $0.1 out per 1M.
-
MiniMax M2.7 released
MiniMax ships MiniMax M2.7 at $0.24 in / $0.96 out per 1M.
-
GPT-5.4 Nano released
OpenAI ships GPT-5.4 Nano at $0.2 in / $1.25 out per 1M.
-
GPT-5.4 Mini released
OpenAI ships GPT-5.4 Mini at $0.75 in / $4.50 out per 1M.
-
Mistral Small 4 released
Mistral ships Mistral Small 4 at $0.1 in / $0.3 out per 1M.
-
GLM 5 Turbo released
Z.ai ships GLM 5 Turbo at $1.20 in / $4.00 out per 1M.
-
Nemotron 3 Super released
NVIDIA ships Nemotron 3 Super at $0.09 in / $0.45 out per 1M.
-
Seed-2.0-Lite released
ByteDance Seed ships Seed-2.0-Lite at $0.25 in / $2.00 out per 1M.
-
Qwen3.5-9B released
Alibaba ships Qwen3.5-9B at $0.1 in / $0.15 out per 1M.
-
Grok 4.20 released
xAI ships Grok 4.20 at $1.25 in / $2.50 out per 1M.
-
GPT-5.4 released
OpenAI ships GPT-5.4 at $2.50 in / $15.00 out per 1M.
-
GPT-5.4 Pro released
OpenAI ships GPT-5.4 Pro at $30.00 in / $180.00 out per 1M.
-
Mercury 2 released
Inception ships Mercury 2 at $0.25 in / $0.75 out per 1M.
-
Gemini 3.1 Flash Lite Preview released
Google ships Gemini 3.1 Flash Lite Preview at $0.25 in / $1.50 out per 1M.
-
GPT-5.3 Chat released
OpenAI ships GPT-5.3 Chat at $1.75 in / $14.00 out per 1M.
-
Seed-2.0-Mini released
ByteDance Seed ships Seed-2.0-Mini at $0.1 in / $0.4 out per 1M.
-
Nano Banana 2 (Gemini 3.1 Flash Image Preview) released
Google ships Nano Banana 2 (Gemini 3.1 Flash Image Preview) at $0.5 in / $3.00 out per 1M.
-
Qwen3.5-Flash released
Alibaba ships Qwen3.5-Flash at $0.065 in / $0.26 out per 1M.
-
Qwen3.5-35B-A3B released
Alibaba ships Qwen3.5-35B-A3B at $0.14 in / $1.00 out per 1M.
-
Qwen3.5-27B released
Alibaba ships Qwen3.5-27B at $0.195 in / $1.56 out per 1M.
-
Qwen3.5-122B-A10B released
Alibaba ships Qwen3.5-122B-A10B at $0.26 in / $2.08 out per 1M.
-
LFM2-24B-A2B released
LiquidAI ships LFM2-24B-A2B at $0.03 in / $0.12 out per 1M.
-
Gemini 3.1 Pro Preview Custom Tools released
Google ships Gemini 3.1 Pro Preview Custom Tools at $2.00 in / $12.00 out per 1M.
-
GPT-5.3-Codex released
OpenAI ships GPT-5.3-Codex at $1.75 in / $14.00 out per 1M.
-
Aion-2.0 released
AionLabs ships Aion-2.0 at $0.8 in / $1.60 out per 1M.
-
Gemini 3.1 Pro released
Google ships Gemini 3.1 Pro at $2.00 in / $12.00 out per 1M.
-
Gemini 3.1 Pro Preview released
Google ships Gemini 3.1 Pro Preview at $2.00 in / $12.00 out per 1M.
-
Claude Sonnet 4.6 released
Anthropic ships Claude Sonnet 4.6 at $3.00 in / $15.00 out per 1M.
-
Qwen3.5 Plus 2026-02-15 released
Alibaba ships Qwen3.5 Plus 2026-02-15 at $0.26 in / $1.56 out per 1M.
-
Qwen3.5 397B A17B released
Alibaba ships Qwen3.5 397B A17B at $0.385 in / $2.45 out per 1M.
-
MiniMax M2.5 released
MiniMax ships MiniMax M2.5 at $0.15 in / $0.9 out per 1M.
-
GLM 5 released
Z.ai ships GLM 5 at $0.6 in / $1.92 out per 1M.
-
Qwen3 Max Thinking released
Alibaba ships Qwen3 Max Thinking at $0.78 in / $3.90 out per 1M.
-
Claude Opus 4.6 released
Anthropic ships Claude Opus 4.6 at $5.00 in / $25.00 out per 1M.
-
Qwen3 Coder Next released
Alibaba ships Qwen3 Coder Next at $0.11 in / $0.8 out per 1M.
-
Step 3.5 Flash released
StepFun ships Step 3.5 Flash at $0.09 in / $0.3 out per 1M.
-
Solar Pro 3 released
Upstage ships Solar Pro 3 at $0.15 in / $0.6 out per 1M.
-
Kimi K2.5 released
Moonshot AI ships Kimi K2.5 at $0.6 in / $3.00 out per 1M.
-
MiniMax M2-her released
MiniMax ships MiniMax M2-her at $0.3 in / $1.20 out per 1M.
-
Palmyra X5 released
Writer ships Palmyra X5 at $0.6 in / $6.00 out per 1M.
-
GPT Audio released
OpenAI ships GPT Audio at $2.50 in / $10.00 out per 1M.
-
GPT Audio Mini released
OpenAI ships GPT Audio Mini at $0.6 in / $2.40 out per 1M.
-
GLM 4.7 Flash released
Z.ai ships GLM 4.7 Flash at $0.06 in / $0.4 out per 1M.
-
GPT-5.2-Codex released
OpenAI ships GPT-5.2-Codex at $1.75 in / $14.00 out per 1M.
2025
-
Seed 1.6 released
ByteDance Seed ships Seed 1.6 at $0.25 in / $2.00 out per 1M.
-
Seed 1.6 Flash released
ByteDance Seed ships Seed 1.6 Flash at $0.075 in / $0.3 out per 1M.
-
MiniMax M2.1 released
MiniMax ships MiniMax M2.1 at $0.29 in / $0.95 out per 1M.
-
GLM 4.7 released
Z.ai ships GLM 4.7 at $0.4 in / $1.75 out per 1M.
-
Gemini 3 Flash released
Google ships Gemini 3 Flash at $0.5 in / $3.00 out per 1M.
-
Gemini 3 Flash Preview released
Google ships Gemini 3 Flash Preview at $0.5 in / $3.00 out per 1M.
-
Nemotron 3 Nano 30B A3B released
NVIDIA ships Nemotron 3 Nano 30B A3B at $0.05 in / $0.2 out per 1M.
-
MiMo-V2-Flash released
Xiaomi ships MiMo-V2-Flash at $0.1 in / $0.3 out per 1M.
-
GPT-5.2 released
OpenAI ships GPT-5.2 at $1.75 in / $14.00 out per 1M.
-
GPT-5.2 Pro released
OpenAI ships GPT-5.2 Pro at $21.00 in / $168.00 out per 1M.
-
GPT-5.2 Chat released
OpenAI ships GPT-5.2 Chat at $1.75 in / $14.00 out per 1M.
-
Devstral 2 2512 released
Mistral ships Devstral 2 2512 at $0.4 in / $2.00 out per 1M.
-
Relace Search released
Relace ships Relace Search at $1.00 in / $3.00 out per 1M.
-
GLM 4.6V released
Z.ai ships GLM 4.6V at $0.3 in / $0.9 out per 1M.
-
Rnj 1 Instruct released
EssentialAI ships Rnj 1 Instruct at $0.15 in / $0.15 out per 1M.
-
GPT-5.1-Codex-Max released
OpenAI ships GPT-5.1-Codex-Max at $1.25 in / $10.00 out per 1M.
-
Mistral Large 3: 75% cheaper
Mistral's open-weight frontier flagship lands at $0.50/$1.50 per 1M — 75% below Large 2 — keeping open-model pressure on closed pricing.
-
Nova 2 Lite released
Amazon ships Nova 2 Lite at $0.3 in / $2.50 out per 1M.
-
Mistral Large 3 released
Mistral ships Mistral Large 3 at $0.5 in / $1.50 out per 1M.
-
Ministral 3 8B 2512 released
Mistral ships Ministral 3 8B 2512 at $0.15 in / $0.15 out per 1M.
-
Ministral 3 3B 2512 released
Mistral ships Ministral 3 3B 2512 at $0.1 in / $0.1 out per 1M.
-
Ministral 3 14B 2512 released
Mistral ships Ministral 3 14B 2512 at $0.2 in / $0.2 out per 1M.
-
Trinity Mini released
Arcee AI ships Trinity Mini at $0.045 in / $0.15 out per 1M.
-
Mistral Large 3 2512 released
Mistral ships Mistral Large 3 2512 at $0.5 in / $1.50 out per 1M.
-
DeepSeek V3.2 released
DeepSeek ships DeepSeek V3.2 at $0.2288 in / $0.3432 out per 1M.
-
INTELLECT-3 released
Prime Intellect ships INTELLECT-3 at $0.2 in / $1.10 out per 1M.
-
Opus gets 67% cheaper
Anthropic drops Opus pricing from $15/$75 to $5/$25 with the Opus 4.5 release.
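The percentage in the headline follows directly from the two list prices. A minimal sketch of that arithmetic, assuming prices are in USD per 1M tokens; the helper name `percent_cut` is hypothetical and used only for illustration:

```python
def percent_cut(old_price: float, new_price: float) -> float:
    """Return the price reduction as a percentage of the old price."""
    if old_price <= 0:
        raise ValueError("old_price must be positive")
    return (old_price - new_price) / old_price * 100.0


# Opus list prices in USD per 1M tokens, taken from the entry above.
input_cut = percent_cut(15.00, 5.00)    # input: $15 -> $5
output_cut = percent_cut(75.00, 25.00)  # output: $75 -> $25

print(f"input: {input_cut:.0f}% cheaper, output: {output_cut:.0f}% cheaper")
```

Both sides fall by the same fraction (two-thirds), which is why a single headline figure covers input and output.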
-
Claude Opus 4.5 released
Anthropic ships Claude Opus 4.5 at $5.00 in / $25.00 out per 1M.
-
Olmo 3 32B Think released
AllenAI ships Olmo 3 32B Think at $0.15 in / $0.5 out per 1M.
-
Nano Banana Pro (Gemini 3 Pro Image Preview) released
Google ships Nano Banana Pro (Gemini 3 Pro Image Preview) at $2.00 in / $12.00 out per 1M.
-
Gemini 3 Pro released
Google ships Gemini 3 Pro at $2.00 in / $12.00 out per 1M.
-
Qwen3 Max halved in price war
Alibaba cuts Qwen3 Max roughly 50% as China's AI price war reignites, pressuring domestic and global rivals alike.
-
GPT-5.1-Codex-Mini released
OpenAI ships GPT-5.1-Codex-Mini at $0.25 in / $2.00 out per 1M.
-
GPT-5.1-Codex released
OpenAI ships GPT-5.1-Codex at $1.25 in / $10.00 out per 1M.
-
GPT-5.1 released
OpenAI ships GPT-5.1 at $1.25 in / $10.00 out per 1M.
-
GPT-5.1 Chat released
OpenAI ships GPT-5.1 Chat at $1.25 in / $10.00 out per 1M.
-
Cogito v2.1 671B released
Deep Cogito ships Cogito v2.1 671B at $1.25 in / $1.25 out per 1M.
-
Kimi K2 Thinking released
Moonshot AI ships Kimi K2 Thinking at $0.6 in / $2.50 out per 1M.
-
Nova Premier 1.0 released
Amazon ships Nova Premier 1.0 at $2.50 in / $12.50 out per 1M.
-
Voxtral Small 24B 2507 released
Mistral ships Voxtral Small 24B 2507 at $0.1 in / $0.3 out per 1M.
-
Sonar Pro Search released
Perplexity ships Sonar Pro Search at $3.00 in / $15.00 out per 1M.
-
gpt-oss-safeguard-20b released
OpenAI ships gpt-oss-safeguard-20b at $0.075 in / $0.3 out per 1M.
-
Qwen3 VL 32B Instruct released
Alibaba ships Qwen3 VL 32B Instruct at $0.104 in / $0.416 out per 1M.
-
MiniMax M2 released
MiniMax ships MiniMax M2 at $0.255 in / $1.00 out per 1M.
-
Granite 4.0 Micro released
IBM ships Granite 4.0 Micro at $0.017 in / $0.112 out per 1M.
-
Phi 4 Mini Instruct released
Microsoft ships Phi 4 Mini Instruct at $0.08 in / $0.35 out per 1M.
-
GPT-5 Image Mini released
OpenAI ships GPT-5 Image Mini at $2.50 in / $2.00 out per 1M.
-
Claude Haiku 4.5 released
Anthropic ships Claude Haiku 4.5 at $1.00 in / $5.00 out per 1M.
-
Qwen3 VL 8B Thinking released
Alibaba ships Qwen3 VL 8B Thinking at $0.117 in / $1.36 out per 1M.
-
Qwen3 VL 8B Instruct released
Alibaba ships Qwen3 VL 8B Instruct at $0.08 in / $0.5 out per 1M.
-
GPT-5 Image released
OpenAI ships GPT-5 Image at $10.00 in / $10.00 out per 1M.
-
o4 Mini Deep Research released
OpenAI ships o4 Mini Deep Research at $2.00 in / $8.00 out per 1M.
-
o3 Deep Research released
OpenAI ships o3 Deep Research at $10.00 in / $40.00 out per 1M.
-
Llama 3.3 Nemotron Super 49B V1.5 released
NVIDIA ships Llama 3.3 Nemotron Super 49B V1.5 at $0.4 in / $0.4 out per 1M.
-
Nano Banana (Gemini 2.5 Flash Image) released
Google ships Nano Banana (Gemini 2.5 Flash Image) at $0.3 in / $2.50 out per 1M.
-
Qwen3 VL 30B A3B Thinking released
Alibaba ships Qwen3 VL 30B A3B Thinking at $0.13 in / $1.56 out per 1M.
-
Qwen3 VL 30B A3B Instruct released
Alibaba ships Qwen3 VL 30B A3B Instruct at $0.13 in / $0.52 out per 1M.
-
GPT-5 Pro released
OpenAI ships GPT-5 Pro at $15.00 in / $120.00 out per 1M.
-
GLM 4.6 released
Z.ai ships GLM 4.6 at $0.43 in / $1.74 out per 1M.
-
DeepSeek V3.2 Exp released
DeepSeek ships DeepSeek V3.2 Exp at $0.27 in / $0.41 out per 1M.
-
Claude Sonnet 4.5 released
Anthropic ships Claude Sonnet 4.5 at $3.00 in / $15.00 out per 1M.
-
Cydonia 24B V4.1 released
TheDrummer ships Cydonia 24B V4.1 at $0.3 in / $0.5 out per 1M.
-
Relace Apply 3 released
Relace ships Relace Apply 3 at $0.85 in / $1.25 out per 1M.
-
Gemini 2.5 Flash Lite Preview 09-2025 released
Google ships Gemini 2.5 Flash Lite Preview 09-2025 at $0.1 in / $0.4 out per 1M.
-
Qwen3 VL 235B A22B Thinking released
Alibaba ships Qwen3 VL 235B A22B Thinking at $0.26 in / $2.60 out per 1M.
-
Qwen3 VL 235B A22B Instruct released
Alibaba ships Qwen3 VL 235B A22B Instruct at $0.2 in / $0.88 out per 1M.
-
Qwen3 Max released
Alibaba ships Qwen3 Max at $0.36 in / $1.43 out per 1M.
-
Qwen3 Coder Plus released
Alibaba ships Qwen3 Coder Plus at $0.65 in / $3.25 out per 1M.
-
GPT-5 Codex released
OpenAI ships GPT-5 Codex at $1.25 in / $10.00 out per 1M.
-
DeepSeek V3.1 Terminus released
DeepSeek ships DeepSeek V3.1 Terminus at $0.27 in / $0.95 out per 1M.
-
Qwen3 Coder Flash released
Alibaba ships Qwen3 Coder Flash at $0.195 in / $0.975 out per 1M.
-
Qwen3 Next 80B A3B Thinking released
Alibaba ships Qwen3 Next 80B A3B Thinking at $0.0975 in / $0.78 out per 1M.
-
Qwen3 Next 80B A3B Instruct released
Alibaba ships Qwen3 Next 80B A3B Instruct at $0.09 in / $1.10 out per 1M.
-
Qwen Plus 0728 released
Alibaba ships Qwen Plus 0728 at $0.26 in / $0.78 out per 1M.
-
Qwen Plus 0728 (thinking) released
Alibaba ships Qwen Plus 0728 (thinking) at $0.26 in / $0.78 out per 1M.
-
Kimi K2 0905 released
Moonshot AI ships Kimi K2 0905 at $0.6 in / $2.50 out per 1M.
-
Qwen3 30B A3B Thinking 2507 released
Alibaba ships Qwen3 30B A3B Thinking 2507 at $0.08 in / $0.4 out per 1M.
-
Hermes 4 70B released
Nous ships Hermes 4 70B at $0.13 in / $0.4 out per 1M.
-
Hermes 4 405B released
Nous ships Hermes 4 405B at $1.00 in / $3.00 out per 1M.
-
DeepSeek V3.1 released
DeepSeek ships DeepSeek V3.1 at $0.21 in / $0.79 out per 1M.
-
Mistral Medium 3.1 released
Mistral ships Mistral Medium 3.1 at $0.4 in / $2.00 out per 1M.
-
GLM 4.5V released
Z.ai ships GLM 4.5V at $0.6 in / $1.80 out per 1M.
-
Jamba Large 1.7 released
AI21 ships Jamba Large 1.7 at $2.00 in / $8.00 out per 1M.
-
GPT-5 sparks price war
GPT-5 launches at commodity pricing ($1.25/$10) — TechCrunch calls it a price-war trigger.
-
GPT-5 released
OpenAI ships GPT-5 at $1.25 in / $10.00 out per 1M.
-
GPT-5 Nano released
OpenAI ships GPT-5 Nano at $0.05 in / $0.4 out per 1M.
-
GPT-5 Mini released
OpenAI ships GPT-5 Mini at $0.25 in / $2.00 out per 1M.
-
GPT-5 Chat released
OpenAI ships GPT-5 Chat at $1.25 in / $10.00 out per 1M.
-
gpt-oss-20b released
OpenAI ships gpt-oss-20b at $0.029 in / $0.14 out per 1M.
-
gpt-oss-120b released
OpenAI ships gpt-oss-120b at $0.039 in / $0.18 out per 1M.
-
Codestral 2508 released
Mistral ships Codestral 2508 at $0.3 in / $0.9 out per 1M.
-
Qwen3 Coder 30B A3B Instruct released
Alibaba ships Qwen3 Coder 30B A3B Instruct at $0.07 in / $0.27 out per 1M.
-
Qwen3 30B A3B Instruct 2507 released
Alibaba ships Qwen3 30B A3B Instruct 2507 at $0.0482 in / $0.193 out per 1M.
-
Qwen3 235B A22B Thinking 2507 released
Alibaba ships Qwen3 235B A22B Thinking 2507 at $0.1 in / $0.1 out per 1M.
-
GLM 4.5 released
Z.ai ships GLM 4.5 at $0.6 in / $2.20 out per 1M.
-
GLM 4.5 Air released
Z.ai ships GLM 4.5 Air at $0.13 in / $0.85 out per 1M.
-
Qwen3 Coder 480B A35B released
Alibaba ships Qwen3 Coder 480B A35B at $0.22 in / $1.80 out per 1M.
-
UI-TARS 7B released
ByteDance ships UI-TARS 7B at $0.1 in / $0.2 out per 1M.
-
Gemini 2.5 Flash Lite released
Google ships Gemini 2.5 Flash Lite at $0.1 in / $0.4 out per 1M.
-
Qwen3 235B A22B Instruct 2507 released
Alibaba ships Qwen3 235B A22B Instruct 2507 at $0.09 in / $0.1 out per 1M.
-
Switchpoint Router released
Switchpoint ships Switchpoint Router at $0.85 in / $3.40 out per 1M.
-
Kimi K2 0711 released
Moonshot AI ships Kimi K2 0711 at $0.57 in / $2.30 out per 1M.
-
Hunyuan A13B Instruct released
Tencent ships Hunyuan A13B Instruct at $0.14 in / $0.57 out per 1M.
-
Morph V3 Large released
Morph ships Morph V3 Large at $0.9 in / $1.90 out per 1M.
-
Morph V3 Fast released
Morph ships Morph V3 Fast at $0.8 in / $1.20 out per 1M.
-
ERNIE 4.5 VL 424B A47B released
Baidu ships ERNIE 4.5 VL 424B A47B at $0.42 in / $1.25 out per 1M.
-
Mistral Small 3.2 24B released
Mistral ships Mistral Small 3.2 24B at $0.075 in / $0.2 out per 1M.
-
MiniMax M1 released
MiniMax ships MiniMax M1 at $0.4 in / $2.20 out per 1M.
-
Gemini 2.5 Pro released
Google ships Gemini 2.5 Pro at $1.25 in / $10.00 out per 1M.
-
Gemini 2.5 Flash released
Google ships Gemini 2.5 Flash at $0.3 in / $2.50 out per 1M.
-
o3 price cut 80%
OpenAI slashes o3 by 80% ($10→$2/1M input), making frontier reasoning mainstream-affordable.
-
o3-pro released
OpenAI ships o3-pro at $20.00 in / $80.00 out per 1M.
-
o3 Pro released
OpenAI ships o3 Pro at $20.00 in / $80.00 out per 1M.
-
Gemini 2.5 Pro Preview 06-05 released
Google ships Gemini 2.5 Pro Preview 06-05 at $1.25 in / $10.00 out per 1M.
-
R1 0528 released
DeepSeek ships R1 0528 at $0.5 in / $2.15 out per 1M.
-
Gemma 3n 4B released
Google ships Gemma 3n 4B at $0.06 in / $0.12 out per 1M.
-
Mistral Medium 3 released
Mistral ships Mistral Medium 3 at $0.4 in / $2.00 out per 1M.
-
Gemini 2.5 Pro Preview 05-06 released
Google ships Gemini 2.5 Pro Preview 05-06 at $1.25 in / $10.00 out per 1M.
-
Virtuoso Large released
Arcee AI ships Virtuoso Large at $0.75 in / $1.20 out per 1M.
-
Coder Large released
Arcee AI ships Coder Large at $0.5 in / $0.8 out per 1M.
-
Llama Guard 4 12B released
Meta ships Llama Guard 4 12B at $0.18 in / $0.18 out per 1M.
-
Qwen3 8B released
Alibaba ships Qwen3 8B at $0.05 in / $0.4 out per 1M.
-
Qwen3 32B released
Alibaba ships Qwen3 32B at $0.08 in / $0.28 out per 1M.
-
Qwen3 30B A3B released
Alibaba ships Qwen3 30B A3B at $0.12 in / $0.5 out per 1M.
-
Qwen3 235B A22B released
Alibaba ships Qwen3 235B A22B at $0.455 in / $1.82 out per 1M.
-
Qwen3 14B released
Alibaba ships Qwen3 14B at $0.1 in / $0.24 out per 1M.
-
o4-mini released
OpenAI ships o4-mini at $1.10 in / $4.40 out per 1M.
-
o4 Mini High released
OpenAI ships o4 Mini High at $1.10 in / $4.40 out per 1M.
-
o3 released
OpenAI ships o3 at $2.00 in / $8.00 out per 1M.
-
Long-context goes cheap
GPT-4.1 lands a 1M-token window at mid-tier pricing, matching Gemini on context economics.
-
GPT-4.1 released
OpenAI ships GPT-4.1 at $2.00 in / $8.00 out per 1M.
-
GPT-4.1 Nano released
OpenAI ships GPT-4.1 Nano at $0.1 in / $0.4 out per 1M.
-
GPT-4.1 Mini released
OpenAI ships GPT-4.1 Mini at $0.4 in / $1.60 out per 1M.
-
Llama 4 Scout released
Meta ships Llama 4 Scout at $0.08 in / $0.3 out per 1M.
-
Llama 4 Maverick released
Meta ships Llama 4 Maverick at $0.15 in / $0.6 out per 1M.
-
DeepSeek V3 0324 released
DeepSeek ships DeepSeek V3 0324 at $0.2 in / $0.77 out per 1M.
-
o1-pro released
OpenAI ships o1-pro at $150.00 in / $600.00 out per 1M.
-
Mistral Small 3.1 24B released
Mistral ships Mistral Small 3.1 24B at $0.351 in / $0.555 out per 1M.
-
Gemma 3 4B released
Google ships Gemma 3 4B at $0.05 in / $0.1 out per 1M.
-
Gemma 3 12B released
Google ships Gemma 3 12B at $0.05 in / $0.15 out per 1M.
-
Command A released
Cohere ships Command A at $2.50 in / $10.00 out per 1M.
-
Reka Flash 3 released
Rekaai ships Reka Flash 3 at $0.1 in / $0.2 out per 1M.
-
Gemma 3 27B released
Google ships Gemma 3 27B at $0.08 in / $0.16 out per 1M.
-
GPT-4o-mini Search Preview released
OpenAI ships GPT-4o-mini Search Preview at $0.15 in / $0.6 out per 1M.
-
GPT-4o Search Preview released
OpenAI ships GPT-4o Search Preview at $2.50 in / $10.00 out per 1M.
-
Skyfall 36B V2 released
TheDrummer ships Skyfall 36B V2 at $0.55 in / $0.8 out per 1M.
-
Sonar Reasoning Pro released
Perplexity ships Sonar Reasoning Pro at $2.00 in / $8.00 out per 1M.
-
Sonar Pro released
Perplexity ships Sonar Pro at $3.00 in / $15.00 out per 1M.
-
Sonar Deep Research released
Perplexity ships Sonar Deep Research at $2.00 in / $8.00 out per 1M.
-
GPT-4.5: ultra-premium experiment
OpenAI tests $75/$150 pricing with GPT-4.5, at launch the most expensive API model it had offered. It was quickly superseded by cheaper, better models.
-
Saba released
Mistral ships Saba at $0.2 in / $0.6 out per 1M.
-
o3 Mini High released
OpenAI ships o3 Mini High at $1.10 in / $4.40 out per 1M.
-
Llama Guard 3 8B released
Meta ships Llama Guard 3 8B at $0.484 in / $0.03 out per 1M.
-
Aion-RP 1.0 (8B) released
AionLabs ships Aion-RP 1.0 (8B) at $0.8 in / $1.60 out per 1M.
-
Aion-1.0-Mini released
AionLabs ships Aion-1.0-Mini at $0.7 in / $1.40 out per 1M.
-
Aion-1.0 released
AionLabs ships Aion-1.0 at $4.00 in / $8.00 out per 1M.
-
Qwen2.5 VL 72B Instruct released
Alibaba ships Qwen2.5 VL 72B Instruct at $0.8 in / $1.00 out per 1M.
-
Qwen-Plus released
Alibaba ships Qwen-Plus at $0.26 in / $0.78 out per 1M.
-
Mistral Small 3 released
Mistral ships Mistral Small 3 at $0.05 in / $0.08 out per 1M.
-
R1 Distill Qwen 32B released
DeepSeek ships R1 Distill Qwen 32B at $0.29 in / $0.29 out per 1M.
-
Sonar released
Perplexity ships Sonar at $1.00 in / $1.00 out per 1M.
-
R1 Distill Llama 70B released
DeepSeek ships R1 Distill Llama 70B at $0.8 in / $0.8 out per 1M.
-
The DeepSeek moment
DeepSeek R1 ships near-frontier reasoning at ~1/20th the price. Markets jolt; pricing pressure spikes industry-wide.
-
R1 released
DeepSeek ships R1 at $0.7 in / $2.50 out per 1M.
-
DeepSeek R1 released
DeepSeek ships DeepSeek R1 at $0.28 in / $0.42 out per 1M.
-
MiniMax-01 released
MiniMax ships MiniMax-01 at $0.2 in / $1.10 out per 1M.
-
Phi 4 released
Microsoft ships Phi 4 at $0.07 in / $0.14 out per 1M.
-
Llama 3.1 70B Hanami x1 released
Sao10K ships Llama 3.1 70B Hanami x1 at $3.00 in / $3.00 out per 1M.
2024
-
DeepSeek V3: frontier for cents
DeepSeek releases a GPT-4o-class open model priced around $0.27/$1.10 per 1M, previewing the shock R1 would deliver a month later.
-
DeepSeek V3 released
DeepSeek ships DeepSeek V3 at $0.28 in / $0.42 out per 1M.
-
Llama 3.3 Euryale 70B released
Sao10K ships Llama 3.3 Euryale 70B at $0.65 in / $0.75 out per 1M.
-
Command R7B (12-2024) released
Cohere ships Command R7B (12-2024) at $0.0375 in / $0.15 out per 1M.
-
Llama 3.3 70B Instruct released
Meta ships Llama 3.3 70B Instruct at $0.1 in / $0.32 out per 1M.
-
Nova Pro 1.0 released
Amazon ships Nova Pro 1.0 at $0.8 in / $3.20 out per 1M.
-
Nova Micro 1.0 released
Amazon ships Nova Micro 1.0 at $0.035 in / $0.14 out per 1M.
-
Nova Lite 1.0 released
Amazon ships Nova Lite 1.0 at $0.06 in / $0.24 out per 1M.
-
GPT-4o (2024-11-20) released
OpenAI ships GPT-4o (2024-11-20) at $2.50 in / $10.00 out per 1M.
-
Mistral Large 2407 released
Mistral ships Mistral Large 2407 at $2.00 in / $6.00 out per 1M.
-
Qwen2.5 Coder 32B Instruct released
Alibaba ships Qwen2.5 Coder 32B Instruct at $0.66 in / $1.00 out per 1M.
-
UnslopNemo 12B released
TheDrummer ships UnslopNemo 12B at $0.4 in / $0.4 out per 1M.
-
Magnum v4 72B released
Anthracite Org ships Magnum v4 72B at $3.00 in / $5.00 out per 1M.
-
Qwen2.5 7B Instruct released
Alibaba ships Qwen2.5 7B Instruct at $0.04 in / $0.1 out per 1M.
-
Inflection 3 Productivity released
Inflection ships Inflection 3 Productivity at $2.50 in / $10.00 out per 1M.
-
Inflection 3 Pi released
Inflection ships Inflection 3 Pi at $2.50 in / $10.00 out per 1M.
-
OpenAI caching: auto 50% off
DevDay brings automatic prompt caching — a no-code 50% discount on recently seen input tokens across GPT-4o and o1 models.
-
Gemini 1.5 Pro cut 64%
Google cuts Gemini 1.5 Pro to $1.25/$5.00 per 1M on prompts under 128K — the flagship tier joins the price war.
-
Rocinante 12B released
TheDrummer ships Rocinante 12B at $0.25 in / $0.5 out per 1M.
-
Llama 3.2 3B Instruct released
Meta ships Llama 3.2 3B Instruct at $0.0509 in / $0.335 out per 1M.
-
Llama 3.2 1B Instruct released
Meta ships Llama 3.2 1B Instruct at $0.027 in / $0.201 out per 1M.
-
Llama 3.2 11B Vision Instruct released
Meta ships Llama 3.2 11B Vision Instruct at $0.345 in / $0.345 out per 1M.
-
Qwen2.5 72B Instruct released
Alibaba ships Qwen2.5 72B Instruct at $0.36 in / $0.4 out per 1M.
-
o1 creates reasoning price tier
OpenAI's o1-preview launches at $15/$60 — a new pricing category where chain-of-thought tokens make effective costs 2–5× the list price.
-
Command R+ (08-2024) released
Cohere ships Command R+ (08-2024) at $2.50 in / $10.00 out per 1M.
-
Command R (08-2024) released
Cohere ships Command R (08-2024) at $0.15 in / $0.6 out per 1M.
-
Llama 3.1 Euryale 70B v2.2 released
Sao10K ships Llama 3.1 Euryale 70B v2.2 at $0.85 in / $0.85 out per 1M.
-
Hermes 3 70B Instruct released
Nous ships Hermes 3 70B Instruct at $0.7 in / $0.7 out per 1M.
-
Hermes 3 405B Instruct released
Nous ships Hermes 3 405B Instruct at $1.00 in / $1.00 out per 1M.
-
Anthropic ships prompt caching
Cached input tokens cost 90% less on Claude, making long-system-prompt and RAG workloads dramatically cheaper.
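How much a request actually saves depends on what share of its input is served from cache. A rough sketch of the blended input cost, under stated assumptions: the function name `input_cost_usd` is hypothetical, the $3.00 per 1M input price is illustrative, and any surcharge for writing to the cache is ignored.

```python
def input_cost_usd(
    total_tokens: int,
    cached_tokens: int,
    price_per_million: float,
    cached_discount: float,
) -> float:
    """Blended input cost when some tokens are billed at a cached rate.

    cached_discount is the fraction taken off the list price for cached
    tokens (0.90 for the 90% figure above). Cache-write surcharges are
    deliberately left out of this simplified model.
    """
    if not 0 <= cached_tokens <= total_tokens:
        raise ValueError("cached_tokens must be between 0 and total_tokens")
    if not 0.0 <= cached_discount <= 1.0:
        raise ValueError("cached_discount must be between 0 and 1")
    uncached_tokens = total_tokens - cached_tokens
    cached_price = price_per_million * (1.0 - cached_discount)
    total = uncached_tokens * price_per_million + cached_tokens * cached_price
    return total / 1_000_000


# Assumed workload: a 100K-token prompt where 80K tokens hit the cache.
with_cache = input_cost_usd(100_000, 80_000, 3.00, 0.90)
without_cache = input_cost_usd(100_000, 0, 3.00, 0.90)

print(f"with cache: ${with_cache:.3f}, without: ${without_cache:.3f}")
```

Passing `cached_discount=0.50` instead models a 50% cached-input discount in the same way.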
-
Llama 3 8B Lunaris released
Sao10K ships Llama 3 8B Lunaris at $0.04 in / $0.05 out per 1M.
-
Google slashes Flash 78%
Gemini 1.5 Flash drops 78% on input / 71% on output to $0.075/$0.30 per 1M, undercutting GPT-4o mini by half.
-
OpenAI cuts GPT-4o 50%
GPT-4o input drops $5→$2.50/1M and prompt caching arrives: the first big shot in the frontier price war.
-
GPT-4o (2024-08-06) released
OpenAI ships GPT-4o (2024-08-06) at $2.50 in / $10.00 out per 1M.
-
Llama 3.1 405B opens frontier
Meta releases a GPT-4-class model as open weights; hosted 405B undercuts proprietary frontier pricing and anchors expectations lower.
-
Llama 3.1 8B Instruct released
Meta ships Llama 3.1 8B Instruct at $0.02 in / $0.03 out per 1M.
-
Llama 3.1 70B Instruct released
Meta ships Llama 3.1 70B Instruct at $0.4 in / $0.4 out per 1M.
-
Mistral Nemo released
Mistral ships Mistral Nemo at $0.02 in / $0.03 out per 1M.
-
GPT-4o mini at $0.15
OpenAI replaces GPT-3.5 Turbo with GPT-4o mini at $0.15/$0.60 per 1M — over 60% cheaper and far more capable.
-
GPT-4o-mini (2024-07-18) released
OpenAI ships GPT-4o-mini (2024-07-18) at $0.15 in / $0.6 out per 1M.
-
Gemma 2 27B released
Google ships Gemma 2 27B at $0.65 in / $0.65 out per 1M.
-
Claude 3.5 Sonnet resets value
Anthropic ships a model beating Opus at one-fifth Opus's price ($3/$15), collapsing the gap between mid-tier price and frontier quality.
-
China's LLM price war erupts
After DeepSeek-V2 launched at ~$0.14/1M input on May 6, Alibaba cuts Qwen prices up to 97% and Baidu makes Ernie Speed/Lite free hours later.
-
GPT-4o: frontier at half price
GPT-4o launches at $5/$15 per 1M — half of GPT-4 Turbo's price with better performance, resetting the frontier price bar.
-
GPT-4o (2024-05-13) released
OpenAI ships GPT-4o (2024-05-13) at $5.00 in / $15.00 out per 1M.
-
Llama 3 8B Instruct released
Meta ships Llama 3 8B Instruct at $0.14 in / $0.14 out per 1M.
-
Llama 3 70B Instruct released
Meta ships Llama 3 70B Instruct at $0.51 in / $0.74 out per 1M.
-
Mixtral 8x22B Instruct released
Mistral ships Mixtral 8x22B Instruct at $2.00 in / $6.00 out per 1M.
-
WizardLM-2 8x22B released
Microsoft ships WizardLM-2 8x22B at $0.62 in / $0.62 out per 1M.
-
Command R Plus released
Cohere ships Command R Plus at $2.50 in / $10.00 out per 1M.
-
Command R released
Cohere ships Command R at $0.15 in / $0.6 out per 1M.
-
Claude 3 brings $0.25 Haiku
The Claude 3 family ships with Haiku at $0.25/$1.25 per 1M, staking out the fast-and-cheap tier against GPT-3.5 Turbo.
-
Gemini 1.5 Pro: 1M context
Google announces a 1M-token context window, an order of magnitude beyond rivals — long context becomes a $/token battleground.
-
OpenAI cuts GPT-3.5 Turbo 50%
Third GPT-3.5 Turbo cut in a year: input drops 50% to $0.50/1M, output 25% to $1.50. New embedding models arrive 5x cheaper too.
-
GPT-4 Turbo Preview released
OpenAI ships GPT-4 Turbo Preview at $10.00 in / $30.00 out per 1M.
-
GPT-3.5 Turbo (older v0613) released
OpenAI ships GPT-3.5 Turbo (older v0613) at $1.00 in / $2.00 out per 1M.
2023
-
Mixtral 8x7B goes open-weight
Mistral releases Mixtral 8x7B under Apache 2.0 — GPT-3.5-class quality anyone can host, setting a price floor under proprietary small models.
-
GPT-4 Turbo: 3× cheaper
GPT-4 Turbo launches at $10/$30 with 128K context, cutting the frontier input price by two-thirds (output by half) and kicking off a year of rapid cuts.
-
GPT-3.5 Turbo Instruct released
OpenAI ships GPT-3.5 Turbo Instruct at $1.50 in / $2.00 out per 1M.
-
GPT-3.5 Turbo 16k released
OpenAI ships GPT-3.5 Turbo 16k at $3.00 in / $4.00 out per 1M.
-
Weaver (alpha) released
Mancer ships Weaver (alpha) at $0.75 in / $1.00 out per 1M.
-
ReMM SLERP 13B released
Undi95 ships ReMM SLERP 13B at $0.45 in / $0.65 out per 1M.
-
MythoMax 13B released
Gryphe ships MythoMax 13B at $0.06 in / $0.06 out per 1M.
-
GPT-3.5 Turbo released
OpenAI ships GPT-3.5 Turbo at $0.5 in / $1.50 out per 1M.
-
GPT-4 sets the frontier price
GPT-4 launches at $30/$60 per 1M, 10× the cost of GPT-3.5 Turbo, establishing the first frontier pricing baseline.