prices synced 2026-07-01

Market events

Every market event and model launch we've tracked — 30 market events and 301 launches, going back to 2023.

2026

  1. Market

    DeepSeek V4 adds peak-valley pricing

    DeepSeek V4 introduces time-based peak/off-peak pricing, varying API token costs by demand.

    Source
  2. Market

    DeepSeek V4 Pro 75% cut

    DeepSeek makes a 75% promotional discount permanent, pricing V4 Pro at $0.435/$0.87.

  3. Market

    Cheap Flash era ends

    Gemini 3.5 Flash ships at $1.50/$9 per 1M — triple its predecessor — as Google reprices Flash from budget tier toward Pro territory.

  4. Market

    GPT-5.5 raises frontier prices

    GPT-5.5 launches at $5/$30 per 1M, a sharp step up from GPT-5's commodity pricing — frontier labs begin testing price tolerance.

2025

  1. Market

    Mistral Large 3: 75% cheaper

    Mistral's open-weight frontier flagship lands at $0.50/$1.50 per 1M — 75% below Large 2 — keeping open-model pressure on closed pricing.

  2. Market

    Opus gets 67% cheaper

    Anthropic drops Opus pricing from $15/$75 to $5/$25 with the Opus 4.5 release.

  3. Market

    Qwen3 Max halved in price war

    Alibaba cuts Qwen3 Max roughly 50% as China's AI price war reignites, pressuring domestic and global rivals alike.

  4. Market

    GPT-5 sparks price war

    GPT-5 launches at commodity pricing ($1.25/$10) — TechCrunch calls it a price-war trigger.

  5. Market

    o3 price cut 80%

    OpenAI slashes o3 by 80% ($10→$2/1M input), making frontier reasoning mainstream-affordable.

  6. Market

    Long-context goes cheap

    GPT-4.1 lands a 1M-token window at mid-tier pricing, matching Gemini on context economics.

  7. Market

    GPT-4.5: ultra-premium experiment

    OpenAI tests $75/$150 pricing with GPT-4.5 — the most expensive API model ever offered. Quickly superseded by cheaper, better models.

  8. Market

    The DeepSeek moment

    DeepSeek R1 ships near-frontier reasoning at ~1/20th the price. Markets jolt; pricing pressure spikes industry-wide.

2024

  1. Market

    DeepSeek V3: frontier for cents

    DeepSeek releases a GPT-4o-class open model priced around $0.27/$1.10 per 1M, previewing the shock R1 would deliver a month later.

  2. Market

    OpenAI caching: auto 50% off

    DevDay brings automatic prompt caching — a no-code 50% discount on recently seen input tokens across GPT-4o and o1 models.

  3. Market

    Gemini 1.5 Pro cut 64%

    Google cuts Gemini 1.5 Pro to $1.25/$5.00 per 1M on prompts under 128K — the flagship tier joins the price war.

  4. Market

    o1 creates reasoning price tier

    OpenAI's o1-preview launches at $15/$60 — a new pricing category where chain-of-thought tokens make effective costs 2–5× the list price.

  5. Market

    Anthropic ships prompt caching

    Cached input tokens cost 90% less on Claude, making long-system-prompt and RAG workloads dramatically cheaper.

  6. Market

    Google slashes Flash 78%

    Gemini 1.5 Flash drops 78% on input / 71% on output to $0.075/$0.30 per 1M, undercutting GPT-4o mini by half.

  7. Market

    OpenAI cuts GPT-4o 50%

    GPT-4o input drops $5→$2.50/1M and prompt caching arrives — the first big frontier price war shot.

  8. Market

    Llama 3.1 405B opens frontier

    Meta releases a GPT-4-class model as open weights; hosted 405B undercuts proprietary frontier pricing and anchors expectations lower.

  9. Market

    GPT-4o mini at $0.15

    OpenAI replaces GPT-3.5 Turbo with GPT-4o mini at $0.15/$0.60 per 1M — over 60% cheaper and far more capable.

  10. Market

    Claude 3.5 Sonnet resets value

    Anthropic ships a model beating Opus at one-fifth Opus's price ($3/$15), collapsing the gap between mid-tier price and frontier quality.

  11. Market

    China's LLM price war erupts

    After DeepSeek-V2 launched at ~$0.14/1M input on May 6, Alibaba cuts Qwen prices up to 97% and Baidu makes Ernie Speed/Lite free hours later.

  12. Market

    GPT-4o: frontier at half price

    GPT-4o launches at $5/$15 per 1M — half of GPT-4 Turbo's price with better performance, resetting the frontier price bar.

  13. Market

    Claude 3 brings $0.25 Haiku

    The Claude 3 family ships with Haiku at $0.25/$1.25 per 1M, staking out the fast-and-cheap tier against GPT-3.5 Turbo.

  14. Market

    Gemini 1.5 Pro: 1M context

    Google announces a 1M-token context window, an order of magnitude beyond rivals — long context becomes a $/token battleground.

  15. Market

    OpenAI cuts GPT-3.5 Turbo 50%

    Third GPT-3.5 Turbo cut in a year: input drops 50% to $0.50/1M, output 25% to $1.50. New embedding models arrive 5x cheaper too.

2023

  1. Market

    Mixtral 8x7B goes open-weight

    Mistral releases Mixtral 8x7B under Apache 2.0 — GPT-3.5-class quality anyone can host, setting a price floor under proprietary small models.

  2. Market

    GPT-4 Turbo: 3× cheaper

    GPT-4 Turbo launches at $10/$30 with 128K context, cutting the frontier price by two-thirds and kicking off a year of rapid cuts.

  3. Market

    GPT-4 sets the frontier price

    GPT-4 launches at $30/$60 per MTok — 10× the cost of GPT-3.5 Turbo, establishing the first frontier pricing baseline.

No more events.