Mistral Small 4

Mistral · Small


Fast, cost-effective model for high-volume and latency-sensitive workloads. Launched at $0.15 input / $0.60 output per 1M tokens, later cut to $0.10 / $0.30.

Strengths
Fast, cheap model for high-volume and latency-sensitive work.
Best for
Scaled text processing, routing and simple assistants.
Limitations
Small-tier quality; not for complex reasoning.

Input / 1M

$0.10

Output / 1M

$0.30

Cached input / 1M

Context window

262K
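As a rough illustration of what these rates mean in practice, the sketch below estimates the cost of a workload from its token counts. The prices are the ones listed above; the function name `workload_cost` and the example workload (1M requests at 800 input and 200 output tokens each) are hypothetical, and cached-input pricing is ignored because no value is listed here.

```python
# Listed prices, in USD per 1M tokens (from the cards above).
INPUT_PER_MTOK = 0.10
OUTPUT_PER_MTOK = 0.30


def workload_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost for the given token counts.

    Ignores cached-input discounts, since no cached price is listed.
    """
    return (
        input_tokens * INPUT_PER_MTOK + output_tokens * OUTPUT_PER_MTOK
    ) / 1_000_000


# Hypothetical workload: 1M requests, each 800 input + 200 output tokens.
requests = 1_000_000
total = workload_cost(800 * requests, 200 * requests)
print(f"${total:.2f}")  # $140.00
```

Under these assumptions the input side costs $80 and the output side $60, so output accounts for a large share of the bill even though it is only a fifth of the tokens.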

Where it sits

Cheapest small model

by blended $/Mtok among 5 listed small models

79% below the small median

blended $/Mtok across 5 small models

Output costs 3× input

$0.10 in / $0.30 out per 1M tokens

Down 43% since launch (Mar 2026)

blended price vs launch

Computed live from current prices and this model's price history rather than hand-written, so it stays accurate as prices move.

Price history

[Chart: price per 1M tokens over time; input shown solid, output dashed. Between 16 Mar 2026 and 3 Jun 2026, output went from $0.60 to $0.30 and input went from $0.15 to $0.10.]

Blended price change (3:1 I/O mix)

30d: decreased 42.9%
90d: decreased 42.9%
1y: decreased 42.9%
Since launch: decreased 42.9%
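The 42.9% figure can be reproduced from the two snapshots. The sketch below assumes the "3:1 I/O mix" means three parts input to one part output by token volume, which is the reading that matches the number shown; the function name `blended_price` is illustrative and not taken from the page.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3, output_weight: float = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Snapshot prices in USD per 1M tokens.
launch = blended_price(0.15, 0.60)   # 16 Mar 2026 -> about 0.2625
current = blended_price(0.10, 0.30)  # 3 Jun 2026  -> about 0.15

change_pct = (current - launch) / launch * 100
print(round(change_pct, 1))  # -42.9
```

The windows all show the same value because this history contains a single price change, and it falls inside each of them.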

Snapshots

Effective | Input | Output | Cached in | Note | Source
3 Jun 2026 | $0.10 | $0.30 | - | Price cut to $0.10/$0.30 confirmed on Wayback capture 2026-06-03; exact cut date cannot be pinned down (launch-era mistral.ai/pricing captures are JS shells), bracketed [2026-03-16, 2026-06-03] | mistral.ai/pricing
16 Mar 2026 | $0.15 | $0.60 | - | Launch pricing (corroborated by 2026 pricing trackers) | mistral.ai/pricing
