Mistral Small 4
Mistral · Small
Fast, cost-effective model for high-volume and latency-sensitive workloads. Launched at $0.15/$0.60 per 1M input/output tokens, later cut to $0.10/$0.30.
- Strengths: Fast, cheap model for high-volume and latency-sensitive work.
- Best for: Scaled text processing, routing, and simple assistants.
- Limitations: Small-tier quality; not suited to complex reasoning.
- Input / 1M tokens: $0.10
- Output / 1M tokens: $0.30
- Cached input / 1M tokens: —
- Context window: 262K tokens
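As a rough illustration of what these rates mean for a workload, the sketch below estimates spend from token counts. The function name `estimate_cost_usd` and the example volumes are hypothetical; only the $0.10 and $0.30 per-1M rates come from the listing above. No cached-input rate is listed, so caching is ignored.

```python
# Per-1M-token rates from the listing above (USD).
INPUT_PER_M = 0.10
OUTPUT_PER_M = 0.30


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate spend in USD for a given number of input and output tokens."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000


# Hypothetical workload: 1M requests, each with 600 input and 200 output tokens.
requests = 1_000_000
cost = estimate_cost_usd(requests * 600, requests * 200)
print(f"${cost:.2f}")
```

In this made-up workload the input and output halves each come to about $60, which shows how the 3× output rate offsets a 3:1 input-heavy token mix.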
Where it sits
- Cheapest small model, by blended $/Mtok among 5 listed small models
- 79% below the small-model median (blended $/Mtok across 5 small models)
- Output costs 3× input ($0.10 in / $0.30 out per 1M tokens)
- Down 43% since launch in Mar 2026 (blended price vs launch)
These figures are computed live from current prices and this model's price history rather than hand-written, so they stay accurate as prices move.
Price history
[Chart: input price (solid line) and output price (dashed line) over time]
Blended price change (3:1 I/O mix)
- 30d: decreased 42.9%
- 90d: decreased 42.9%
- 1y: decreased 42.9%
- Since launch: decreased 42.9%
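The 42.9% figure can be reproduced from the launch and current prices, assuming the 3:1 I/O mix means three input tokens for every output token, so that blended = (3 × input + output) / 4. That reading is inferred from the numbers rather than stated on the page, and the function names below are hypothetical.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average $/1M tokens for an assumed input:output token mix."""
    total = input_weight + output_weight
    return (input_weight * input_per_m + output_weight * output_per_m) / total


def percent_change(old: float, new: float) -> float:
    """Signed percentage change from old to new (negative means a decrease)."""
    return (new - old) / old * 100.0


launch = blended_price(0.15, 0.60)   # launch pricing: $0.15 in / $0.60 out
current = blended_price(0.10, 0.30)  # current pricing: $0.10 in / $0.30 out
change = percent_change(launch, current)
print(f"launch={launch:.4f} current={current:.4f} change={change:.1f}%")
```

Under that assumption the blended price falls from about $0.2625 to $0.15 per 1M tokens, a decrease of roughly 42.9%. Because only one price change is recorded, every lookback window that spans the cut shows the same percentage.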
Snapshots
| Effective | Input / 1M | Output / 1M | Cached input / 1M | Note | Source |
|---|---|---|---|---|---|
| 3 Jun 2026 | $0.10 | $0.30 | — | Price cut to $0.10/$0.30 confirmed on Wayback capture 2026-06-03; the exact cut date could not be pinned down (launch-era mistral.ai/pricing captures are JS shells), so it is bracketed to [2026-03-16, 2026-06-03] | mistral.ai/pricing |
| 16 Mar 2026 | $0.15 | $0.60 | — | Launch pricing (corroborated by 2026 pricing trackers) | mistral.ai/pricing |