Llama 4 Scout
Meta · Mid
Meta's smaller open-weight MoE model (17B active / 109B total). Supports up to 10M context on some providers.
- Strengths
- Smaller open MoE supporting up to a 10M-token context on some hosts.
- Best for
- Extreme long-context tasks and budget self-hosting.
- Limitations
- Lighter than Maverick; hosted pricing and context limits vary by provider.
Input / 1M
$0.08
Output / 1M
$0.3
Cached input / 1M
—
Context window
1M
Where it sits
41st-cheapest mid model
by blended $/Mtok among 262 listed mid models
76% below the mid median
blended $/Mtok across 262 mid models
Output costs 3.8× input
$0.08 in / $0.3 out per 1M
Held flat since launch (Apr 2025)
no blended price change recorded
Computed live from current prices and this model's history — not hand-written, so it stays accurate as prices move.
Price history
Only one price on record so far — the history chart appears once a price changes.
Snapshots
| Effective | Input | Output | Cached in | Note | Source |
|---|---|---|---|---|---|
| 5 Apr 2025 | $0.08 | $0.3 | — | Representative hosted rate (DeepInfra); varies by provider | pricepertoken.com |