Llama 4 Scout

Meta · Released Apr 2025

Meta's smaller open-weight MoE model (17B active / 109B total). Supports up to 10M context on some providers.

Strengths: Smaller open MoE supporting up to a 10M-token context on some hosts.
Best for: Extreme long-context tasks and budget self-hosting.
Limitations: Lighter than Maverick; hosted pricing and context limits vary by provider.

Input / 1M

$0.08

Output / 1M

Cached input / 1M

Context window

Price history

Effective	Input	Output	Cached in	Note	Source
5 Apr 2025	$0.08	$0.3	—	Representative hosted rate (DeepInfra); varies by provider	pricepertoken.com

Muse Spark 1.1

in $1.25 · out $4.25

Llama Guard 4 12B

in $0.18 · out $0.18

Llama 4 Maverick

in $0.15 · out $0.6

Llama Guard 3 8B

in $0.484 · out $0.03

Data updated Jun 11, 2026 Report a problem