Llama 3.3 Nemotron Super 49B V1.5
NVIDIA · Mid
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...
Input / 1M
$0.4
Output / 1M
$0.4
Cached input / 1M
—
Context window
131K
Where it sits
104th-cheapest mid model
by blended $/Mtok among 262 listed mid models
30% below the mid median
blended $/Mtok across 262 mid models
Held flat since launch (Jun 2026)
no blended price change recorded
Computed live from current prices and this model's history — not hand-written, so it stays accurate as prices move.
Price history
Only one price on record so far — the history chart appears once a price changes.
Snapshots
| Effective | Input | Output | Cached in | Note | Source |
|---|---|---|---|---|---|
| 11 Jun 2026 | $0.4 | $0.4 | — | Imported from OpenRouter | openrouter.ai |