Llama 3.3 Nemotron Super 49B V1.5

NVIDIA · Mid

Compare →

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

Input / 1M

$0.4

Output / 1M

$0.4

Cached input / 1M

Context window

131K

Where it sits

104th-cheapest mid model

by blended $/Mtok among 262 listed mid models

30% below the mid median

blended $/Mtok across 262 mid models

Held flat since launch (Jun 2026)

no blended price change recorded

Computed live from current prices and this model's history — not hand-written, so it stays accurate as prices move.

Price history

Only one price on record so far — the history chart appears once a price changes.

Snapshots

Effective Input Output Cached in Note Source
11 Jun 2026 $0.4 $0.4 Imported from OpenRouter openrouter.ai

More from NVIDIA