Llama 3.3 Nemotron Super 49B V1.5

NVIDIA · Released Oct 2025

A 49B-parameter model optimized for reasoning and agentic workflows, derived from Llama 3.3 and supporting 131K token context.

Strengths: Handles structured tasks like tool calling, code generation, and RAG workflows effectively through post-training on reasoning and function-calling patterns.
Best for: Building agents and applications that require tool use, code execution, retrieval-augmented generation, or multi-step reasoning across a large document context.
Limitations: Designed primarily for English and reasoning tasks; may not perform as well on creative writing, translation, or non-English languages compared to its larger parent model.

Input / 1M

Output / 1M

Cached input / 1M

Context window

Price history

Effective	Input	Output	Cached in	Note	Source
11 Jun 2026	$0.4	$0.4	—	Imported from OpenRouter	openrouter.ai

Nemotron 3 Ultra

in $0.5 · out $2.20

Nemotron 3 Super

in $0.085 · out $0.4

Nemotron 3 Nano 30B A3B

in $0.05 · out $0.2

Data updated Jun 29, 2026 Report a problem