NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

Rank	Unranked
Context Window	128K
Min Input /1M	$0.10
Min Output /1M	$0.40

Model Info

Model ID	nvidia-llama-3-3-nemotron-super-49b-v1-5
Modality	text->text
Context Window	128K tokens
Platforms	OR

Usage & Rankings

Rank	Unranked
Weekly Usage	—

Cross-Platform Pricing

Platform	Input /1M	Output /1M	Context	Model ID
OpenRouter (OR)	$0.100	$0.400	128K	nvidia/llama-3.3-nemotron-super-49b-v1.5

Input	$0.100 / 1M tokens
Output	$0.400 / 1M tokens
Round Trip	$0.50 (1M input tokens + 1M output tokens)
Value Rating	🟢 Economy: great for high volume

About NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 is a large language model with a 128K-token context window.

The best available price is $0.100 per 1M input tokens and $0.400 per 1M output tokens, so a round trip of 1M input tokens plus 1M output tokens costs $0.50.
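The listed rates can be turned into a rough per-request estimate. The sketch below is illustrative only: the function name `estimate_cost_usd` and the constant names are hypothetical, and it assumes the cost is simply linear in token counts at the rates quoted above, with no minimum charges, caching discounts, or platform fees.

```python
from decimal import Decimal

# Rates copied from the pricing table above (USD per 1M tokens).
INPUT_USD_PER_MTOK = Decimal("0.10")
OUTPUT_USD_PER_MTOK = Decimal("0.40")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming purely linear pricing.

    Hypothetical helper: real bills may differ (rounding, fees, discounts).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_MTOK
        + Decimal(output_tokens) * OUTPUT_USD_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # The "round trip" figure: 1M input tokens plus 1M output tokens.
    print(estimate_cost_usd(1_000_000, 1_000_000))
    # A long-context request: 100K tokens in, 2K tokens out.
    print(estimate_cost_usd(100_000, 2_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when summed over many requests.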