nvidia

NVIDIA: Nemotron 3 Ultra

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Quality Score
98/100
price + capability + benchmarks
Input Price
$0.50
per 1M tokens
Output Price
$2.50
per 1M tokens
Context Window
1,000,000
tokens
Model ID
nvidia/nemotron-3-ultra-550b-a55b
Vendor
nvidia
Tokenizer
Other
Input Modalities
text
Output Modalities
text
Max Output
16,384 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
✓ supported
Vision
text only
Audio
no
Moderated
no

Similar models