nvidia
NVIDIA: Nemotron 3 Ultra
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
Quality Score
98/100
price + capability + benchmarks
Input Price
$0.50
per 1M tokens
Output Price
$2.50
per 1M tokens
Context Window
1,000,000
tokens
- Model ID
- nvidia/nemotron-3-ultra-550b-a55b
- Vendor
- nvidia
- Tokenizer
- Other
- Input Modalities
- text
- Output Modalities
- text
- Max Output
- 16,384 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- text only
- Audio
- no
- Moderated
- no
Similar models
nvidia
NVIDIA: Nemotron 3 Super
$0.09 in / $0.45 out
1,000,000 ctx
99
nvidia
NVIDIA: Nemotron 3 Nano 30B A3B
$0.05 in / $0.20 out
262,144 ctx
99
nvidia
NVIDIA: Nemotron 3 Nano Omni (free)
Free
256,000 ctx
100
nvidia
NVIDIA: Nemotron 3 Super (free)
Free
1,000,000 ctx
93
nvidia
NVIDIA: Nemotron Nano 9B V2
$0.04 in / $0.16 out
131,072 ctx
91
nvidia
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
$0.10 in / $0.40 out
131,072 ctx
91