Llama 3.1 Nemotron Ultra 253B

NVIDIA-tuned Llama 3.1 253B for maximum capability.

llama-3.1-nemotron-ultra-253b
STABLEGet Started
Streaming
JSON Output

Select Provider

Nebius AI Pricing for Llama 3.1 Nemotron Ultra 253B

View detailed pricing and capabilities for this provider.

Nebius AI
Context: 128k
Input
$0.6
/M tokens
Cached
/M tokens
Output
$1.8
/M tokens
Get Started