Llama 3.1 405B Instruct

Largest Llama 3.1 model with 405B parameters.

llama-3.1-405b-instruct
STABLEModel DeactivatedGet Started
128,000 context
Starting at $1.00/M input tokens
Starting at $3.00/M output tokens
Streaming
Tools
JSON Output

Select Provider

All Providers for Llama 3.1 405B Instruct

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.

Nebius AI
Context: 128k
Deactivated since Nov 3, 2025
Input
$1
/M tokens
Cached
/M tokens
Output
$3
/M tokens
Get Started