Inference.net Provider
Inference.net is a platform for running large language models in the cloud.
Available Models
Llama 3.2 11B Instruct (meta)
Model ID: llama-3.2-11b-instruct
Capabilities: Streaming, JSON Output
Provider: Inference.net
Context: 128k
Input: $0.07 /M tokens
Cached: not listed
Output: $0.33 /M tokens
Llama 3.1 8B Instruct (meta)
Model ID: llama-3.1-8b-instruct
Capabilities: Streaming
Provider: Inference.net
Context: 128k
Input: $0.07 /M tokens
Cached: not listed
Output: $0.33 /M tokens
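Because prices are quoted per million tokens, the cost of a single request follows from simple arithmetic. The sketch below is a rough illustration only: the `PRICES_PER_MILLION` table and the `estimate_cost` helper are hypothetical names, not part of any Inference.net SDK, and the figures are copied from the listing above, so they may be out of date.

```python
# Prices in USD per million tokens, copied from the model listing above.
# This table and the helper below are illustrative, not an official API.
PRICES_PER_MILLION = {
    "llama-3.2-11b-instruct": {"input": 0.07, "output": 0.33},
    "llama-3.1-8b-instruct": {"input": 0.07, "output": 0.33},
}


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD for the given token counts."""
    price = PRICES_PER_MILLION[model_id]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a request with 2,000 prompt tokens and 500 completion tokens.
    cost = estimate_cost("llama-3.1-8b-instruct", 2_000, 500)
    print(f"Estimated cost: ${cost:.6f}")
```

Cached-input pricing is left out of the estimate because no cached rate is listed for either model.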