Inference.net Provider

Inference.net is a platform for running large language models in the cloud.

Available Models

Llama 3.2 11B Instruct

meta
llama-3.2-11b-instruct
Streaming
JSON Output
Inference.net
Context: 128k
Input
$0.07
/M tokens
Cached
/M tokens
Output
$0.33
/M tokens

Llama 3.1 8B Instruct

meta
llama-3.1-8b-instruct
Streaming
Inference.net
Context: 128k
Input
$0.07
/M tokens
Cached
/M tokens
Output
$0.33
/M tokens