NovitaAI Provider

NovitaAI's OpenAI-compatible large language models

Get started Try in Playground Visit company

Available Models

Qwen3.5 397B A17B

alibaba

qwen35-397b-a17b

Streaming

Vision

Tools

Reasoning

JSON Output

NovitaAI

Context: 262.1k

Input

$0.6

/M tokens

Cached

—

/M tokens

Output

$3.6

/M tokens

MiniMax M2.5

minimax

minimax-m2.5

Streaming

Tools

Reasoning

JSON Output

NovitaAI

Context: 204.8k

Input

$0.3

/M tokens

Cached

$0.03

/M tokens

Output

$1.2

/M tokens

GLM-5

glm

glm-5

Streaming

Tools

Reasoning

JSON Output

NovitaAI

Context: 202.8k

Input

$1

/M tokens

Cached

$0.2

/M tokens

Output

$3.2

/M tokens

MiniMax M2.1

minimax

minimax-m2.1

Streaming

Tools

Reasoning

JSON Output

NovitaAI

Context: 204.8k

Input

$0.3

/M tokens

Cached

$0.03

/M tokens

Output

$1.2

/M tokens

GLM-4.7

glm

glm-4.7

Streaming

Tools

Reasoning

JSON Output

NovitaAI

Context: 204.8k

Input

$0.6

/M tokens

Cached

$0.11

/M tokens

Output

$2.2

/M tokens

GLM-4.6V

glm

glm-4.6v

Streaming

Vision

Tools

Reasoning

JSON Output

NovitaAI

Context: 131.1k

Input

$0.3

/M tokens

Cached

$0.055

/M tokens

Output

$0.9

/M tokens

Qwen3 VL 8B Instruct

alibaba

qwen3-vl-8b-instruct

Streaming

Vision

JSON Output

NovitaAI

Context: 131.1k

Input

$0.08

/M tokens

Cached

—

/M tokens

Output

$0.5

/M tokens

Qwen3 VL 30B A3B Thinking

alibaba

qwen3-vl-30b-a3b-thinking

Streaming

Vision

Tools

Reasoning

JSON Output

NovitaAI

Context: 131.1k

Input

$0.2

/M tokens

Cached

—

/M tokens

Output

$1

/M tokens

Qwen3 VL 30B A3B Instruct

alibaba

qwen3-vl-30b-a3b-instruct

Streaming

Vision

Tools

NovitaAI

Context: 131.1k

Input

$0.2

/M tokens

Cached

—

/M tokens

Output

$0.7

/M tokens

GLM-4.6

glm

glm-4.6

Streaming

Tools

Reasoning

JSON Output

NovitaAI

Context: 204.8k

Input

$0.55

/M tokens

Cached

$0.11

/M tokens

Output

$2.2

/M tokens

DeepSeek V3.2

deepseek

deepseek-v3.2

Streaming

Tools

JSON Output

NovitaAI

Context: 163.8k

Input

$0.269

/M tokens

Cached

$0.1345

/M tokens

Output

$0.4

/M tokens

Qwen3 Max

alibaba

qwen3-max

Streaming

Tools

JSON Output

NovitaAI

Context: 262.1k

Input

$0.845

/M tokens

Cached

—

/M tokens

Output

$3.38

/M tokens

Qwen3 VL 235B A22B Instruct

alibaba

qwen3-vl-235b-a22b-instruct

Streaming

Vision

Tools

JSON Output

NovitaAI

Context: 131.1k

Input

$0.3

/M tokens

Cached

—

/M tokens

Output

$1.5

/M tokens

Qwen3 VL 235B A22B Thinking

alibaba

qwen3-vl-235b-a22b-thinking

Streaming

Vision

Reasoning

NovitaAI

Context: 131.1k

Input

$0.98

/M tokens

Cached

—

/M tokens

Output

$3.95

/M tokens

Qwen3 Next 80B A3B Thinking

alibaba

qwen3-next-80b-a3b-thinking

Streaming

Tools

Reasoning

NovitaAI

Context: 131.1k

Input

$0.15

/M tokens

Cached

—

/M tokens

Output

$1.5

/M tokens

Qwen3 Next 80B A3B Instruct

alibaba

qwen3-next-80b-a3b-instruct

Streaming

Tools

JSON Output

NovitaAI

Context: 131.1k

Input

$0.15

/M tokens

Cached

—

/M tokens

Output

$1.5

/M tokens

GLM-4.5V

glm

glm-4.5v

Streaming

Vision

Tools

Reasoning

JSON Output

NovitaAI

Context: 65.5k

Input

$0.6

/M tokens

Cached

$0.11

/M tokens

Output

$1.8

/M tokens

Qwen3 Coder 30B A3B Instruct

alibaba

qwen3-coder-30b-a3b-instruct

Streaming

Tools

JSON Output

NovitaAI

Context: 160k

Input

$0.07

/M tokens

Cached

—

/M tokens

Output

$0.27

/M tokens

Qwen3 235B A22B Thinking 2507

alibaba

qwen3-235b-a22b-thinking-2507

Streaming

Tools

NovitaAI

Context: 131.1k

Input

$0.3

/M tokens

Cached

—

/M tokens

Output

$3

/M tokens

Qwen3 235B A22B Instruct 2507

alibaba

qwen3-235b-a22b-instruct-2507

Streaming

Tools

JSON Output

NovitaAI

Context: 131.1k

Input

$0.09

/M tokens

Cached

—

/M tokens

Output

$0.58

/M tokens

Kimi K2

moonshot

kimi-k2

Streaming

Tools

NovitaAI

Context: 131.1k

Input

$0.57

/M tokens

Cached

—

/M tokens

Output

$2.3

/M tokens

Qwen3 235B A22B FP8

alibaba

qwen3-235b-a22b-fp8

Streaming

JSON Output

NovitaAI

Context: 41.0k

Input

$0.2

/M tokens

Cached

—

/M tokens

Output

$0.8

/M tokens

Qwen3 32B FP8

alibaba

qwen3-32b-fp8

Streaming

NovitaAI

Context: 41.0k

Input

$0.1

/M tokens

Cached

—

/M tokens

Output

$0.45

/M tokens

Qwen3 30B A3B FP8

alibaba

qwen3-30b-a3b-fp8

Streaming

NovitaAI

Context: 41.0k

Input

$0.09

/M tokens

Cached

—

/M tokens

Output

$0.45

/M tokens

Qwen3 4B FP8

alibaba

qwen3-4b-fp8

Streaming

NovitaAI

Context: 128k

Input

$0.03

/M tokens

Cached

—

/M tokens

Output

$0.03

/M tokens

Llama 4 Scout 17B Instruct

meta

llama-4-scout-17b-instruct

Streaming

Vision

JSON Output

NovitaAI

Context: 131.1k

Input

$0.18

/M tokens

Cached

—

/M tokens

Output

$0.59

/M tokens

Llama 4 Maverick 17B Instruct

meta

llama-4-maverick-17b-instruct

Streaming

Vision

JSON Output

NovitaAI

Context: 1.0M

Input

$0.27

/M tokens

Cached

—

/M tokens

Output

$0.85

/M tokens

Llama 3 8B Instruct

meta

llama-3-8b-instruct

Streaming

JSON Output

NovitaAI

Context: 8.2k

Input

$0.04

/M tokens

Cached

—

/M tokens

Output

$0.04

/M tokens

Qwen3 Coder 480B A35B Instruct

alibaba

qwen3-coder-480b-a35b-instruct

Streaming

Tools

JSON Output

NovitaAI

Context: 262.1k

Input

$0.3

/M tokens

Cached

—

/M tokens

Output

$1.3

/M tokens

Llama 3.3 70B Instruct

meta

llama-3.3-70b-instruct

Streaming

Tools

NovitaAI

Context: 131.1k

Input

$0.135

/M tokens

Cached

—

/M tokens

Output

$0.4

/M tokens

Llama 3.2 3B Instruct

meta

llama-3.2-3b-instruct

Streaming

JSON Output

NovitaAI

Context: 32.8k

Input

$0.03

/M tokens

Cached

—

/M tokens

Output

$0.05

/M tokens

Llama 3.1 8B Instruct

meta

llama-3.1-8b-instruct

Streaming

JSON Output

NovitaAI

Context: 16.4k

Input

$0.02

/M tokens

Cached

—

/M tokens

Output

$0.05

/M tokens

Hermes 2 Pro Llama 3 8B

nousresearch

hermes-2-pro-llama-3-8b

Streaming

NovitaAI

Context: 8.2k

Input

$0.14

/M tokens

Cached

—

/M tokens

Output

$0.14

/M tokens

Llama 3 70B Instruct

meta

llama-3-70b-instruct

Streaming

JSON Output

NovitaAI

Context: 8.2k

Input

$0.51

/M tokens

Cached

—

/M tokens

Output

$0.74

/M tokens

MiniMax M2.7

minimax

minimax-m2.7

Streaming

Tools

Reasoning

JSON Output

NovitaAI

Context: 204.8k

Input

$0.3

/M tokens

Cached

$0.06

/M tokens

Output

$1.2

/M tokens