Z AI Provider
Z AI's OpenAI-compatible large language models
Available Models
GLM-5
glm10% off
glm-5Streaming
Tools
Reasoning
JSON Output
10% Discount
Z AI
Context: 202.8k10% off
Input
$1$0.9
/M tokens
Cached
$0.2$0.18
/M tokens
Output
$3.2$2.88
/M tokens
+ $0.010 per search
GLM-4.7
glm10% off
glm-4.7Streaming
Tools
Reasoning
JSON Output
10% Discount
Z AI
Context: 200k10% off
Input
$0.6$0.54
/M tokens
Cached
$0.11$0.099
/M tokens
Output
$2.2$1.98
/M tokens
+ $0.010 per search
GLM-4.7 FlashX
glm10% off
glm-4.7-flashxStreaming
Tools
Reasoning
JSON Output
10% Discount
Z AI
Context: 200k10% off
Input
$0.07$0.063
/M tokens
Cached
$0.01$0.009
/M tokens
Output
$0.4$0.36
/M tokens
GLM-4.7 Flash
glm
glm-4.7-flashStreaming
Tools
Reasoning
JSON Output
Z AI
Context: 200k
Input
$0
/M tokens
Cached
$0
/M tokens
Output
$0
/M tokens
GLM-4.6V
glm10% off
glm-4.6vStreaming
Vision
Tools
Reasoning
JSON Output
10% Discount
Z AI
Context: 128k10% off
Input
$0.3$0.27
/M tokens
Cached
$0.05$0.045
/M tokens
Output
$0.9$0.81
/M tokens
GLM-4.6V FlashX
glm10% off
glm-4.6v-flashxStreaming
Vision
Tools
Reasoning
JSON Output
10% Discount
Z AI
Context: 128k10% off
Input
$0.04$0.036
/M tokens
Cached
$0.004$0.0036
/M tokens
Output
$0.4$0.36
/M tokens
GLM-4.6V Flash
glm
glm-4.6v-flashStreaming
Vision
Tools
Reasoning
JSON Output
Z AI
Context: 128k
Input
$0
/M tokens
Cached
$0
/M tokens
Output
$0
/M tokens
GLM-4.6
glm10% off
glm-4.6Streaming
Tools
Reasoning
JSON Output
10% Discount
Z AI
Context: 200k10% off
Input
$0.6$0.54
/M tokens
Cached
$0.11$0.099
/M tokens
Output
$2.2$1.98
/M tokens
+ $0.010 per search
GLM-4.5 Flash
glm
glm-4.5-flashStreaming
Tools
JSON Output
Z AI
Context: 128k
Input
$0
/M tokens
Cached
$0
/M tokens
Output
$0
/M tokens
GLM-4.5V
glm10% off
glm-4.5vStreaming
Vision
Tools
Reasoning
JSON Output
10% Discount
Z AI
Context: 128k10% off
Input
$0.6$0.54
/M tokens
Cached
$0.11$0.099
/M tokens
Output
$1.8$1.62
/M tokens
GLM-4.5
glm10% off
glm-4.5Streaming
Tools
Reasoning
JSON Output
10% Discount
Z AI
Context: 128k10% off
Input
$0.6$0.54
/M tokens
Cached
$0.11$0.099
/M tokens
Output
$2.2$1.98
/M tokens
+ $0.010 per search
GLM-4.5 X
glm10% off
glm-4.5-xStreaming
Tools
Reasoning
JSON Output
10% Discount
Z AI
Context: 128k10% off
Input
$2.2$1.98
/M tokens
Cached
$0.45$0.405
/M tokens
Output
$8.9$8.01
/M tokens
GLM-4.5 AirX
glm10% off
glm-4.5-airxStreaming
Tools
JSON Output
10% Discount
Z AI
Context: 128k10% off
Input
$1.1$0.99
/M tokens
Cached
$0.22$0.198
/M tokens
Output
$4.5$4.05
/M tokens
GLM-4.5 Air
glm10% off
glm-4.5-airStreaming
Tools
JSON Output
10% Discount
Z AI
Context: 128k10% off
Input
$0.2$0.18
/M tokens
Cached
$0.03$0.027
/M tokens
Output
$1.1$0.99
/M tokens
GLM-4 32B (0414-128k)
glm10% off
glm-4-32b-0414-128kStreaming
Tools
JSON Output
10% Discount
Z AI
Context: 128k10% off
Input
$0.1$0.09
/M tokens
Cached
$0$0
/M tokens
Output
$0.1$0.09
/M tokens
CogView-4
zai10% off
cogview-4Image Generation
10% Discount
Z AI
Context: 2k10% off
Input
$0$0
/M tokens
Cached
—
/M tokens
Output
$0$0
/M tokens
+ $0.010 per request
GLM-Image
glm10% off
glm-imageImage Generation
10% Discount
Z AI
Context: 2k10% off
Input
$0$0
/M tokens
Cached
—
/M tokens
Output
$0$0
/M tokens
+ $0.015 per request