GLM-4.6

Updated GLM with reasoning capabilities.

glm-4.6
STABLE
204,800 context
Starting at $0.32/M (30% off) input tokens
Starting at $1.05/M (30% off) output tokens
Streaming
Tools
Reasoning
JSON Output

All Providers for GLM-4.6

LLM Gateway routes each request to the best available provider that can handle your prompt size and parameters.
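
The routing idea can be sketched in a few lines of Python. This is only an illustration and not LLM Gateway's actual routing logic: the page does not define "best", so the sketch assumes it means the lowest estimated cost among active providers whose context window fits the request. The `Provider` class, `pick_provider` function, and the rule that prompt plus output tokens must fit in the context window are all hypothetical; the figures are copied from the provider list on this page, reading "200k" as 200,000 tokens.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Provider:
    """Hypothetical record for one provider entry on this page."""
    name: str
    context_tokens: int
    input_per_m: float   # USD per million input tokens (discounted price where shown)
    output_per_m: float  # USD per million output tokens (discounted price where shown)
    active: bool = True  # False for entries marked "Deactivated since ..."


# Figures copied from the provider list; context sizes expanded from "k" notation.
PROVIDERS = [
    Provider("Z AI", 200_000, 0.54, 1.98),
    Provider("Cerebras", 200_000, 2.25, 2.75, active=False),
    Provider("CanopyWave", 202_800, 0.315, 1.05, active=False),
    Provider("Alibaba Cloud", 202_800, 0.431, 2.007),
    Provider("NovitaAI", 204_800, 0.55, 2.2),
]


def pick_provider(
    prompt_tokens: int,
    max_output_tokens: int,
    providers: List[Provider] = PROVIDERS,
) -> Optional[Provider]:
    """Return the cheapest active provider that fits the request, or None.

    Assumption: prompt and output tokens share the context window, and
    "best" means lowest estimated cost for this request.
    """
    needed = prompt_tokens + max_output_tokens
    eligible = [p for p in providers if p.active and p.context_tokens >= needed]
    if not eligible:
        return None

    def estimated_cost(p: Provider) -> float:
        return (
            prompt_tokens * p.input_per_m + max_output_tokens * p.output_per_m
        ) / 1_000_000

    return min(eligible, key=estimated_cost)


if __name__ == "__main__":
    choice = pick_provider(prompt_tokens=10_000, max_output_tokens=1_000)
    print(choice.name if choice else "no provider fits")
```

Under these assumptions the choice depends on the request shape: input-heavy requests favor the provider with the lowest input price, output-heavy requests favor the lowest output price, and very long prompts leave only the providers with the largest context window.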

Z AI
Context: 200k (10% off)
Input: $0.54/M tokens (list price $0.6)
Cached: $0.099/M tokens (list price $0.11)
Output: $1.98/M tokens (list price $2.2)
Search: + $0.010 per search

Cerebras
Context: 200k
Deactivated since Jan 20, 2026
Input: $2.25/M tokens
Cached: no price listed
Output: $2.75/M tokens

CanopyWave
Context: 202.8k (30% off)
Deactivated since Jan 1, 2026
Input: $0.315/M tokens (list price $0.45)
Cached: no price listed
Output: $1.05/M tokens (list price $1.5)

Alibaba Cloud
Context: 202.8k
Input: $0.431/M tokens
Cached: no price listed
Output: $2.007/M tokens

NovitaAI
Context: 204.8k
Input: $0.55/M tokens
Cached: $0.11/M tokens
Output: $2.2/M tokens
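
To make the per-million-token prices concrete, here is a minimal cost estimate in Python. The default rates are the discounted Z AI prices listed above; the function name `estimate_cost_usd` is hypothetical, and the billing rule is an assumption: cached tokens are taken to be a subset of the input tokens and charged at the cached rate instead of the input rate. The page does not state how billing is actually computed.

```python
def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    cached_tokens: int = 0,
    searches: int = 0,
    input_per_m: float = 0.54,    # Z AI discounted input price, USD per million tokens
    cached_per_m: float = 0.099,  # Z AI discounted cached price, USD per million tokens
    output_per_m: float = 1.98,   # Z AI discounted output price, USD per million tokens
    per_search: float = 0.010,    # Z AI per-search fee, USD
) -> float:
    """Estimate the USD cost of one request.

    Assumption: cached tokens are part of input_tokens and are billed at
    the cached rate rather than the regular input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    token_cost = (
        uncached_tokens * input_per_m
        + cached_tokens * cached_per_m
        + output_tokens * output_per_m
    ) / 1_000_000
    return token_cost + searches * per_search


if __name__ == "__main__":
    # 100k input tokens (half of them cached), 20k output tokens, 2 searches.
    cost = estimate_cost_usd(100_000, 20_000, cached_tokens=50_000, searches=2)
    print(f"${cost:.5f}")
```

Passing another provider's rates as keyword arguments gives a like-for-like comparison for the same request.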