Zhipu AI (GLM) API Pricing 2026
Zhipu AI is a Tsinghua-affiliated AI lab. GLM-5.1 is its latest flagship model; GLM-4.7 Flash is the budget tier, priced for high-throughput workloads.
Pricing verified 2026-06-06. Sourced from bigmodel.cn.
Zhipu AI (GLM) Model Pricing
Prices in USD per 1M tokens
| Model | Description | Input / 1M | Output / 1M | Context (tokens) |
|---|---|---|---|---|
| GLM-5-Turbo | Faster GLM-5 variant; optimized throughput with same 200K context | $1.20 | $4.00 | 204,800 |
| GLM-5.1 | Latest GLM flagship; improved reasoning over GLM-5 | $1.40 | $4.40 | 204,800 |
| GLM-5 | Previous GLM flagship; 200K context; strong enterprise tasks | $1.00 | $3.20 | 204,800 |
| GLM-4.7 Flash | Budget Zhipu tier; excellent price-performance for high-volume workloads | $0.06 | $0.40 | 200,000 |
Estimated Monthly Cost (70% input / 30% output split)
| Model | 1M tokens/mo | 10M tokens/mo | 100M tokens/mo | 1B tokens/mo |
|---|---|---|---|---|
| GLM-5-Turbo | $2.04 | $20.40 | $204 | $2,040 |
| GLM-5.1 | $2.30 | $23.00 | $230 | $2,300 |
| GLM-5 | $1.66 | $16.60 | $166 | $1,660 |
| GLM-4.7 Flash | $0.162 | $1.62 | $16.20 | $162 |
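The estimates above follow from a blended rate: 70% of the monthly tokens are billed at the input price and 30% at the output price. A minimal Python sketch of that arithmetic, using the list prices from the pricing table; the names `PRICES` and `monthly_cost` are illustrative and not part of any Zhipu SDK:

```python
# Prices in USD per 1M tokens (input, output), copied from the pricing table.
PRICES = {
    "GLM-5-Turbo": (1.20, 4.00),
    "GLM-5.1": (1.40, 4.40),
    "GLM-5": (1.00, 3.20),
    "GLM-4.7 Flash": (0.06, 0.40),
}


def monthly_cost(model, total_tokens, input_share=0.70):
    """Estimate monthly spend in USD for a total token volume.

    input_share is the fraction of tokens billed as input; the rest
    is billed as output. The 70/30 default mirrors the table's assumption.
    """
    input_price, output_price = PRICES[model]
    blended_per_million = (
        input_share * input_price + (1 - input_share) * output_price
    )
    return total_tokens / 1_000_000 * blended_per_million


if __name__ == "__main__":
    for model in PRICES:
        print(f"{model}: ${monthly_cost(model, 10_000_000):.2f} per 10M tokens")
```

Changing `input_share` shows how sensitive the estimate is to the workload mix: output tokens cost roughly three to seven times as much as input tokens across these models, so output-heavy workloads will come in above the table's figures.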
Frequently Asked Questions
How much does the Zhipu AI (GLM) API cost?
Zhipu AI (GLM) offers 4 models. Input prices range from $0.06 to $1.40 per 1M tokens, and output prices range from $0.40 to $4.40 per 1M tokens. GLM-5.1 is the most expensive model and GLM-4.7 Flash the cheapest.
Is Zhipu AI (GLM) cheaper than self-hosting?
For low-volume workloads (under 100M tokens/month), cloud APIs like Zhipu AI (GLM) are almost always cheaper than purchasing and maintaining GPU hardware. Use our calculator to find the exact break-even point for your usage.
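As a rough illustration of the break-even idea, the sketch below finds the monthly token volume at which API spend would equal a fixed self-hosting budget. It assumes self-hosting is a flat monthly cost, which ignores capacity limits and variable costs. The function name `break_even_tokens` and the $2,000/month figure are hypothetical placeholders, not measured data:

```python
def break_even_tokens(self_host_monthly_usd, input_price, output_price,
                      input_share=0.70):
    """Monthly token volume at which API spend equals a flat self-hosting cost.

    Prices are in USD per 1M tokens. Below the returned volume the API is
    cheaper under this simplified model; above it, self-hosting is.
    """
    blended_per_million = (
        input_share * input_price + (1 - input_share) * output_price
    )
    return self_host_monthly_usd / blended_per_million * 1_000_000


if __name__ == "__main__":
    # HYPOTHETICAL budget: $2,000/month for hardware amortization, power, ops.
    # GLM-5.1 list prices from the pricing table: $1.40 input, $4.40 output.
    tokens = break_even_tokens(2_000, 1.40, 4.40)
    print(f"Break-even at about {tokens / 1e6:.0f}M tokens/month")
```

Substitute your own hardware and operating costs for the placeholder; the result scales linearly with the self-hosting budget.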