Zhipu AI (GLM) API Pricing 2025
Zhipu AI is a Tsinghua-affiliated AI lab whose GLM series is widely used in academic and enterprise settings. GLM-5 supports a 200K-token context window; GLM-4.7 Flash is the lowest-priced model in the lineup and targets high-throughput workloads.
Zhipu AI (GLM) Model Pricing
Prices in USD per 1M tokens
| Model | Input / 1M | Output / 1M | Context |
|---|---|---|---|
| GLM-4.7 Flash: budget Zhipu tier; excellent price-performance for high-volume workloads | $0.06 | $0.40 | 200,000 |
| GLM-5: Zhipu AI flagship; 200K context; strong on academic, research, and enterprise tasks | $1.00 | $3.20 | 204,800 |
| GLM-5-Turbo: faster GLM-5 variant; optimized throughput with the same 200K context | $1.20 | $4.00 | 204,800 |
Estimated Monthly Cost (70% input / 30% output split)
| Model | 1M tokens/mo | 10M tokens/mo | 100M tokens/mo | 1B tokens/mo |
|---|---|---|---|---|
| GLM-4.7 Flash | $0.162 | $1.62 | $16.20 | $162 |
| GLM-5 | $1.66 | $16.60 | $166 | $1,660 |
| GLM-5-Turbo | $2.04 | $20.40 | $204 | $2,040 |
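The estimates in this table follow from a blended per-token price: 70% of the monthly tokens are billed at the input rate and 30% at the output rate. The sketch below reproduces that arithmetic. The `PRICES` dictionary and the `monthly_cost` function are illustrative names, not part of any Zhipu AI SDK, and the prices are copied from the pricing table above.

```python
# Prices in USD per 1M tokens (input, output), copied from the pricing table.
PRICES = {
    "GLM-4.7 Flash": (0.06, 0.40),
    "GLM-5": (1.00, 3.20),
    "GLM-5-Turbo": (1.20, 4.00),
}


def monthly_cost(model: str, total_tokens: float, input_share: float = 0.70) -> float:
    """Estimated monthly cost in USD for a given total token volume.

    input_share is the fraction of tokens billed as input; the rest is
    billed as output. The default matches the 70% / 30% split used above.
    """
    input_price, output_price = PRICES[model]
    blended_per_million = input_share * input_price + (1 - input_share) * output_price
    return blended_per_million * total_tokens / 1_000_000


if __name__ == "__main__":
    volumes = (1_000_000, 10_000_000, 100_000_000, 1_000_000_000)
    for model in PRICES:
        row = [f"${monthly_cost(model, v):,.2f}" for v in volumes]
        print(model, row)
```

Changing `input_share` shows how sensitive the estimate is to the workload mix: output tokens cost roughly three to seven times more than input tokens across these models, so output-heavy workloads will land above the figures in the table.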
Frequently Asked Questions
How much does Zhipu AI (GLM) LLM API cost?
Zhipu AI (GLM) offers 3 models, with input prices ranging from $0.06 to $1.20 per 1M tokens and output prices ranging from $0.40 to $4.00 per 1M tokens. GLM-4.7 Flash is the cheapest option and GLM-5-Turbo is the most expensive.
Is Zhipu AI (GLM) cheaper than self-hosting?
For low-volume workloads (under 100M tokens/month), cloud APIs like Zhipu AI (GLM) are almost always cheaper than purchasing and maintaining GPU hardware. At 100M tokens/month, the estimated API cost ranges from $16.20 (GLM-4.7 Flash) to $204 (GLM-5-Turbo). Use our calculator to find the exact break-even point for your usage.
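As a rough illustration of the break-even idea, the sketch below divides a fixed monthly self-hosting cost by the blended API price per 1M tokens. It assumes self-hosting cost is a flat monthly figure that does not grow with volume, which is a simplification. The function name `break_even_tokens` and the $1,000/month figure in the usage example are hypothetical placeholders, not measured hardware costs.

```python
def break_even_tokens(
    monthly_self_host_usd: float,
    input_price: float,
    output_price: float,
    input_share: float = 0.70,
) -> float:
    """Monthly token volume at which API spend equals a fixed self-hosting cost.

    Prices are in USD per 1M tokens. Below the returned volume the API is
    cheaper; above it, the fixed-cost self-hosted setup is cheaper (under
    the simplifying assumption that its cost does not scale with usage).
    """
    blended_per_million = input_share * input_price + (1 - input_share) * output_price
    return monthly_self_host_usd / blended_per_million * 1_000_000


if __name__ == "__main__":
    # Hypothetical placeholder: $1,000/month of hardware and operations cost,
    # compared against GLM-5 pricing ($1.00 input, $3.20 output per 1M tokens).
    tokens = break_even_tokens(1_000.0, 1.00, 3.20)
    print(f"Break-even at about {tokens / 1e6:,.0f}M tokens/month")
```

A real comparison would also need to account for hardware amortization, power, engineering time, and whether the self-hosted setup can actually serve the required throughput.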