GLM 4.6
by Zhipu AI
Cost-effective
Best value for production workloads
Pricing
Input: $0.30 per 1M tokens
Output: $0.55 per 1M tokens
Cached: N/A
Note: Strong Chinese/English performance, open-source
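As a rough illustration of how the per-1M-token rates above translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost_usd` and the constant names are hypothetical, chosen only for this example; the two rates are copied from the Pricing figures on this page, and no cached-token discount is modeled because the page lists cached pricing as N/A.

```python
from decimal import Decimal

# Rates copied from the Pricing figures on this page (USD per 1M tokens).
INPUT_USD_PER_M = Decimal("0.30")
OUTPUT_USD_PER_M = Decimal("0.55")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes simple linear billing: tokens * rate / 1,000,000, with no
    cached-token discount and no minimum charge.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


if __name__ == "__main__":
    # A request with 100K input tokens and 20K output tokens.
    print(estimate_cost_usd(100_000, 20_000))
```

Under these assumptions, a request with 100K input tokens and 20K output tokens comes to about $0.041. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.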
Context & Output
Context Window: 128K tokens
Max Output: 32K tokens
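The two limits above can be checked before sending a request. The sketch below is illustrative only: the helper `fits_limits` is hypothetical, "128K" and "32K" are read as 128,000 and 32,000 tokens (a provider may instead mean 131,072 and 32,768), and it assumes the requested output counts against the same context window as the prompt, which this page does not state.

```python
# Limits as listed on this page, read as decimal thousands (an assumption).
CONTEXT_WINDOW_TOKENS = 128_000
MAX_OUTPUT_TOKENS = 32_000


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within both listed limits.

    Assumes prompt and output share one context window; this is a common
    convention but is not confirmed by the page.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    within_output_cap = requested_output_tokens <= MAX_OUTPUT_TOKENS
    within_context = (
        prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS
    )
    return within_output_cap and within_context


if __name__ == "__main__":
    print(fits_limits(100_000, 20_000))  # prompt plus output within the window
    print(fits_limits(100_000, 32_000))  # combined total exceeds the window
```

Token counts depend on the model's tokenizer, so the inputs to this check should come from the provider's own token counting rather than a character-based guess.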
Latency
Fast
Capabilities
Multimodal
Streaming
Function Calling
Prompt Caching
Key Strengths
What makes this model stand out
355B MoE
Bilingual (CN/EN)
MIT license
Similar Models in Cost-effective Tier
Other models with similar pricing and performance characteristics
DeepSeek V3.1
DeepSeek
Input: $0.56/M
Context: 128K tokens
GPT-5 Mini
OpenAI
Input: $0.25/M
Context: 272K tokens
DeepSeek R1
DeepSeek
Input: $0.55/M
Context: 128K tokens