GLM 4.6

Cost-effective

Best value for production workloads

Pricing

Input: $0.30 per 1M tokens

Output: $0.55 per 1M tokens

Cached: N/A

Note: Strong Chinese/English performance, open-source
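
As a rough illustration of how the listed input and output rates combine, the sketch below estimates the cost of a workload from its token counts. The helper name `estimate_cost` is hypothetical and is not part of any GLM SDK. The sketch also assumes billing is strictly linear in tokens, with no minimum charge or rounding.

```python
from decimal import Decimal

# Rates copied from the pricing card, in USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.30")
OUTPUT_USD_PER_M = Decimal("0.55")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a workload (hypothetical helper).

    Assumes linear per-token billing; real invoices may differ.
    """
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_M / TOKENS_PER_M
    return input_cost + output_cost


# Example workload: 2M input tokens and 500K output tokens.
print(estimate_cost(2_000_000, 500_000))
```

Under these assumptions, 2M input tokens cost $0.60 and 500K output tokens cost $0.275, for a total of $0.875.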

Context & Output

Context Window: 128K tokens

Max Output: 32K tokens
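
The two limits above can be checked before sending a request. The sketch below is a minimal pre-flight check under several assumptions that the card does not confirm: "128K" and "32K" are taken as 128,000 and 32,000 tokens, and the requested output is assumed to count toward the context window. The function name `fits_limits` is hypothetical.

```python
CONTEXT_WINDOW = 128_000  # assumption: "128K" means 128,000 tokens
MAX_OUTPUT = 32_000       # assumption: "32K" means 32,000 tokens


def fits_limits(prompt_tokens: int, requested_output: int) -> bool:
    """Return True if a request stays within both limits (hypothetical helper).

    Assumes the prompt and the requested output share the context window.
    """
    if requested_output > MAX_OUTPUT:
        return False
    return prompt_tokens + requested_output <= CONTEXT_WINDOW


# A 90K-token prompt leaves room for the full 32K output; a 100K prompt does not.
print(fits_limits(90_000, 32_000))
print(fits_limits(100_000, 32_000))
```

Token counts depend on the model's tokenizer, so treat the result as an estimate and not a guarantee.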

Latency

Fast

Capabilities

Multimodal

Streaming

Function Calling

Prompt Caching

Key Strengths

What makes this model stand out

355B MoE

Bilingual (CN/EN)

MIT license

Similar Models in Cost-effective Tier

Other models with similar pricing and performance characteristics

Amazon Nova 2 Lite (Amazon)
Input: TBD/M
Context: 1M tokens

Amazon Nova 2 Sonic (Amazon)
Input: TBD/M
Context: 1M tokens

DeepSeek V3.1 (DeepSeek)
Input: $0.56/M
Context: 128K tokens