ModelsQwen3-32B

Qwen3-32B

Cost-effective

Best value for production workloads

Pricing

Input

$0.20

per 1M tokens

Output

$0.40

per 1M tokens

Cached

N/A

per 1M tokens

Note: Latest Qwen model, excellent performance-to-cost ratio

Context & Output

Context Window128K tokens

Max Output32K tokens

Latency

Very Fast

Capabilities

Multimodal

Streaming

Function Calling

Prompt Caching

Key Strengths

What makes this model stand out

Outperforms o1-mini

Strong reasoning

Apache 2.0

Similar Models in Cost-effective Tier

Other models with similar pricing and performance characteristics

Amazon Nova 2 Lite

Amazon

Input:TBD/M

Context:1M tokens

Amazon Nova 2 Sonic

Amazon

Input:TBD/M

Context:1M tokens

DeepSeek V3.1

DeepSeek

Input:$0.56/M

Context:128K tokens