ModelsQwen3-32B
Qwen3-32B
by Alibaba
Cost-effective
Best value for production workloads
Pricing
Input
$0.20
per 1M tokens
Output
$0.40
per 1M tokens
Cached
N/A
per 1M tokens
Note: Latest Qwen model, excellent performance-to-cost ratio
Context & Output
Context Window128K tokens
Max Output32K tokens
Latency
Very Fast
Capabilities
Multimodal
Streaming
Function Calling
Prompt Caching
Key Strengths
What makes this model stand out
Outperforms o1-mini
Strong reasoning
Apache 2.0
Similar Models in Cost-effective Tier
Other models with similar pricing and performance characteristics
Amazon Nova 2 Lite
Amazon
Input:TBD/M
Context:1M tokens
Amazon Nova 2 Sonic
Amazon
Input:TBD/M
Context:1M tokens
DeepSeek V3.1
DeepSeek
Input:$0.56/M
Context:128K tokens