ModelsLlama 4 Maverick

Llama 4 Maverick

by Meta

Cost-effective

Best value for production workloads

Pricing
Input
$0.50
per 1M tokens
Output
$0.77
per 1M tokens
Cached
N/A
per 1M tokens
Note: Released April 2025. 17B active, 128 experts
Context & Output
Context Window1M tokens
Max Output32K tokens
Latency
Fast
Capabilities
Multimodal
Streaming
Function Calling
Prompt Caching
Key Strengths
What makes this model stand out
400B params MoE
Multimodal
Open weights
Similar Models in Cost-effective Tier
Other models with similar pricing and performance characteristics
DeepSeek V3.1
DeepSeek
Input:$0.56/M
Context:128K tokens
View Details
GPT-5 Mini
OpenAI
Input:$0.25/M
Context:272K tokens
View Details
DeepSeek R1
DeepSeek
Input:$0.55/M
Context:128K tokens
View Details