ModelsLlama 4 Maverick

Llama 4 Maverick

Cost-effective

Best value for production workloads

Pricing

Input

$0.50

per 1M tokens

Output

$0.77

per 1M tokens

Cached

N/A

per 1M tokens

Note: Released April 2025. 17B active, 128 experts

Context & Output

Context Window1M tokens

Max Output32K tokens

Latency

Fast

Capabilities

Multimodal

Streaming

Function Calling

Prompt Caching

Key Strengths

What makes this model stand out

400B params MoE

Multimodal

Open weights

Similar Models in Cost-effective Tier

Other models with similar pricing and performance characteristics

Amazon Nova 2 Lite

Amazon

Input:TBD/M

Context:1M tokens

Amazon Nova 2 Sonic

Amazon

Input:TBD/M

Context:1M tokens

DeepSeek V3.1

DeepSeek

Input:$0.56/M

Context:128K tokens