ModelsLlama 4 Scout
Llama 4 Scout
by Meta
Ultra-cheap
Maximum cost efficiency for high-volume tasks
Pricing
Input
$0.11
per 1M tokens
Output
$0.34
per 1M tokens
Cached
N/A
per 1M tokens
Note: Released April 2025. 10M = ~7,500 pages. 17B active, 109B total
Context & Output
Context Window10M tokens
Max Output32K tokens
Latency
Fast
Capabilities
Multimodal
Streaming
Function Calling
Prompt Caching
Key Strengths
What makes this model stand out
10M context!
Multimodal
Open weights
Similar Models in Ultra-cheap Tier
Other models with similar pricing and performance characteristics
GPT-5 Nano
OpenAI
Input:$0.05/M
Context:272K tokens
Llama 3.1 405B
Meta
Input:$0.30/M
Context:128K tokens
Llama 3.2 90B
Meta
Input:$0.20/M
Context:128K tokens