ModelsLlama 4 Scout

Llama 4 Scout

Ultra-cheap

Maximum cost efficiency for high-volume tasks

Pricing

Input

$0.11

per 1M tokens

Output

$0.34

per 1M tokens

Cached

N/A

per 1M tokens

Note: Released April 2025. 10M = ~7,500 pages. 17B active, 109B total

Context & Output

Context Window10M tokens

Max Output32K tokens

Latency

Fast

Capabilities

Multimodal

Streaming

Function Calling

Prompt Caching

Key Strengths

What makes this model stand out

10M context!

Multimodal

Open weights

Similar Models in Ultra-cheap Tier

Other models with similar pricing and performance characteristics

GPT-5 Nano

OpenAI

Input:$0.05/M

Context:272K tokens

Llama 3.1 405B