ModelsLlama 3.2 90B

Llama 3.2 90B

Ultra-cheap

Maximum cost efficiency for high-volume tasks

Pricing

Input

$0.20

per 1M tokens

Output

$0.40

per 1M tokens

Cached

N/A

per 1M tokens

Note: Released Sep 2024. First multimodal open model from Meta

Context & Output

Context Window128K tokens

Max Output32K tokens

Latency

Fast

Capabilities

Multimodal

Streaming

Function Calling

Prompt Caching

Key Strengths

What makes this model stand out

First multimodal Llama

Image+text

Open weights

Similar Models in Ultra-cheap Tier

Other models with similar pricing and performance characteristics

Llama 4 Scout