Llama 3.3 70B

Cost-effective

Best value for production workloads

Pricing

Input

$0.35

per 1M tokens

Output

$0.55

per 1M tokens

Cached

N/A

per 1M tokens

Note: Released December 2024. Performance comparable to Llama 3.1 405B at a fraction of the cost.
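As a rough illustration of how the per-token rates above translate into a bill, the sketch below multiplies token counts by the listed prices ($0.35 input, $0.55 output, each per 1M tokens). The function name and constants are hypothetical helpers for this example, not part of any provider SDK, and actual billing may round or meter differently.

```python
# Listed prices from this page, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 0.35
OUTPUT_USD_PER_MILLION = 0.55


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 2,000-token prompt with a 500-token reply.
    print(f"${estimate_cost_usd(2_000, 500):.6f}")
```

No cached-token rate is applied here because the page lists cached pricing as N/A.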

Context & Output

Context Window: 128K tokens

Max Output: 32K tokens
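A request can be checked against these two limits before it is sent. The sketch below is only an approximation under two assumptions that this page does not confirm: that "128K" and "32K" mean 128,000 and 32,000 tokens exactly, and that prompt and output tokens share the same context window. The function name is hypothetical.

```python
# Assumed exact values for the limits listed on this page.
CONTEXT_WINDOW_TOKENS = 128_000  # assumption: "128K" taken as 128,000
MAX_OUTPUT_TOKENS = 32_000       # assumption: "32K" taken as 32,000


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if the request stays within both limits (hypothetical helper)."""
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    # Assumption: prompt and output tokens count against one shared window.
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    print(fits_limits(100_000, 20_000))  # within both limits
    print(fits_limits(100_000, 32_000))  # exceeds the shared window
```

Token counts depend on the tokenizer, so a margin below the hard limits is a sensible precaution.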

Latency

Fast

Capabilities

Multimodal

Streaming

Function Calling

Prompt Caching

Key Strengths

What makes this model stand out

Performance similar to Llama 3.1 405B

Open weights

Cost-efficient

Similar Models in Cost-effective Tier

Other models with similar pricing and performance characteristics

Amazon Nova 2 Lite

Amazon

Input: TBD/M

Context: 1M tokens

Amazon Nova 2 Sonic

Amazon

Input: TBD/M

Context: 1M tokens

DeepSeek V3.1

DeepSeek

Input: $0.56/M

Context: 128K tokens