Llama 3.3 70B
by Meta
Cost-effective
Best value for production workloads
Pricing
Input: $0.35 per 1M tokens
Output: $0.55 per 1M tokens
Cached: N/A
Note: Released December 2024. Performance comparable to the 405B model at a fraction of the cost.
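The per-token prices above translate into a per-request cost with simple arithmetic. The following is a minimal sketch that assumes the listed rates of $0.35 (input) and $0.55 (output) per 1M tokens; the function name `estimate_cost` and the token counts are hypothetical and only illustrate the calculation.

```python
# Prices per 1M tokens, copied from the pricing listed on this page.
INPUT_PRICE_PER_M = 0.35
OUTPUT_PRICE_PER_M = 0.55


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M input tokens and 1M output tokens.
    cost = estimate_cost(input_tokens=2_000_000, output_tokens=1_000_000)
    print(f"${cost:.2f}")  # prints $1.25
```

Cached-token pricing is listed as N/A, so the sketch bills every input token at the full input rate.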
Context & Output
Context Window: 128K tokens
Max Output: 32K tokens
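The two limits above can be combined to work out how many output tokens a given prompt leaves room for. This is a rough sketch under two assumptions that the page does not state: that "128K" and "32K" mean 128,000 and 32,000 tokens, and that prompt and output tokens share the same context window. The helper name `max_output_budget` is hypothetical.

```python
# Limits from this page; exact values are an assumption (K taken as 1,000).
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 32_000


def max_output_budget(prompt_tokens: int) -> int:
    """Return how many output tokens can be requested for a prompt.

    Assumes the prompt and the output share the context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the output cap, and never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    print(max_output_budget(10_000))   # short prompt: output cap applies
    print(max_output_budget(100_000))  # long prompt: remaining window applies
```

If a provider counts output separately from the context window, the budget is simply the output cap regardless of prompt length.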
Latency
Fast
Capabilities
Multimodal
Streaming
Function Calling
Prompt Caching
Key Strengths
What makes this model stand out
Performance similar to Llama 3.1 405B
Open weights
Cost-efficient
Similar Models in Cost-effective Tier
Other models with similar pricing and performance characteristics
Amazon Nova 2 Lite
Amazon
Input: TBD/M
Context: 1M tokens
Amazon Nova 2 Sonic
Amazon
Input: TBD/M
Context: 1M tokens
DeepSeek V3.1
DeepSeek
Input: $0.56/M
Context: 128K tokens