Meta (via Together AI) AI Models
Open-source Llama models available through inference providers
Meta (via Together AI) Models & Pricing
| Model | Input Price | Output Price | Context Window | Max Output | Category | Capabilities |
|---|---|---|---|---|---|---|
Llama 4 Maverick Latest open-source Llama model with strong multimodal capabilities Released: 4/5/2025 | $0.27 per 1M tokens | $0.85 per 1M tokens | 1,000,000 tokens | 65,536 tokens | flagship | textvisioncode |
Llama 4 Scout Efficient multimodal Llama 4 model with 10M token context window and MoE architecture (17B active / 109B total params) Released: 4/5/2025 | $0.08 per 1M tokens | $0.30 per 1M tokens | 10,000,000 tokens | 32,768 tokens | efficient | textvisioncode |
Llama 3.3 70B Meta's improved 70B model with better performance than Llama 3.1 70B at the same price Released: 12/6/2024 | $0.88 per 1M tokens | $0.88 per 1M tokens | 131,072 tokens | 4,096 tokens | standard | textcode |
$3.50 per 1M tokens | $3.50 per 1M tokens | 128,000 tokens | 32,768 tokens | flagship | textcodereasoning | |
$0.88 per 1M tokens | $0.88 per 1M tokens | 128,000 tokens | 32,768 tokens | balanced | textcode | |
$0.18 per 1M tokens | $0.18 per 1M tokens | 128,000 tokens | 32,768 tokens | efficient | textcode |
Calculate Meta (via Together AI) Costs
Use our calculator to estimate costs for any Meta (via Together AI) model based on your usage patterns.
Compare Meta (via Together AI) Models
Llama 4 Maverick vs Llama 3.1 405B
Llama 4 Scout vs Llama 3.1 8B
Llama 3.3 70B vs Llama 3.1 405B
Llama 3.1 405B vs Llama 3.1 70B
Llama 3.1 70B vs Llama 3.1 8B
Meta (via Together AI) Pricing FAQ
How much does Meta (via Together AI) API cost?
Meta (via Together AI) offers 6 models with input pricing ranging from $0.08 to $3.50 per million tokens. The most affordable option is Llama 4 Scout at $0.08/1M input and $0.30/1M output tokens.
What is the cheapest Meta (via Together AI) model?
Llama 4 Scout is the cheapest Meta (via Together AI) model at $0.08 per million input tokens and $0.30 per million output tokens. It supports a 10,000,000-token context window.
Which Meta (via Together AI) model should I use?
It depends on your use case. Llama 4 Scout is best for cost-sensitive applications. Llama 3.1 405B ($3.50/1M input) offers the strongest capabilities. For high-volume workloads, their efficient-tier models offer the best cost-performance ratio.
What models does Meta (via Together AI) offer?
Meta (via Together AI) currently offers 6 models: Llama 4 Scout, Llama 4 Maverick, Llama 3.1 405B, Llama 3.3 70B, Llama 3.1 70B, Llama 3.1 8B. These span efficient, flagship, standard, balanced categories with context windows up to 10000K tokens.
How does Meta (via Together AI) pricing compare to competitors?
Meta (via Together AI)'s pricing starts at $0.08/1M input tokens with Llama 4 Scout. Use our comparison tool to see side-by-side pricing against other providers like OpenAI, Anthropic, and Google.
About Meta (via Together AI)
Open-source Llama models available through inference providers
Related Articles
Open-Source vs Proprietary AI Models: A Complete Cost Comparison for 2026
Llama 4, DeepSeek, and Mistral are closing the quality gap with GPT-5 and Claude — at a fraction of the price. We bre...
3/24/2026
Llama 4 Maverick: Is Meta's Open Model the Cheapest Option?
Meta's Llama 4 Maverick offers a 1M context window at budget pricing. We analyze costs via Together AI and compare ag...
2/20/2026