Skip to main content
🏆 FlagshipBudget-Friendly

Llama 3.1 405B

by Meta (via Together AI)

Largest open-source model rivaling proprietary alternatives

Try Meta (via Together AI)

Input Price

$3.5

per 1M tokens

Output Price

$3.5

per 1M tokens

Context Window

128K

tokens (32.768K max output)

Specifications

Release Date2024-07-23
Category🏆 Flagship
Context Window128,000 tokens
Max Output32,768 tokens
Input Cost$3.5 / 1M tokens
Output Cost$3.5 / 1M tokens
Price TierBudget-Friendly

Capabilities

textcodereasoning

Related Use Cases

Monthly Cost Estimates

Usage LevelDaily TokensDailyMonthlyYearly
Light~50 requests/day100K in / 20K out$0.42$12.60$153
Medium~200 requests/day500K in / 100K out$2.10$63.00$767
Heavy~1K requests/day2,000K in / 500K out$8.75$262.50$3194
Enterprise~5K requests/day10,000K in / 2,000K out$42.00$1260.00$15330

Cost Calculator

Alternatives to Llama 3.1 405B

Frequently Asked Questions

How much does Llama 3.1 405B API cost per million tokens?
Llama 3.1 405B costs $3.5 per million input tokens and $3.5 per million output tokens as of 2026. These are the standard API rates from Meta (via Together AI).
What is the Llama 3.1 405B context window?
Llama 3.1 405B supports a 128K context window (128,000 tokens), which means you can process up to 128K tokens in a single API call.
How much does Llama 3.1 405B cost per month?
At medium usage (~200 requests/day with 500K input and 100K output tokens/day), Llama 3.1 405B costs approximately $63.00/month. Light usage runs about $12.60/month, and heavy usage (~1K requests/day) around $262.50/month.
Is there a cheaper alternative to Llama 3.1 405B?
Yes — Claude Haiku 4.5 by Anthropic is a cheaper option at $1/M input tokens vs $3.5/M for Llama 3.1 405B. Other budget alternatives include models in the efficient tier.
Is Llama 3.1 405B good for ai agent / agentic workflows?
Llama 3.1 405B is a flagship model with support for text, code, reasoning. For ai agent / agentic workflows, it offers 128K context and costs $3.5/M input tokens — a budget-friendly choice for this use case.

Llama 3.1 405B Comparisons

Ready to use Llama 3.1 405B?

Get started with Meta (via Together AI)'s API — free tier available for most models.

Try Meta (via Together AI) API →