Skip to main content
⚖️ BalancedVery Cheap

Llama 3.1 70B

by Meta (via Together AI)

Strong open-source model for most tasks

Try Meta (via Together AI)

Input Price

$0.88

per 1M tokens

Output Price

$0.88

per 1M tokens

Context Window

128K

tokens (32.768K max output)

Specifications

Release Date2024-07-23
Category⚖️ Balanced
Context Window128,000 tokens
Max Output32,768 tokens
Input Cost$0.88 / 1M tokens
Output Cost$0.88 / 1M tokens
Price TierVery Cheap

Capabilities

textcode

Related Use Cases

Monthly Cost Estimates

Usage LevelDaily TokensDailyMonthlyYearly
Light~50 requests/day100K in / 20K out$0.11$3.17$39
Medium~200 requests/day500K in / 100K out$0.53$15.84$193
Heavy~1K requests/day2,000K in / 500K out$2.20$66.00$803
Enterprise~5K requests/day10,000K in / 2,000K out$10.56$316.80$3854

Cost Calculator

Alternatives to Llama 3.1 70B

Frequently Asked Questions

How much does Llama 3.1 70B API cost per million tokens?
Llama 3.1 70B costs $0.88 per million input tokens and $0.88 per million output tokens as of 2026. These are the standard API rates from Meta (via Together AI).
What is the Llama 3.1 70B context window?
Llama 3.1 70B supports a 128K context window (128,000 tokens), which means you can process up to 128K tokens in a single API call.
How much does Llama 3.1 70B cost per month?
At medium usage (~200 requests/day with 500K input and 100K output tokens/day), Llama 3.1 70B costs approximately $15.84/month. Light usage runs about $3.17/month, and heavy usage (~1K requests/day) around $66.00/month.
Is there a cheaper alternative to Llama 3.1 70B?
Yes — Gemini 3.1 Flash-Lite by Google is a cheaper option at $0.25/M input tokens vs $0.88/M for Llama 3.1 70B. Other budget alternatives include models in the efficient tier.
Is Llama 3.1 70B good for ai chatbot?
Llama 3.1 70B is a balanced model with support for text, code. For ai chatbot, it offers 128K context and costs $0.88/M input tokens — a budget-friendly choice for this use case.

Llama 3.1 70B Comparisons

Ready to use Llama 3.1 70B?

Get started with Meta (via Together AI)'s API — free tier available for most models.

Try Meta (via Together AI) API →