⚡ Efficient · Very Cheap

Llama 3.1 8B

by Meta (via Together AI)

Lightweight open-source model for simple tasks

Try Meta (via Together AI)

Input Price

$0.18

per 1M tokens

Output Price

$0.18

per 1M tokens

Context Window

128K

tokens (32,768 max output)

Specifications

Release Date: 2024-07-23
Category: ⚡ Efficient
Context Window: 128,000 tokens
Max Output: 32,768 tokens
Input Cost: $0.18 / 1M tokens
Output Cost: $0.18 / 1M tokens
Price Tier: Very Cheap

Capabilities

text, code

Monthly Cost Estimates

Usage Level | Requests | Daily Tokens | Daily | Monthly | Yearly
Light | ~50 requests/day | 100K in / 20K out | $0.02 | $0.65 | $8
Medium | ~200 requests/day | 500K in / 100K out | $0.11 | $3.24 | $39
Heavy | ~1K requests/day | 2,000K in / 500K out | $0.45 | $13.50 | $164
Enterprise | ~5K requests/day | 10,000K in / 2,000K out | $2.16 | $64.80 | $788

Cost Calculator
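
The estimates in the table above follow from simple arithmetic on the per-token rates. The sketch below is one way to reproduce them. The function names are illustrative, and the 30-day month and 365-day year are assumptions inferred from the table's figures, not documented billing rules.

```python
# Rates taken from the pricing figures on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.18
OUTPUT_PRICE_PER_M = 0.18

# Assumption: the table uses a 30-day month and a 365-day year.
DAYS_PER_MONTH = 30
DAYS_PER_YEAR = 365


def daily_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one day's input and output token volume."""
    return (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000


def estimate(input_tokens: int, output_tokens: int) -> dict[str, float]:
    """Return daily, monthly, and yearly USD estimates for a daily volume."""
    day = daily_cost(input_tokens, output_tokens)
    return {
        "daily": day,
        "monthly": day * DAYS_PER_MONTH,
        "yearly": day * DAYS_PER_YEAR,
    }


if __name__ == "__main__":
    # Daily token volumes (input, output) for the four usage levels in the table.
    tiers = {
        "Light": (100_000, 20_000),
        "Medium": (500_000, 100_000),
        "Heavy": (2_000_000, 500_000),
        "Enterprise": (10_000_000, 2_000_000),
    }
    for name, (tokens_in, tokens_out) in tiers.items():
        cost = estimate(tokens_in, tokens_out)
        print(
            f"{name}: ${cost['daily']:.2f}/day, "
            f"${cost['monthly']:.2f}/month, ${cost['yearly']:.0f}/year"
        )
```

Because input and output are priced identically here, only the total token count matters. Keeping the two rates as separate constants makes the sketch reusable for models that price them differently.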

Frequently Asked Questions

How much does Llama 3.1 8B API cost per million tokens?
Llama 3.1 8B costs $0.18 per million input tokens and $0.18 per million output tokens as of 2026. These are the standard API rates from Meta (via Together AI).
What is the Llama 3.1 8B context window?
Llama 3.1 8B supports a 128K context window (128,000 tokens), so a single API call can process up to 128,000 tokens, with output capped at 32,768 tokens.
How much does Llama 3.1 8B cost per month?
At medium usage (~200 requests/day with 500K input and 100K output tokens/day), Llama 3.1 8B costs approximately $3.24/month. Light usage runs about $0.65/month, and heavy usage (~1K requests/day) around $13.50/month.
Is there a cheaper alternative to Llama 3.1 8B?
Yes. Mistral Small 3.2 by Mistral AI is cheaper at $0.06 per million input tokens, versus $0.18 per million for Llama 3.1 8B. Other budget alternatives can be found among the models in the Efficient tier.
Is Llama 3.1 8B good for AI chatbots?
Llama 3.1 8B is an efficient model that supports text and code. For AI chatbots, it offers a 128K context window at $0.18 per million input tokens, which makes it a budget-friendly choice for this use case.
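
The context-window answer above can be turned into a quick pre-flight check before sending a request. This is a minimal sketch under two assumptions: the prompt and the generated output share the 128,000-token window, and token counts are supplied by the caller (for example, from a tokenizer). The function names are hypothetical and not part of any provider's API.

```python
# Limits listed in the specifications on this page.
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 32_768


def max_prompt_tokens(requested_output: int) -> int:
    """Return the largest prompt that leaves room for the requested output.

    Assumption: prompt and output tokens share one context window.
    """
    if not 0 < requested_output <= MAX_OUTPUT:
        raise ValueError(f"requested_output must be between 1 and {MAX_OUTPUT}")
    return CONTEXT_WINDOW - requested_output


def fits(prompt_tokens: int, requested_output: int) -> bool:
    """Return True if the prompt plus the requested output fits in the window."""
    return prompt_tokens <= max_prompt_tokens(requested_output)
```

For example, reserving the full 32,768 output tokens leaves `max_prompt_tokens(32_768)`, or 95,232 tokens, for the prompt under this assumption.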

Ready to use Llama 3.1 8B?

Get started with the Meta (via Together AI) API. A free tier is available for most models.