Skip to main content

Efficient AI Models — Pricing & Comparison

Budget-friendly models optimized for speed and cost-effectiveness. Ideal for high-volume tasks, classification, extraction, and simple generation.

28
Models
8
Providers
$0.05
Cheapest (per M input)
$$0.312679
Average (per M input)

All Efficient Models — Sorted by Price

#ModelProviderInput $/MOutput $/MContext1M Requests*
1GPT-5 nano
2025-08-07
OpenAI$0.05$0.4128K$$0.000250
2Gemini 2.0 Flash-Lite
2025-02-05
Google$0.075$0.31000K$$0.000225
3Llama 4 Scout
2025-04-05
Meta (via Together AI)$0.08$0.310000K$$0.000230
4GPT-4.1 nano
2025-04-14
OpenAI$0.1$0.4128K$$0.000300
5Gemini 2.0 Flash
2024-12-11
Google$0.1$0.41000K$$0.000300
6Gemini 2.5 Flash-Lite
2025-06-17
Google$0.1$0.41000K$$0.000300
7Mistral Small 3.2
2025-12-02
Mistral AI$0.1$0.3128K$$0.000250
8Ministral 3 3B
2025-12-02
Mistral AI$0.1$0.1256K$$0.000150
9DeepSeek V4 Flash
2026-04-24
DeepSeek$0.14$0.281000K$$0.000280
10GPT-4o mini
2024-07-18
OpenAI$0.15$0.6128K$$0.000450
11Mistral Small 4
2026-03-18
Mistral AI$0.15$0.6128K$$0.000450
12Ministral 3 8B
2025-12-02
Mistral AI$0.15$0.15256K$$0.000225
13Command R
2024-03-11
Cohere$0.15$0.6128K$$0.000450
14Llama 3.1 8B
2024-07-23
Meta (via Together AI)$0.18$0.18128K$$0.000270
15GPT-5.4 nano
2026-03-06
OpenAI$0.2$1.25128K$$0.000825
16Grok 4.1 Fast
2026-01-15
xAI$0.2$0.52000K$$0.000450
17GPT-5 mini
2025-08-07
OpenAI$0.25$2500K$$0.001250
18Gemini 3.1 Flash-Lite Preview
2026-03-03
Google$0.25$1.51000K$$0.001000
19DeepSeek V3.2
2025-12-01
DeepSeek$0.28$0.42128K$$0.000490
20Gemini 2.5 Flash
2025-05-20
Google$0.3$2.51000K$$0.001550
21Grok 3 Mini
2025-02-17
xAI$0.3$0.5128K$$0.000550
22GPT-4.1 mini
2025-04-14
OpenAI$0.4$1.6200K$$0.001200
23Devstral 2
2025-12-09
Mistral AI$0.4$2262K$$0.001400
24Gemini 3 Flash
2025-12-17
Google$0.5$31000K$$0.002000
25GPT-5.4 mini
2026-03-06
OpenAI$0.75$4.51050K$$0.003000
26Claude 3.5 Haiku
2024-11-04
Anthropic$0.8$4200K$$0.002800
27Claude Haiku 4.5
2025-10-15
Anthropic$1$5200K$$0.003500
28Codex Mini
2026-02-02
OpenAI$1.5$6200K$$0.004500

* Estimated cost for 1M requests at 1,000 input + 500 output tokens each.

Calculate Your Efficient Model Costs

💡 Tips for Using Efficient Models

Efficient models can handle 10-100x more requests for the same budget as flagship models

Great for preprocessing, classification, summarization, and data extraction

Consider using efficient models as a "first pass" filter before sending complex items to flagship models

Compare Efficient Models

Frequently Asked Questions

What are efficient AI models best at?

Text classification, entity extraction, summarization, simple Q&A, translation, and other well-defined tasks. They excel when the task is straightforward and volume is high.

How much cheaper are efficient models?

Typically 10-50x cheaper per token than flagship models. For example, GPT-4o mini costs a fraction of GPT-5.2, making it viable for millions of daily requests.

Will I notice a quality difference?

For simple tasks, often not. For nuanced creative work or complex reasoning, yes. The key is matching model capability to task complexity.

Browse Other Categories