
Efficient AI Models — Pricing & Comparison

Budget-friendly models optimized for speed and cost-effectiveness. Ideal for high-volume tasks, classification, extraction, and simple generation.

Models: 25
Providers: 8
Cheapest (per M input tokens): $0.05
Average (per M input tokens): $0.3398

All Efficient Models — Sorted by Price

| # | Model (date) | Provider | Input $/M | Output $/M | Context | 1M Requests* |
|---|---|---|---|---|---|---|
| 1 | GPT-5 nano (2025-08-07) | OpenAI | $0.05 | $0.4 | 128K | $250 |
| 2 | Mistral Small 3.2 (2025-12-02) | Mistral AI | $0.06 | $0.18 | 128K | $150 |
| 3 | Gemini 2.0 Flash-Lite (2025-02-05) | Google | $0.075 | $0.3 | 1000K | $225 |
| 4 | GPT-4.1 nano (2025-04-14) | OpenAI | $0.1 | $0.4 | 128K | $300 |
| 5 | Gemini 2.0 Flash (2024-12-11) | Google | $0.1 | $0.4 | 1000K | $300 |
| 6 | Gemini 2.5 Flash-Lite (2025-06-17) | Google | $0.1 | $0.4 | 1000K | $300 |
| 7 | GPT-4o mini (2024-07-18) | OpenAI | $0.15 | $0.6 | 128K | $450 |
| 8 | Mistral Small 4 (2026-03-18) | Mistral AI | $0.15 | $0.6 | 128K | $450 |
| 9 | Command R (2024-03-11) | Cohere | $0.15 | $0.6 | 128K | $450 |
| 10 | Llama 3.1 8B (2024-07-23) | Meta (via Together AI) | $0.18 | $0.18 | 128K | $270 |
| 11 | GPT-5.4 nano (2026-03-06) | OpenAI | $0.2 | $1.25 | 128K | $825 |
| 12 | Grok 4.1 Fast (2026-01-15) | xAI | $0.2 | $0.5 | 2000K | $450 |
| 13 | GPT-5 mini (2025-08-07) | OpenAI | $0.25 | $2 | 500K | $1,250 |
| 14 | Gemini 3.1 Flash-Lite Preview (2026-03-03) | Google | $0.25 | $1.5 | 1000K | $1,000 |
| 15 | Gemini 3.1 Flash-Lite (2026-03-03) | Google | $0.25 | $1.5 | 1000K | $1,000 |
| 16 | DeepSeek V3.2 (2025-12-01) | DeepSeek | $0.28 | $0.42 | 128K | $490 |
| 17 | Gemini 2.5 Flash (2025-05-20) | Google | $0.3 | $2.5 | 1000K | $1,550 |
| 18 | Grok 3 Mini (2025-02-17) | xAI | $0.3 | $0.5 | 128K | $550 |
| 19 | GPT-4.1 mini (2025-04-14) | OpenAI | $0.4 | $1.6 | 200K | $1,200 |
| 20 | Devstral 2 (2025-12-09) | Mistral AI | $0.4 | $2 | 262K | $1,400 |
| 21 | Gemini 3 Flash (2025-12-17) | Google | $0.5 | $3 | 1000K | $2,000 |
| 22 | GPT-5.4 mini (2026-03-06) | OpenAI | $0.75 | $4.5 | 1050K | $3,000 |
| 23 | Claude 3.5 Haiku (2024-11-04) | Anthropic | $0.8 | $4 | 200K | $2,800 |
| 24 | Claude Haiku 4.5 (2025-10-15) | Anthropic | $1 | $5 | 200K | $3,500 |
| 25 | Codex Mini (2026-02-02) | OpenAI | $1.5 | $6 | 200K | $4,500 |

* Estimated cost for 1M requests at 1,000 input + 500 output tokens each.

Calculate Your Efficient Model Costs
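
The estimate behind the table's last column can be reproduced with a few lines of arithmetic. The sketch below is a minimal illustration, not the page's own calculator: the function name `estimate_cost` and its parameters are assumptions made for this example, and the default token counts mirror the footnote (1,000 input + 500 output tokens per request).

```python
def estimate_cost(input_price_per_m: float, output_price_per_m: float,
                  input_tokens: int = 1_000, output_tokens: int = 500,
                  requests: int = 1) -> float:
    """Return the estimated cost in USD for `requests` identical requests.

    Prices are in USD per 1M tokens, as listed in the table.
    """
    per_request = (input_tokens * input_price_per_m
                   + output_tokens * output_price_per_m) / 1_000_000
    return per_request * requests


# GPT-5 nano prices from the table: $0.05 input, $0.40 output per 1M tokens.
print(f"{estimate_cost(0.05, 0.40):.6f}")                      # one request
print(f"{estimate_cost(0.05, 0.40, requests=1_000_000):.2f}")  # 1M requests
```

With GPT-5 nano's listed prices, one request comes to $0.000250, or about $250 for a million requests. Changing `input_tokens` and `output_tokens` to match your own workload matters: output tokens are priced several times higher than input tokens for most models in the table.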

💡 Tips for Using Efficient Models

Efficient models can handle 10-100x more requests than flagship models for the same budget

Great for preprocessing, classification, summarization, and data extraction

Consider using efficient models as a "first pass" filter before sending complex items to flagship models
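
The "first pass" idea can be sketched as a simple two-tier router. Everything here is hypothetical: `route`, the 0.8 confidence threshold, and the two toy model functions are stand-ins invented for illustration, and real APIs do not necessarily return a confidence score, so in practice you would need your own signal (for example, a self-reported score or a validation check on the output).

```python
from typing import Callable, Tuple

# A model call returns (answer, confidence in [0, 1]). Both callables are
# hypothetical stand-ins for real API clients.
ModelFn = Callable[[str], Tuple[str, float]]


def route(item: str, cheap: ModelFn, flagship: ModelFn,
          threshold: float = 0.8) -> Tuple[str, str]:
    """Return (answer, tier), escalating only when the cheap model is unsure."""
    answer, confidence = cheap(item)
    if confidence >= threshold:
        return answer, "efficient"
    answer, _ = flagship(item)
    return answer, "flagship"


# Toy stand-ins so the sketch runs without network access.
def toy_cheap(text: str) -> Tuple[str, float]:
    # Pretend short inputs are easy and long ones are uncertain.
    return ("positive", 0.95) if len(text) < 40 else ("unknown", 0.30)


def toy_flagship(text: str) -> Tuple[str, float]:
    return ("negative", 0.99)


print(route("Great product!", toy_cheap, toy_flagship))  # handled by the cheap tier
print(route("x" * 100, toy_cheap, toy_flagship))         # escalated to the flagship tier
```

The savings depend on how often the cheap tier is confident enough to answer on its own, so the threshold is worth tuning against a labeled sample of your own traffic.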


Frequently Asked Questions

What are efficient AI models best at?

Text classification, entity extraction, summarization, simple Q&A, translation, and other well-defined tasks. They excel when the task is straightforward and volume is high.

How much cheaper are efficient models?

Typically 10-50x cheaper per token than flagship models. For example, GPT-4o mini costs $0.15 per million input tokens, a fraction of the price of GPT-5.2, which makes it viable for millions of daily requests.

Will I notice a quality difference?

For simple tasks, often not. For nuanced creative work or complex reasoning, yes. The key is matching model capability to task complexity.
