Google dropped Gemini 3.1 Pro with a headline that should make every AI budget holder pay attention: 77.1% on ARC-AGI-2 — more than double the reasoning performance of its predecessor — at the exact same price of $2.00/$12.00 per million tokens. If you're already running Gemini 3 Pro in production, this is a free upgrade that delivers flagship-tier reasoning without touching your bill.
That benchmark score matters. ARC-AGI-2 tests a model's ability to solve entirely new logic patterns it hasn't seen during training. Scoring 77.1% puts Gemini 3.1 Pro in direct competition with dedicated reasoning models like OpenAI's o3 and Anthropic's Claude Opus 4.6 — models that cost significantly more per token. Google is essentially giving away a reasoning upgrade that competitors would charge a premium for.
This move isn't just a product update. It's a pricing strategy that pressures every other provider to either match the reasoning gains or cut their prices. Let's break down what this means for your costs and how Gemini 3.1 Pro stacks up against every major competitor.
📊 Stat: 2× reasoning — Gemini 3.1 Pro's improvement over its predecessor on ARC-AGI-2, at zero additional cost
The pricing: unchanged and competitive
Gemini 3.1 Pro maintains the same pricing as Gemini 3 Pro:
| Context usage | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
| Under 200K tokens | $2.00 | $12.00 |
| 200K – 1M tokens | $4.00 | $18.00 |
The context window stays at 1M input tokens with 64K output tokens. Available through the Gemini API, Google AI Studio, Vertex AI, the Gemini app, and NotebookLM.
The tiered pricing for long-context usage is worth noting — if you regularly use more than 200K tokens of context, your effective input price doubles to $4.00/M. Plan your architecture around this threshold to avoid unexpected cost increases.
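The tier boundary is easy to encode when estimating costs. Here is a minimal sketch, assuming the tier is selected per request by prompt size as the table above implies; the function and its name are illustrative, not part of any Google SDK:

```python
def gemini_31_pro_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate USD cost of one Gemini 3.1 Pro request under tiered pricing."""
    if input_tokens <= 200_000:
        input_rate, output_rate = 2.00, 12.00   # under-200K tier, per 1M tokens
    else:
        input_rate, output_rate = 4.00, 18.00   # 200K-1M tier, per 1M tokens
    return (input_tokens / 1_000_000) * input_rate \
         + (output_tokens / 1_000_000) * output_rate

# Crossing the 200K threshold more than doubles the input bill per request:
print(round(gemini_31_pro_cost(190_000, 4_000), 4))  # 0.428
print(round(gemini_31_pro_cost(250_000, 4_000), 4))  # 1.072
```

Note how a 250K-token prompt costs well over twice a 190K-token one, even though it has only about 30% more input tokens.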
How Gemini 3.1 Pro compares to every major model
Here's where Gemini 3.1 Pro sits in the current pricing landscape:
| Model | Input / 1M tokens | Output / 1M tokens | Context window | Category |
|---|---|---|---|---|
| DeepSeek V3.2 | $0.28 | $0.42 | 128,000 | Budget |
| Mistral Large 3 | $0.50 | $1.50 | 256,000 | Mid-tier |
| GPT-5 | $1.25 | $10.00 | 1,000,000 | Flagship |
| GPT-5.2 | $1.75 | $14.00 | 1,000,000 | Flagship |
| Gemini 3.1 Pro | $2.00 | $12.00 | 1,000,000 | Flagship |
| Claude Sonnet 4.6 | $3.00 | $15.00 | 1,000,000 | Balanced |
| Grok 4 | $3.00 | $15.00 | 256,000 | Reasoning |
| Claude Opus 4.6 | $5.00 | $25.00 | 200,000 | Premium |
💡 Key Takeaway: Gemini 3.1 Pro is positioned right in the middle of the flagship tier — more expensive than GPT-5 ($1.25/$10.00) but cheaper than Claude Sonnet 4.6 ($3.00/$15.00). The reasoning upgrade makes it competitive with models that cost 2-3× more.
GPT-5 is still cheaper on raw per-token cost — $1.25/$10.00 vs $2.00/$12.00. But if Gemini 3.1 Pro's reasoning improvements mean you can retire a premium reasoning model like Claude Opus 4.6 ($5.00/$25.00), the effective savings are substantial. Against o3 ($2.00/$8.00) the per-token math is closer to a wash; the case there rests on consolidating onto a single model rather than on raw rates.
What this means for your costs: zero-cost upgrade scenario
If you're already on Gemini 3 Pro, the math is simple — nothing changes on your bill. Here's a real scenario to illustrate.
Document analysis pipeline: 50K requests/day, averaging 2,000 input / 500 output tokens:
- Daily input tokens: 50K × 2,000 = 100M tokens
- Daily output tokens: 50K × 500 = 25M tokens
- Monthly input cost (30 days): 3B tokens × $2.00/M = $6,000
- Monthly output cost: 750M tokens × $12.00/M = $9,000
- Total: $15,000/month — identical on 3 Pro and 3.1 Pro
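The bullets above reduce to a few lines of arithmetic. This sketch assumes a 30-day month and flat under-200K pricing, since each request is far below the tier threshold:

```python
requests_per_day = 50_000
input_tokens, output_tokens = 2_000, 500       # per request
input_rate, output_rate = 2.00, 12.00          # USD per 1M tokens

monthly_input = requests_per_day * input_tokens * 30    # 3.0B tokens
monthly_output = requests_per_day * output_tokens * 30  # 750M tokens

input_cost = monthly_input / 1_000_000 * input_rate     # $6,000
output_cost = monthly_output / 1_000_000 * output_rate  # $9,000
print(f"${input_cost + output_cost:,.0f}/month")        # $15,000/month
```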
The difference: 3.1 Pro handles complex edge cases that 3 Pro fumbled. If your pipeline currently falls back to a more expensive reasoning model for 10% of requests, eliminating those fallbacks saves real money.
📊 Quick Math: If 10% of your 50K daily requests currently fall back from Gemini 3 Pro to Claude Opus 4.6 for reasoning-heavy tasks, and Gemini 3.1 Pro eliminates those fallbacks, you save approximately $2,500/month in avoided premium model costs.
Fallback savings calculation
| Metric | Before (3 Pro + fallback) | After (3.1 Pro only) |
|---|---|---|
| Regular requests (90%) | $13,500/month on Gemini 3 Pro | $15,000/month on Gemini 3.1 Pro (100% of traffic) |
| Fallback requests (10%) | $4,000/month on Claude Opus 4.6 | $0 (handled by 3.1 Pro) |
| Total | $17,500/month | $15,000/month |
| Monthly savings | | $2,500 |
Detailed cost comparison across workloads
Let's compare Gemini 3.1 Pro against the most popular alternatives across four workload types at 100K requests/month:
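Every table below follows the same formula: monthly cost = tokens × rate. A sketch of the helper used conceptually throughout, with prices taken from the comparison table earlier in the article:

```python
def monthly_cost(reqs: int, tok_in: int, tok_out: int,
                 rate_in: float, rate_out: float) -> float:
    """USD per month for a workload at flat per-1M-token pricing."""
    return (reqs * tok_in / 1e6) * rate_in + (reqs * tok_out / 1e6) * rate_out

# Workload 1 (simple chat) on Gemini 3.1 Pro:
print(monthly_cost(100_000, 1_000, 400, 2.00, 12.00))  # 680.0
```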
Workload 1: Simple chat (1,000 in / 400 out)
| Model | Monthly cost |
|---|---|
| DeepSeek V3.2 | $45 |
| Mistral Large 3 | $110 |
| GPT-5 | $525 |
| Gemini 3.1 Pro | $680 |
| Claude Sonnet 4.6 | $900 |
| Claude Opus 4.6 | $1,500 |
For simple chat, Gemini 3.1 Pro is more expensive than GPT-5. The reasoning upgrade doesn't help much here — you're paying for capability you don't need.
Workload 2: Complex analysis (3,000 in / 2,000 out)
| Model | Monthly cost |
|---|---|
| DeepSeek V3.2 | $168 |
| Mistral Large 3 | $450 |
| GPT-5 | $2,375 |
| Gemini 3.1 Pro | $3,000 |
| Claude Sonnet 4.6 | $3,900 |
| Claude Opus 4.6 | $6,500 |
Here Gemini 3.1 Pro is 23% cheaper than Claude Sonnet and 54% cheaper than Claude Opus, while offering competitive reasoning.
Workload 3: Long document processing (15,000 in / 3,000 out)
| Model | Monthly cost |
|---|---|
| DeepSeek V3.2 | $546 |
| GPT-5 | $4,875 |
| Gemini 3.1 Pro | $6,600 |
| Claude Sonnet 4.6 | $9,000 |
| Claude Opus 4.6 | $15,000 |
⚠️ Warning: For long document processing that exceeds 200K context tokens per request, Gemini 3.1 Pro's pricing jumps to $4.00/$18.00 — doubling input cost and increasing output by 50%. At that tier, the cost advantage over Claude Sonnet 4.6 ($3.00/$15.00) disappears. Keep your context under 200K tokens when possible, or factor the tiered pricing into your estimates.
Workload 4: Reasoning-heavy tasks (5,000 in / 3,000 out)
This is where the 3.1 Pro upgrade matters most:
| Model | Monthly cost | Notes |
|---|---|---|
| DeepSeek V3.2 | $266 | Budget reasoning |
| o3 | $3,400 | Dedicated reasoning model |
| Gemini 3.1 Pro | $4,600 | New reasoning capability |
| Claude Opus 4.6 | $10,000 | Premium reasoning |
| o3-pro | $26,000 | Maximum reasoning |
Gemini 3.1 Pro sits between o3 and Claude Opus on both price and reasoning capability. For teams that were paying Claude Opus prices for reasoning tasks, switching to Gemini 3.1 Pro saves 54%.
The competitive pressure: what happens next
Google's strategy is clear: improve capability while holding the price line. This creates a cascading effect across the industry:
Short-term impacts:
- Teams on Gemini 3 Pro upgrade immediately (free reasoning boost)
- Teams on Claude Opus 4.6 or o3 re-evaluate whether Gemini 3.1 Pro's reasoning is sufficient at a lower price
- Anthropic and OpenAI face pressure to either improve their mid-tier models or cut prices
Historical precedent: This is exactly what happened when GPT-4o's pricing dropped to $2.50/$10.00 — it forced price cuts across the board. Within six months, every major provider had adjusted their pricing strategy. Gemini 3.1 Pro's reasoning leap may trigger a similar round of competitive adjustments.
For developers, the strategic move is clear: don't lock into any single provider. Build a model-agnostic abstraction layer that lets you swap providers when the next price-performance improvement drops. The team that can switch models in a day captures savings that teams locked into one ecosystem miss.
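As a sketch of that abstraction layer — the client classes and the `complete` signature are hypothetical stand-ins for real vendor SDK calls:

```python
from typing import Protocol

class ChatModel(Protocol):
    """Any provider client that can answer a prompt."""
    def complete(self, prompt: str) -> str: ...

class GeminiClient:
    def complete(self, prompt: str) -> str:
        return f"[gemini] response to: {prompt}"   # stand-in for the real API call

class OpenAIClient:
    def complete(self, prompt: str) -> str:
        return f"[openai] response to: {prompt}"   # stand-in for the real API call

# Swapping providers becomes a one-line config change, not a code rewrite:
ACTIVE_MODEL: ChatModel = GeminiClient()

def answer(prompt: str) -> str:
    return ACTIVE_MODEL.complete(prompt)
```

The `Protocol` keeps application code decoupled from any one vendor's SDK, which is what makes a one-day provider switch realistic.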
Google's broader Gemini pricing strategy
Gemini 3.1 Pro is one piece of a tiered strategy that covers every price point:
| Model | Input / 1M | Output / 1M | Use case |
|---|---|---|---|
| Gemini 2.5 Flash-Lite | $0.10 | $0.40 | High-volume simple tasks |
| Gemini 2.5 Flash | $0.15 | $0.60 | Budget general purpose |
| Gemini 3 Flash | $0.50 | $3.00 | Fast mid-tier |
| Gemini 2.5 Pro | $1.25 | $10.00 | Previous flagship |
| Gemini 3.1 Pro | $2.00 | $12.00 | Current flagship + reasoning |
💡 Key Takeaway: Google's pricing ladder gives you options at every budget level. For cost-sensitive workloads, Gemini 2.5 Flash at $0.15/$0.60 is 13× cheaper than 3.1 Pro on input. Use the flagship only when you need its reasoning capabilities.
A tiered routing strategy using Google's own model family could look like:
- Simple classification/chat → Gemini 2.5 Flash-Lite ($0.10/$0.40)
- Standard workloads → Gemini 2.5 Flash ($0.15/$0.60)
- Complex analysis → Gemini 3.1 Pro ($2.00/$12.00)
Depending on your traffic mix, this approach can cut your blended cost by roughly 60-80% compared to using 3.1 Pro for everything.
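A toy version of that router, assuming prompt length as the complexity signal — the thresholds are illustrative and the model identifiers are placeholders, not confirmed API model names:

```python
# Ordered tiers: (max prompt tokens, (model id, $/1M in, $/1M out))
ROUTES = [
    (500,   ("gemini-2.5-flash-lite", 0.10, 0.40)),   # simple classification/chat
    (3_000, ("gemini-2.5-flash",      0.15, 0.60)),   # standard workloads
]
FALLBACK = ("gemini-3.1-pro", 2.00, 12.00)            # complex analysis

def route(prompt_tokens: int) -> str:
    """Pick a model tier by prompt size (a stand-in complexity signal)."""
    for max_tokens, (model, _in_rate, _out_rate) in ROUTES:
        if prompt_tokens <= max_tokens:
            return model
    return FALLBACK[0]

print(route(200))    # gemini-2.5-flash-lite
print(route(1_500))  # gemini-2.5-flash
print(route(8_000))  # gemini-3.1-pro
```

Real routers typically classify by task type or intent rather than raw length, but the cost structure is the same: send only genuinely hard requests to the flagship.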
Should you switch to Gemini 3.1 Pro?
Switch immediately if: You're already on Gemini 3 Pro. It's a free upgrade with better reasoning.
Evaluate switching if: You're using Claude Opus 4.6 ($5.00/$25.00) or Grok 4 ($3.00/$15.00) for reasoning tasks. Gemini 3.1 Pro offers competitive reasoning at lower pricing. Run a benchmark on your specific tasks to confirm quality parity.
Stay where you are if: You're on GPT-5 ($1.25/$10.00) and don't need the reasoning boost — GPT-5 is still cheaper for standard workloads. Or if you're on DeepSeek V3.2 ($0.28/$0.42) and need pure budget performance.
Use our model comparison tool or the AI Cost Check calculator to run the numbers for your exact workload.
✅ TL;DR: Gemini 3.1 Pro doubles reasoning performance at the same $2.00/$12.00 price. It's a free upgrade for existing Gemini 3 Pro users and a cost-effective alternative to Claude Opus 4.6 and o3 for reasoning-heavy tasks. For standard workloads, GPT-5 remains cheaper. For budget workloads, DeepSeek V3.2 and Mistral are still the cost leaders.
Frequently asked questions
Is Gemini 3.1 Pro more expensive than GPT-5?
Yes, slightly. Gemini 3.1 Pro costs $2.00/$12.00 per million tokens versus GPT-5's $1.25/$10.00 — that's 60% more on input and 20% more on output. However, Gemini 3.1 Pro's reasoning capabilities may eliminate the need for expensive dedicated reasoning models, which could reduce your overall spend. For standard non-reasoning tasks, GPT-5 is the cheaper choice.
Does Gemini 3.1 Pro have tiered pricing for long context?
Yes. Under 200K context tokens, pricing is $2.00/$12.00 per million. For requests using 200K–1M context tokens, pricing increases to $4.00/$18.00 — double on input, 50% more on output. This tiered structure means long-context workloads are significantly more expensive. Plan your prompts to stay under 200K when possible, or factor the higher tier into your budget.
How does Gemini 3.1 Pro compare to Claude Opus 4.6 for reasoning?
Gemini 3.1 Pro scores 77.1% on ARC-AGI-2, putting it in competitive range with top reasoning models. At $2.00/$12.00 versus Claude Opus 4.6's $5.00/$25.00, Gemini 3.1 Pro is 60% cheaper on input and 52% cheaper on output. For teams paying Opus-tier prices for reasoning, switching to Gemini 3.1 Pro saves approximately $5,000/month on a 100K request/month workload. Test on your specific tasks to verify quality parity before switching.
Should I upgrade from Gemini 3 Pro to 3.1 Pro?
Absolutely, and immediately. The pricing is identical — $2.00/$12.00 — so you get doubled reasoning performance at zero additional cost. There's no downside. Update your API calls to use the 3.1 Pro model identifier and you'll see improved handling of complex logic, math, and multi-step reasoning tasks with no impact on your bill.
What's the cheapest Google model for simple tasks?
Gemini 2.5 Flash-Lite at $0.10/$0.40 per million tokens is Google's most affordable option, with a 1M context window. For slightly more capability, Gemini 2.5 Flash at $0.15/$0.60 adds audio support and better code generation. Both are 10-20× cheaper than Gemini 3.1 Pro and sufficient for classification, simple chat, and basic summarization. Check our budget model comparison for more options.
