Question 1

How much does Gemini 3.1 Pro cost per token?

Accepted Answer

Gemini 3.1 Pro costs $2.00 per 1M input tokens and $12.00 per 1M output tokens for contexts up to 200K tokens. Above 200K tokens of context, pricing rises to $4.00 input / $18.00 output per 1M tokens. Cached input drops to $0.20 per 1M, a 90% discount on the standard input rate. At these rates, Gemini 3.1 Pro is priced level with GPT-5.4.
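To make the tiering concrete, here is a minimal Python sketch of a per-request cost estimate using the rates quoted above. The function name `request_cost` is hypothetical. It also assumes that prompt size alone selects the tier and that the higher rate then applies to every token in the request. The answer above does not state how the boundary is billed, so treat that as an assumption to confirm against Google's pricing documentation.

```python
# Rates in USD per 1M tokens, copied from the answer above.
TIER_THRESHOLD = 200_000  # tokens of context

RATES = {
    "standard": {"input": 2.00, "output": 12.00},  # context up to 200K tokens
    "long": {"input": 4.00, "output": 18.00},      # context above 200K tokens
}


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one uncached request.

    Assumption (not confirmed by the text): the prompt size picks the tier,
    and the selected rate is applied to all tokens in the request.
    """
    tier = "long" if input_tokens > TIER_THRESHOLD else "standard"
    rate = RATES[tier]
    return (input_tokens * rate["input"] + output_tokens * rate["output"]) / 1_000_000


if __name__ == "__main__":
    print(f"50K in / 2K out:  ${request_cost(50_000, 2_000):.3f}")
    print(f"300K in / 2K out: ${request_cost(300_000, 2_000):.3f}")
```

Under these assumptions, a 50K-token prompt with a 2K-token reply comes to about $0.124, while a 300K-token prompt with the same reply comes to about $1.236.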

Question 2

Does Google Gemini have a free tier?

Accepted Answer

Partially. As of April 1, 2026, Google removed Gemini Pro models (including Gemini 3.1 Pro, 3 Pro, and 2.5 Pro) from the free tier, so they are now paid-only. Flash models (Gemini 3 Flash and 3.1 Flash-Lite) remain free with reduced daily quotas. If you need free access to a flagship-class model, the only remaining option is Claude's $5 trial credits.

Question 3

What changed on April 1, 2026?

Accepted Answer

Google removed all Pro-tier models from the free API tier — Gemini 3.1 Pro, Gemini 3 Pro, and Gemini 2.5 Pro now require billing to be enabled. Flash and Flash-Lite remain free but with tightened daily quotas. Google did not publish a formal changelog; the change surfaced through API error messages and updated pricing documentation.

Question 4

How does Gemini compare to OpenAI and Claude?

Accepted Answer

At the flagship tier, Gemini 3.1 Pro ($2.00/$12.00) matches GPT-5.4 on pricing and undercuts Claude Opus 4.8 ($5.00/$25.00) by 60% on input and about 52% on output. Gemini still wins on context window (2M tokens vs 1M for Claude Opus 4.8 and 270K for GPT-5.4) and vision resolution. For budget volume, Flash-Lite at $0.25/$1.50 is the cheapest Tier-1 budget model on the market.
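The gap with Claude Opus 4.8 can be checked separately for input and output prices. The short Python sketch below does that arithmetic using only the figures quoted above. The helper name `percent_cheaper` is illustrative, not part of any library.

```python
def percent_cheaper(price: float, reference: float) -> float:
    """Return how much lower `price` is than `reference`, in percent."""
    return (reference - price) / reference * 100


# USD per 1M tokens, as quoted in the answer above.
gemini_3_1_pro = {"input": 2.00, "output": 12.00}
claude_opus_4_8 = {"input": 5.00, "output": 25.00}

if __name__ == "__main__":
    for kind in ("input", "output"):
        gap = percent_cheaper(gemini_3_1_pro[kind], claude_opus_4_8[kind])
        print(f"{kind}: {gap:.0f}% cheaper")
```

The input gap works out to 60% and the output gap to 52%, so the overall saving depends on how output-heavy the workload is.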

Question 5

What Gemini models does Google offer?

Accepted Answer

Google offers Gemini 3.1 Pro (flagship, tiered pricing), Gemini 3.1 Flash-Lite (budget), Gemini 3 Pro, Gemini 3 Flash (new default), and legacy Gemini 2.5 Pro / Flash / Flash-Lite models. The 3.x family is now the recommended production tier; 2.5 remains available but is moving to legacy status.

Question 6

What is Gemini 3.1 Pro's context window?

Accepted Answer

Gemini 3.1 Pro supports a 2-million-token context window, the largest in production among Tier-1 providers. That is twice the 1M-token window Claude Opus 4.8 offers through the Claude API and about 7.4x GPT-5.4's 270K window. Pricing is tiered at 200K tokens: $2/$12 per 1M at or below that size, $4/$18 per 1M above it.
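As a rough illustration of what these window sizes mean in practice, the Python sketch below lists which of the three models could accept a prompt of a given size and computes the window ratios. The names `CONTEXT_WINDOWS` and `models_that_fit` are hypothetical, and the check looks at prompt tokens only. How each provider counts output tokens against the window is not covered here.

```python
# Context windows in tokens, as quoted in the answer above.
CONTEXT_WINDOWS = {
    "Gemini 3.1 Pro": 2_000_000,
    "Claude Opus 4.8": 1_000_000,
    "GPT-5.4": 270_000,
}


def models_that_fit(prompt_tokens: int) -> list[str]:
    """Return the models whose quoted window can hold the prompt."""
    return [name for name, window in CONTEXT_WINDOWS.items() if prompt_tokens <= window]


def window_ratio(model: str, other: str) -> float:
    """Return how many times larger `model`'s window is than `other`'s."""
    return CONTEXT_WINDOWS[model] / CONTEXT_WINDOWS[other]


if __name__ == "__main__":
    print(models_that_fit(500_000))
    print(round(window_ratio("Gemini 3.1 Pro", "GPT-5.4"), 1))
```

A 500K-token prompt fits Gemini 3.1 Pro and Claude Opus 4.8 but not GPT-5.4, and the Gemini to GPT-5.4 ratio comes out at roughly 7.4.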

Question 7

How much does Gemini context caching save?

Accepted Answer

Google offers 90% discounts on cached context. Gemini 3.1 Pro drops from $2.00 to $0.20 per 1M input tokens on cache hits. For applications with large system prompts or reference documents, caching can reduce total input cost by 80-90%.
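The 80-90% range follows from how much of the input is actually served from cache. A minimal Python sketch, assuming input is billed as a simple blend of the $2.00 standard rate and the $0.20 cached rate, and ignoring any cache storage or write charges (which the text above does not cover). The function names are illustrative.

```python
BASE_INPUT_RATE = 2.00    # USD per 1M uncached input tokens (Gemini 3.1 Pro)
CACHED_INPUT_RATE = 0.20  # USD per 1M cached input tokens


def blended_input_rate(cached_fraction: float) -> float:
    """Effective USD per 1M input tokens when `cached_fraction` hits the cache."""
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    return cached_fraction * CACHED_INPUT_RATE + (1 - cached_fraction) * BASE_INPUT_RATE


def input_savings_percent(cached_fraction: float) -> float:
    """Percentage saved on input cost compared with no caching at all."""
    return (1 - blended_input_rate(cached_fraction) / BASE_INPUT_RATE) * 100


if __name__ == "__main__":
    for fraction in (0.5, 0.9, 1.0):
        print(f"{fraction:.0%} cached: {input_savings_percent(fraction):.0f}% saved")
```

Under this simplified model, 90% of input tokens served from cache gives an 81% saving, and only a fully cached prompt reaches the full 90%. The 80-90% range therefore applies to workloads where nearly all input is a reused system prompt or reference document.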

Model	Family	Provider	Input $/1M	Cached $/1M	Output $/1M
Gemini 3.5 Flash	Gemini 3.5	Google	$1.50	$0.15	$9.00
Gemini 2.5 Pro (>200k tokens)	Gemini 2.5	Google	$2.50	$0.25	$15.00

Google Gemini API Pricing (June 2026)

Gemini 3.x (Current Production)

Gemini 2.5 (Legacy — Paid Tier Only)

Free Tier: What Changed April 1, 2026

Context Caching: 75-90% Savings

How Gemini Compares

All Google Gemini Models — Price per 1M Tokens

Price History (per-model charts: Gemini 2.5 Flash, Gemini 2.5 Flash-Lite, Gemini 2.0 Flash, Gemini 2.0 Flash-Lite, Gemini 2.5 Pro, Gemini 2.5 Pro (>200k tokens), Gemini 3 Flash, Gemini 3 Pro, Gemini 3.1 Flash-Lite, Gemini 3.5 Flash, Gemini 3.1 Pro)
