LLM Cost Calculator

Google Gemini API Pricing 2026

Gemini 3.5 Flash is the latest frontier model. The 2.5 series offers excellent price/performance with 1M token context. Gemma 4 open-weights models are among the best open-source options.

Pricing verified 2026-06-06. Sourced from ai.google.dev.

Get Google Gemini API access →

Google Gemini Model Pricing

Prices in USD per 1M tokens

ModelInput / 1MOutput / 1MContext
Gemini 3.5 Flash
Latest Gemini; frontier intelligence + superior search & grounding
$1.5$91,000,000
Gemini 3.1 Flash-Lite
Most cost-efficient Gemini; high-volume agentic tasks
$0.25$1.51,000,000
Gemini 3.1 Pro Preview
Advanced multimodal + agentic capabilities
$2$121,000,000
Gemma 4 26B A4B
Gemma 4 MoE 26B (4B active); fast open-weights model
$0.06$0.33262,144
Gemma 4 31B
Gemma 4 dense 31B; top open-weights model from Google
$0.12$0.36262,144
Gemini 2.5 Flash
Hybrid reasoning; 1M context with thinking budgets
$0.3$2.51,000,000
Gemini 2.5 Flash-Lite
Smallest & cheapest Gemini 2.5; built for scale
$0.1$0.41,000,000
Gemini 2.5 Pro
Stable release; strong coding & complex reasoning
$1.25$101,000,000
Gemma 3n 4B
Auto-added 2026-06
$0.06$0.1232,768
Gemma 3 12B
Auto-added 2026-06
$0.04$0.13131,072
Gemma 3 27B
Auto-added 2026-06
$0.08$0.16131,072
Gemma 3 4B
Auto-added 2026-06
$0.04$0.08131,072

Estimated Monthly Cost (70% input / 30% output split)

Model1M tokens/mo10M tokens/mo100M tokens/mo1B tokens/mo
Gemini 3.5 Flash$3.75$37.50$375$3,750
Gemini 3.1 Flash-Lite$0.625$6.25$62.50$625
Gemini 3.1 Pro Preview$5.00$50.00$500$5,000
Gemma 4 26B A4B $0.141$1.41$14.10$141
Gemma 4 31B$0.192$1.92$19.20$192
Gemini 2.5 Flash$0.960$9.60$96.00$960
Gemini 2.5 Flash-Lite$0.190$1.90$19.00$190
Gemini 2.5 Pro$3.88$38.75$388$3,875
Gemma 3n 4B$0.078$0.780$7.80$78.00
Gemma 3 12B$0.067$0.670$6.70$67.00
Gemma 3 27B$0.104$1.04$10.40$104
Gemma 3 4B$0.052$0.520$5.20$52.00

Frequently Asked Questions

How much does Google Gemini LLM API cost?

Google Gemini offers 12 models ranging from $0.040/1M to $2.00/1M input tokens. Gemini 3.5 Flash is the latest frontier model. The 2.5 series offers excellent price/performance with 1M token context. Gemma 4 open-weights models are among the best open-source options.

Is Google Gemini cheaper than self-hosting?

For low-volume workloads (under 100M tokens/month), cloud APIs like Google Gemini are almost always cheaper than purchasing and maintaining GPU hardware. Use our calculator to find the exact break-even point for your usage.