Question 1

What does this AI API pricing comparison cover?

Accepted Answer

This page tracks pay-as-you-go API token pricing for 126 AI models across 12 providers, including OpenAI, Anthropic, Google, DeepSeek, Mistral, xAI, Cohere, Groq, Together AI, Perplexity, and Meta Llama hosts. Each row shows the input price, output price, cached-input price (where the provider offers prompt caching), and batch-discounted rate, all per million tokens. The data snapshot shown is from 2026-06-29.

Question 2

How often is the pricing data updated?

Accepted Answer

Prices are verified and reconciled daily by an automated pipeline that pulls from each provider's official pricing page and cross-checks against at least two independent sources before publishing. New models and price changes typically appear here within hours of the provider's announcement. If no provider has changed its prices, the snapshot date stays the same; the current dataset is from 2026-06-29.

Question 3

Which AI API is the cheapest right now?

Accepted Answer

As of 2026-06-29, the cheapest production model we track is LFM2 24B A2B (Together) at $0.03 per million input tokens. The cheapest flagship-class model (a top-tier model from a major lab, not a small or distilled variant) is Mistral Large 3 at $0.50 per million input tokens. The most expensive model we track is GPT-5.4 Pro at $30.00 input / $180.00 output per 1M tokens.

Question 4

How do I compare two specific AI providers like OpenAI and Anthropic?

Accepted Answer

Use the table search and filter controls above to narrow to a single provider, then sort by input or output price. For deeper provider-level pages with FAQs, methodology, and historical pricing, see /openai-pricing/, /anthropic-pricing/, /google-pricing/, /deepseek-pricing/, /mistral-ai-pricing/, /cohere-pricing/, or any other provider listed. For workload-level math, the token cost calculator at /calculators/token-cost/ runs your input/output volume against every model side-by-side.
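The workload-level math is simple enough to check by hand. The sketch below is not the calculator's actual implementation; it assumes cost is just tokens divided by one million, multiplied by the per-million price, with no caching or batch discount applied. The names `ModelPrice`, `workload_cost`, and `rank_by_cost` are illustrative, and the three prices are copied from the 2026-06-29 table on this page.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class ModelPrice:
    name: str
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens


def workload_cost(price: ModelPrice, input_tokens: int, output_tokens: int) -> float:
    """Pay-as-you-go cost in USD, ignoring cached-input and batch discounts."""
    input_cost = (input_tokens / 1_000_000) * price.input_per_m
    output_cost = (output_tokens / 1_000_000) * price.output_per_m
    return input_cost + output_cost


def rank_by_cost(
    models: List[ModelPrice], input_tokens: int, output_tokens: int
) -> List[Tuple[str, float]]:
    """Return (model name, cost) pairs, cheapest first, for one workload."""
    costs = [(m.name, workload_cost(m, input_tokens, output_tokens)) for m in models]
    return sorted(costs, key=lambda pair: pair[1])


# Prices taken from the table on this page (snapshot 2026-06-29).
MODELS = [
    ModelPrice("Claude Sonnet 4.6", 3.00, 15.00),
    ModelPrice("GPT-4o mini", 0.15, 0.60),
    ModelPrice("Claude Haiku 4.5", 1.00, 5.00),
]

if __name__ == "__main__":
    # Example workload: 10M input tokens and 2M output tokens per month.
    for name, cost in rank_by_cost(MODELS, 10_000_000, 2_000_000):
        print(f"{name}: ${cost:.2f}")
```

For that example workload, this works out to $2.70 for GPT-4o mini, $20.00 for Claude Haiku 4.5, and $60.00 for Claude Sonnet 4.6.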

Question 5

Should I pay per token via API or subscribe to ChatGPT Plus or Claude Pro?

Accepted Answer

It depends on volume and access pattern. Monthly subscriptions like ChatGPT Plus, Claude Pro, and Gemini Advanced, at roughly $20/month, are usually cheaper than API access for chat-style usage under about 5 million input tokens per month. Production workloads, agents, and anything else that needs programmatic access almost always cost less on the pay-as-you-go API. The /subscription-vs-api/ tool computes the exact break-even point for your specific volume.
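To see roughly where a break-even point comes from, here is a minimal sketch. It is not the logic of the /subscription-vs-api/ tool; it assumes a flat monthly fee, a fixed ratio of output tokens to input tokens, and no caching or batch discounts. The function name and the `output_ratio` parameter are illustrative.

```python
def break_even_input_tokens(
    subscription_usd: float,
    input_per_m: float,
    output_per_m: float,
    output_ratio: float = 0.2,
) -> float:
    """Monthly input-token volume at which API spend equals the subscription fee.

    output_ratio is the assumed number of output tokens generated per input
    token (0.2 means 1 output token for every 5 input tokens).
    """
    # Effective USD cost per 1M input tokens once the matching output is included.
    blended_per_m = input_per_m + output_ratio * output_per_m
    if blended_per_m <= 0:
        raise ValueError("prices must yield a positive blended rate")
    return subscription_usd / blended_per_m * 1_000_000


if __name__ == "__main__":
    # Claude Sonnet 4.6 prices from the table on this page: $3.00 in, $15.00 out.
    tokens = break_even_input_tokens(20.0, 3.00, 15.00, output_ratio=0.2)
    print(f"Break-even at about {tokens:,.0f} input tokens per month")
```

With these assumptions, a $20 subscription breaks even against Claude Sonnet 4.6 at about 3.3 million input tokens per month. The result is sensitive to the output ratio, because output tokens cost several times more than input tokens on most models.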

Question 6

Can I download the AI API pricing data as JSON?

Accepted Answer

Yes. The full live dataset is published at /api/pricing.json and updated whenever this page is. The schema is { id, name, family, provider, pricing: { inputPerM, cachedInputPerM, outputPerM }, status } for all 126 tracked models. The AI Pricing Guru dataset is provided for informational use. You may use it for personal, editorial, research, educational, and internal business purposes with appropriate attribution to AI Pricing Guru. Commercial redistribution, resale, republishing at scale, inclusion in a competing public dataset/API, or use as the primary data source for a commercial pricing-comparison product requires prior written permission.
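As an illustration of working with that schema, the sketch below parses an inline sample instead of fetching the live file. Several details are assumptions: that the top level is a JSON array, that `cachedInputPerM` is `null` when no cached rate is listed, and that `status` holds a string such as "active". The `id` and `status` values in the sample are hypothetical placeholders; the names and prices come from the table on this page.

```python
import json
from typing import List, Optional, TypedDict


class Pricing(TypedDict):
    inputPerM: float
    cachedInputPerM: Optional[float]  # assumed null when no cached rate exists
    outputPerM: float


class Model(TypedDict):
    id: str
    name: str
    family: str
    provider: str
    pricing: Pricing
    status: str


# Inline stand-in for the contents of /api/pricing.json.
# "id" and "status" values are hypothetical placeholders.
SAMPLE_JSON = """
[
  {"id": "example-claude-haiku", "name": "Claude Haiku 4.5", "family": "Claude 4.5",
   "provider": "Anthropic",
   "pricing": {"inputPerM": 1.00, "cachedInputPerM": 0.10, "outputPerM": 5.00},
   "status": "active"},
  {"id": "example-lfm2", "name": "LFM2 24B A2B (Together)", "family": "LFM2",
   "provider": "together",
   "pricing": {"inputPerM": 0.03, "cachedInputPerM": null, "outputPerM": 0.12},
   "status": "active"},
  {"id": "example-gpt-4o-mini", "name": "GPT-4o mini", "family": "GPT-4o",
   "provider": "OpenAI",
   "pricing": {"inputPerM": 0.15, "cachedInputPerM": 0.075, "outputPerM": 0.60},
   "status": "active"}
]
"""


def cheapest_by_input(models: List[Model], limit: int = 3) -> List[Model]:
    """Return up to `limit` models, sorted by ascending input price."""
    return sorted(models, key=lambda m: m["pricing"]["inputPerM"])[:limit]


def cached_savings(model: Model) -> Optional[float]:
    """Fraction saved on cached input versus regular input, or None if no cached rate."""
    cached = model["pricing"]["cachedInputPerM"]
    if cached is None:
        return None
    return 1 - cached / model["pricing"]["inputPerM"]


if __name__ == "__main__":
    models: List[Model] = json.loads(SAMPLE_JSON)
    for m in cheapest_by_input(models):
        savings = cached_savings(m)
        note = "no cached rate" if savings is None else f"{savings:.0%} off when cached"
        print(f'{m["name"]}: ${m["pricing"]["inputPerM"]}/1M input ({note})')
```

To run this against the real dataset, replace `SAMPLE_JSON` with the downloaded file's contents and adjust the types if the live structure differs from these assumptions.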

Model and family	Provider	Input $/1M	Cached $/1M	Output $/1M
LFM2 24B A2B (Together) LFM2	together	$0.03	—	$0.12
Command R7B Command R	cohere	$0.0375	—	$0.15
Gemma 3n E4B Instruct (Together) Gemma 3n	together	$0.06	—	$0.12
Embed v3 Multilingual Embed	cohere	$0.10	—	$0.00
Llama 4 Scout Llama 4	Meta	$0.10	—	$0.30
Mistral Small 4 Mistral Small	Mistral	$0.10	—	$0.30
Devstral Small 2 Devstral	Mistral	$0.10	—	$0.30
Together Llama 3 8B Instruct Lite Llama 3	together	$0.14	—	$0.14
openai/gpt-oss-120b groq	Groq	$0.15	$0.075	$0.60
GPT-4o mini GPT-4o	OpenAI	$0.15	$0.075	$0.60
Mistral NeMo Mistral Open	Mistral	$0.15	—	$0.15
GPT-OSS 120B (Together) GPT-OSS	together	$0.15	—	$0.60
Rnj-1 Instruct (Together) Rnj	together	$0.15	—	$0.15
Ministral 14B Ministral	Mistral	$0.20	—	$0.20
Together Qwen3 235B A22B Instruct 2507 Qwen3	together	$0.20	—	$0.60
Together Gemma 4 31B IT Pearl Gemma 4	together	$0.28	—	$0.86
MiniMax M2.7 (Together) MiniMax M2	together	$0.30	$0.06	$1.20
Together MiniMax M3 MiniMax M3	together	$0.30	$0.06	$1.20
Together MiniMax M2.5 MiniMax M2.5	together	$0.30	$0.06	$1.20
Together Qwen2.5 7B Instruct Turbo Qwen2.5	together	$0.30	—	$0.30
Together Qwen3.7 Plus Qwen3.7	together	$0.32	—	$1.28
GPT-4.1 mini GPT-4.1	OpenAI	$0.40	$0.10	$1.60
DeepSeek V4 Pro DeepSeek V4	DeepSeek	$0.435	$0.0036	$0.87
Magistral Small Magistral	Mistral	$0.50	—	$1.50
Qwen3.5 397B A17B (Together) Qwen3.5	together	$0.60	$0.35	$3.60
Together Nemotron 3 Ultra 550B A55B Nemotron 3	together	$0.60	$0.20	$3.60
Mixtral 8x7B Mixtral	Mistral	$0.70	—	$0.70
Together Kimi K2.7 Code Kimi K2.7	together	$0.95	$0.19	$4.00
Claude Haiku 4.5 Claude 4.5	Anthropic	$1.00	$0.10	$5.00
GLM-5 (Together) GLM	together	$1.00	—	$3.20
Llama 3.3 70B (Together) Llama 3.3	together	$1.04	—	$1.04
Kimi K2.6 (Together) Kimi K2	together	$1.20	$0.20	$4.50
Grok 4.3 Grok 4.3	xAI	$1.25	—	$2.50
Cogito v2.1 671B (Together) Cogito	together	$1.25	—	$1.25
GLM-5.1 (Together) GLM-5	together	$1.40	—	$4.40
GLM-5.2 GLM-5	Z.ai	$1.40	$0.26	$4.40
Gemini 3.5 Flash Gemini 3.5	Google	$1.50	$0.15	$9.00
Mistral Medium 3.5 Mistral Medium	Mistral	$1.50	—	$7.50
Together DeepSeek V4 Pro DeepSeek V4	together	$1.74	$0.20	$3.48
GPT-5.2 GPT-5	OpenAI	$1.75	$0.175	$14.00
Rerank v3 Rerank	cohere	$2.00	—	$0.00
Pixtral Large Mistral	Mistral	$2.00	—	$6.00
Sonar Reasoning Pro Sonar	perplexity	$2.00	—	$8.00
Gemini 2.5 Pro (>200k tokens) Gemini 2.5	Google	$2.50	$0.25	$15.00
GPT-5.6 Terra GPT-5.6	OpenAI	$2.50	$0.25	$15.00
Command A Command A	cohere	$2.50	—	$10.00
Claude Sonnet 4.6 Claude 4.6	Anthropic	$3.00	$0.30	$15.00
Claude Opus 4.8 Claude 4.8	Anthropic	$5.00	$0.50	$25.00
o3-pro o-series	OpenAI	$20.00	—	$80.00
GPT-5.4 Pro GPT-5.4	OpenAI	$30.00	—	$180.00
GPT-5.5 Pro GPT-5.5	OpenAI	$30.00	—	$180.00

AI API Token Pricing Comparison
