AI API Token Pricing Comparison
Pay-as-you-go API prices in USD per 1 million tokens. For monthly ChatGPT, Claude, Gemini, and Copilot plans, use subscription pricing. Data last updated: . Providers can change prices without notice, so verify directly before purchase decisions.
Which AI API is cheapest right now? We track 89 models across 11 providers. The cheapest flagship is Mistral Large 3 at $0.50 per 1M input tokens; the absolute cheapest production model is Command R7B at $0.04 per 1M input. The most expensive we track is GPT-5.4 Pro at $30.00 input / 180.00 output. Download the raw data as JSON.
Need exact math?
Use the token cost calculator
Enter your input/output token volume and estimate monthly spend before choosing a model.
Subscription or API?
Compare monthly plans vs API usage
If you use AI through a chat app, calculate whether a subscription is cheaper than raw API tokens.
| Model↕ | Provider↕ | Input $/1M↑ | Cached $/1M↕ | Output $/1M↕ |
|---|---|---|---|---|
Command R7B Command R | cohere | $0.0375 | — | $0.15 |
Llama 4 Scout Llama 4 | Meta | $0.08 | — | $0.30 |
Embed v3 Multilingual Embed | cohere | $0.10 | — | $0.00 |
GPT-4.1 nano GPT-4.1 | OpenAI | $0.10 | $0.025 | $0.40 |
Devstral Small 2 Devstral | Mistral | $0.10 | — | $0.30 |
Openai/gpt Oss 120b groq | Groq | $0.15 | $0.075 | $0.60 |
Mistral Small 4 Mistral Small | Mistral | $0.15 | — | $0.60 |
GPT-4o mini GPT-4o | OpenAI | $0.15 | $0.075 | $0.60 |
Mistral NeMo Mistral Open | Mistral | $0.15 | — | $0.15 |
GPT-OSS 120B (Together) GPT-OSS | together | $0.15 | — | $0.60 |
Ministral 14B Ministral | Mistral | $0.20 | — | $0.20 |
MiniMax M2.7 (Together) MiniMax M2 | together | $0.30 | $0.06 | $1.20 |
Magistral Small Magistral | Mistral | $0.50 | — | $1.50 |
Qwen3.6-Plus (Together) Qwen3.6 | together | $0.50 | — | $3.00 |
DeepSeek V3.1 (Together) DeepSeek | together | $0.60 | — | $1.70 |
Mixtral 8x7B Mixtral | Mistral | $0.70 | — | $0.70 |
Llama 3.3 70B (Together) Llama 3.3 | together | $0.88 | — | $0.88 |
Claude Haiku 4.5 Claude 4.5 | Anthropic | $1.00 | $0.10 | $5.00 |
o4-mini o-series | OpenAI | $1.10 | $0.275 | $4.40 |
Kimi K2.6 (Together) Kimi K2 | together | $1.20 | $0.20 | $4.50 |
Grok 4.3 Grok 4.3 | xAI | $1.25 | — | $2.50 |
GLM-5.1 (Together) GLM-5 | together | $1.40 | — | $4.40 |
Mistral Medium 3.5 Mistral Medium | Mistral | $1.50 | — | $7.50 |
DeepSeek V4 Pro DeepSeek V4 | DeepSeek | $1.74 | $0.0145 | $3.48 |
Rerank v3 Rerank | cohere | $2.00 | — | $0.00 |
Pixtral Large Mistral | Mistral | $2.00 | — | $6.00 |
Sonar Reasoning Pro Sonar | perplexity | $2.00 | — | $8.00 |
DeepSeek V4 Pro (Together) DeepSeek V4 | together | $2.10 | $0.20 | $4.40 |
Gemini 2.5 Pro (>200k tokens) Gemini 2.5 | $2.50 | $0.25 | $15.00 | |
Command A Command A | cohere | $2.50 | — | $10.00 |
Claude Sonnet 4.6 Claude 4.6 | Anthropic | $3.00 | $0.30 | $15.00 |
Claude Opus 4.7 Claude 4.7 | Anthropic | $5.00 | $0.50 | $25.00 |
GPT-5.4 Pro GPT-5.4 | OpenAI | $30.00 | — | $180.00 |
GPT-5.5 Pro GPT-5.5 | OpenAI | $30.00 | — | $180.00 |
- cohereCommand R7BCommand R
- Input
- $0.0375
- Cached
- —
- Output
- $0.15
- MetaLlama 4 ScoutLlama 4
- Input
- $0.08
- Cached
- —
- Output
- $0.30
- cohereEmbed v3 MultilingualEmbed
- Input
- $0.10
- Cached
- —
- Output
- $0.00
- OpenAIGPT-4.1 nanoGPT-4.1
- Input
- $0.10
- Cached
- $0.025
- Output
- $0.40
- MistralDevstral Small 2Devstral
- Input
- $0.10
- Cached
- —
- Output
- $0.30
- GroqOpenai/gpt Oss 120bgroq
- Input
- $0.15
- Cached
- $0.075
- Output
- $0.60
- MistralMistral Small 4Mistral Small
- Input
- $0.15
- Cached
- —
- Output
- $0.60
- OpenAIGPT-4o miniGPT-4o
- Input
- $0.15
- Cached
- $0.075
- Output
- $0.60
- MistralMistral NeMoMistral Open
- Input
- $0.15
- Cached
- —
- Output
- $0.15
- togetherGPT-OSS 120B (Together)GPT-OSS
- Input
- $0.15
- Cached
- —
- Output
- $0.60
- MistralMinistral 14BMinistral
- Input
- $0.20
- Cached
- —
- Output
- $0.20
- togetherMiniMax M2.7 (Together)MiniMax M2
- Input
- $0.30
- Cached
- $0.06
- Output
- $1.20
- MistralMagistral SmallMagistral
- Input
- $0.50
- Cached
- —
- Output
- $1.50
- togetherQwen3.6-Plus (Together)Qwen3.6
- Input
- $0.50
- Cached
- —
- Output
- $3.00
- togetherDeepSeek V3.1 (Together)DeepSeek
- Input
- $0.60
- Cached
- —
- Output
- $1.70
- MistralMixtral 8x7BMixtral
- Input
- $0.70
- Cached
- —
- Output
- $0.70
- togetherLlama 3.3 70B (Together)Llama 3.3
- Input
- $0.88
- Cached
- —
- Output
- $0.88
- AnthropicClaude Haiku 4.5Claude 4.5
- Input
- $1.00
- Cached
- $0.10
- Output
- $5.00
- OpenAIo4-minio-series
- Input
- $1.10
- Cached
- $0.275
- Output
- $4.40
- togetherKimi K2.6 (Together)Kimi K2
- Input
- $1.20
- Cached
- $0.20
- Output
- $4.50
- xAIGrok 4.3Grok 4.3
- Input
- $1.25
- Cached
- —
- Output
- $2.50
- togetherGLM-5.1 (Together)GLM-5
- Input
- $1.40
- Cached
- —
- Output
- $4.40
- MistralMistral Medium 3.5Mistral Medium
- Input
- $1.50
- Cached
- —
- Output
- $7.50
- DeepSeekDeepSeek V4 ProDeepSeek V4
- Input
- $1.74
- Cached
- $0.0145
- Output
- $3.48
- cohereRerank v3Rerank
- Input
- $2.00
- Cached
- —
- Output
- $0.00
- MistralPixtral LargeMistral
- Input
- $2.00
- Cached
- —
- Output
- $6.00
- perplexitySonar Reasoning ProSonar
- Input
- $2.00
- Cached
- —
- Output
- $8.00
- togetherDeepSeek V4 Pro (Together)DeepSeek V4
- Input
- $2.10
- Cached
- $0.20
- Output
- $4.40
- Gemini 2.5 Pro (>200k tokens)Gemini 2.5
- Input
- $2.50
- Cached
- $0.25
- Output
- $15.00
- cohereCommand ACommand A
- Input
- $2.50
- Cached
- —
- Output
- $10.00
- AnthropicClaude Sonnet 4.6Claude 4.6
- Input
- $3.00
- Cached
- $0.30
- Output
- $15.00
- AnthropicClaude Opus 4.7Claude 4.7
- Input
- $5.00
- Cached
- $0.50
- Output
- $25.00
- OpenAIGPT-5.4 ProGPT-5.4
- Input
- $30.00
- Cached
- —
- Output
- $180.00
- OpenAIGPT-5.5 ProGPT-5.5
- Input
- $30.00
- Cached
- —
- Output
- $180.00
Frequently asked questions
Quick answers about API token pricing, freshness, and how to compare providers.
What does this AI API pricing comparison cover?
This page tracks pay-as-you-go API token pricing for 89 AI models across 11 providers — OpenAI, Anthropic, Google, DeepSeek, Mistral, xAI, Cohere, Groq, Together AI, Perplexity, and Meta Llama hosts. Each row shows input price, output price, cached-input price (where the provider offers prompt caching), and batch-discounted rates per million tokens. The data snapshot shown is from 2026-05-15.
How often is the pricing data updated?
Prices are verified and reconciled daily by an automated pipeline that pulls from each provider's official pricing page and cross-checks against at least two independent sources before publishing. New models and price changes typically appear here within hours of the provider's announcement. If a provider hasn't moved its prices, the snapshot date stays the same — the current dataset is from 2026-05-15.
Which AI API is the cheapest right now?
As of 2026-05-15, the absolute cheapest production model we track is Command R7B at $0.04 per million input tokens. The cheapest flagship-class model — meaning a top-tier model from a major lab, not a small or distilled variant — is Mistral Large 3 at $0.50 per million input tokens. The most expensive model we track is GPT-5.4 Pro at $30.00 input / 180.00 output per 1M tokens.
How do I compare two specific AI providers like OpenAI and Anthropic?
Use the table search and filter controls above to narrow to a single provider, then sort by input or output price. For deeper provider-level pages with FAQs, methodology, and historical pricing, see /openai-pricing/, /anthropic-pricing/, /google-pricing/, /deepseek-pricing/, /mistral-ai-pricing/, /cohere-pricing/, or any other provider listed. For workload-level math, the token cost calculator at /calculators/token-cost/ runs your input/output volume against every model side-by-side.
Should I pay per token via API or subscribe to ChatGPT Plus or Claude Pro?
It depends on volume and access pattern. Monthly subscriptions like ChatGPT Plus, Claude Pro, and Gemini Advanced at roughly $20/month are usually cheaper than API access for chat-style usage under about 5 million input tokens per month. Production workloads, agents, and anything that needs API access almost always cost less pay-as-you-go on the API. The /calculators/subscription-vs-api/ tool computes the exact break-even point for your specific volume.
Can I download the AI API pricing data as JSON?
Yes. The full live dataset is published at /api/pricing.json under a CC BY 4.0 license, updated whenever this page is. The schema is { id, name, family, provider, pricing: { inputPerM, cachedInputPerM, outputPerM }, status } for all 89 tracked models. Use it in apps, dashboards, monitoring, or research without scraping HTML.