AI API Token Pricing Comparison
Pay-as-you-go API prices in USD per 1 million tokens. For monthly ChatGPT, Claude, Gemini, and Copilot plans, use subscription pricing. Data last updated: . Providers can change prices without notice, so verify directly before purchase decisions.
Which AI API is cheapest right now? We track 112 models across 11 providers. The cheapest flagship is Mistral Large 3 at $0.50 per 1M input tokens; the absolute cheapest production model is LFM2 24B A2B (Together) at $0.03 per 1M input. The most expensive we track is GPT-5.4 Pro at $30.00 input / 180.00 output. Download the raw data as JSON.
Need exact math?
Use the token cost calculator
Enter your input/output token volume and estimate monthly spend before choosing a model.
Subscription or API?
Compare monthly plans vs API usage
If you use AI through a chat app, calculate whether a subscription is cheaper than raw API tokens.
Voice AI tools
Compare ElevenLabs and Speechify before you buy
Voiceover, dubbing, voice cloning, and AI agents have different pricing traps than token APIs. Start with the pricing reviews, then test the tool that matches your workflow.
| Model↕ | Provider↕ | Input $/1M↑ | Cached $/1M↕ | Output $/1M↕ |
|---|---|---|---|---|
LFM2 24B A2B (Together) LFM2 | together | $0.03 | — | $0.12 |
Command R7B Command R | cohere | $0.0375 | — | $0.15 |
Gemma 3n E4B Instruct (Together) Gemma 3n | together | $0.06 | — | $0.12 |
Llama 4 Scout Llama 4 | Meta | $0.08 | — | $0.30 |
Embed v3 Multilingual Embed | cohere | $0.10 | — | $0.00 |
Mistral Small 4 Mistral Small | Mistral | $0.10 | — | $0.30 |
Devstral Small 2 Devstral | Mistral | $0.10 | — | $0.30 |
Openai/gpt Oss 120b groq | Groq | $0.15 | $0.075 | $0.60 |
GPT-4o mini GPT-4o | OpenAI | $0.15 | $0.075 | $0.60 |
Mistral NeMo Mistral Open | Mistral | $0.15 | — | $0.15 |
GPT-OSS 120B (Together) GPT-OSS | together | $0.15 | — | $0.60 |
Rnj-1 Instruct (Together) Rnj | together | $0.15 | — | $0.15 |
Ministral 14B Ministral | Mistral | $0.20 | — | $0.20 |
MiniMax M2.7 (Together) MiniMax M2 | together | $0.30 | $0.06 | $1.20 |
GPT-4.1 mini GPT-4.1 | OpenAI | $0.40 | $0.10 | $1.60 |
DeepSeek V4 Pro DeepSeek V4 | DeepSeek | $0.435 | $0.0036 | $0.87 |
Magistral Small Magistral | Mistral | $0.50 | — | $1.50 |
Qwen3.5 397B A17B (Together) Qwen3.5 | together | $0.60 | — | $3.60 |
Mixtral 8x7B Mixtral | Mistral | $0.70 | — | $0.70 |
Claude Haiku 4.5 Claude 4.5 | Anthropic | $1.00 | $0.10 | $5.00 |
GLM-5 (Together) GLM | together | $1.00 | — | $3.20 |
Llama 3.3 70B (Together) Llama 3.3 | together | $1.04 | — | $1.04 |
Kimi K2.6 (Together) Kimi K2 | together | $1.20 | $0.20 | $4.50 |
Grok 4.3 Grok 4.3 | xAI | $1.25 | — | $2.50 |
Qwen3.7-Max (Together) Qwen3.7 | together | $1.25 | $0.13 | $3.75 |
Cogito v2.1 671B (Together) Cogito | together | $1.25 | — | $1.25 |
GLM-5.1 (Together) GLM-5 | together | $1.40 | — | $4.40 |
Gemini 3.5 Flash Gemini 3.5 | $1.50 | $0.15 | $9.00 | |
Mistral Medium 3.5 Mistral Medium | Mistral | $1.50 | — | $7.50 |
Rerank v3 Rerank | cohere | $2.00 | — | $0.00 |
Pixtral Large Mistral | Mistral | $2.00 | — | $6.00 |
Sonar Reasoning Pro Sonar | perplexity | $2.00 | — | $8.00 |
DeepSeek V4 Pro (Together) DeepSeek V4 | together | $2.10 | $0.20 | $4.40 |
Gemini 2.5 Pro (>200k tokens) Gemini 2.5 | $2.50 | $0.25 | $15.00 | |
Command A Command A | cohere | $2.50 | — | $10.00 |
Claude Sonnet 4.6 Claude 4.6 | Anthropic | $3.00 | $0.30 | $15.00 |
Claude Opus 4.8 Claude 4.8 | Anthropic | $5.00 | $0.50 | $25.00 |
Claude Fable 5 Claude Fable 5 | Anthropic | $10.00 | $1.00 | $50.00 |
o3-pro o-series | OpenAI | $20.00 | — | $80.00 |
GPT-5.2 Pro GPT-5 | OpenAI | $21.00 | — | $168.00 |
GPT-5.4 Pro GPT-5.4 | OpenAI | $30.00 | — | $180.00 |
GPT-5.5 Pro GPT-5.5 | OpenAI | $30.00 | — | $180.00 |
- togetherLFM2 24B A2B (Together)LFM2
- Input
- $0.03
- Cached
- —
- Output
- $0.12
- cohereCommand R7BCommand R
- Input
- $0.0375
- Cached
- —
- Output
- $0.15
- togetherGemma 3n E4B Instruct (Together)Gemma 3n
- Input
- $0.06
- Cached
- —
- Output
- $0.12
- MetaLlama 4 ScoutLlama 4
- Input
- $0.08
- Cached
- —
- Output
- $0.30
- cohereEmbed v3 MultilingualEmbed
- Input
- $0.10
- Cached
- —
- Output
- $0.00
- MistralMistral Small 4Mistral Small
- Input
- $0.10
- Cached
- —
- Output
- $0.30
- MistralDevstral Small 2Devstral
- Input
- $0.10
- Cached
- —
- Output
- $0.30
- GroqOpenai/gpt Oss 120bgroq
- Input
- $0.15
- Cached
- $0.075
- Output
- $0.60
- OpenAIGPT-4o miniGPT-4o
- Input
- $0.15
- Cached
- $0.075
- Output
- $0.60
- MistralMistral NeMoMistral Open
- Input
- $0.15
- Cached
- —
- Output
- $0.15
- togetherGPT-OSS 120B (Together)GPT-OSS
- Input
- $0.15
- Cached
- —
- Output
- $0.60
- togetherRnj-1 Instruct (Together)Rnj
- Input
- $0.15
- Cached
- —
- Output
- $0.15
- MistralMinistral 14BMinistral
- Input
- $0.20
- Cached
- —
- Output
- $0.20
- togetherMiniMax M2.7 (Together)MiniMax M2
- Input
- $0.30
- Cached
- $0.06
- Output
- $1.20
- OpenAIGPT-4.1 miniGPT-4.1
- Input
- $0.40
- Cached
- $0.10
- Output
- $1.60
- DeepSeekDeepSeek V4 ProDeepSeek V4
- Input
- $0.435
- Cached
- $0.0036
- Output
- $0.87
- MistralMagistral SmallMagistral
- Input
- $0.50
- Cached
- —
- Output
- $1.50
- togetherQwen3.5 397B A17B (Together)Qwen3.5
- Input
- $0.60
- Cached
- —
- Output
- $3.60
- MistralMixtral 8x7BMixtral
- Input
- $0.70
- Cached
- —
- Output
- $0.70
- AnthropicClaude Haiku 4.5Claude 4.5
- Input
- $1.00
- Cached
- $0.10
- Output
- $5.00
- togetherGLM-5 (Together)GLM
- Input
- $1.00
- Cached
- —
- Output
- $3.20
- togetherLlama 3.3 70B (Together)Llama 3.3
- Input
- $1.04
- Cached
- —
- Output
- $1.04
- togetherKimi K2.6 (Together)Kimi K2
- Input
- $1.20
- Cached
- $0.20
- Output
- $4.50
- xAIGrok 4.3Grok 4.3
- Input
- $1.25
- Cached
- —
- Output
- $2.50
- togetherQwen3.7-Max (Together)Qwen3.7
- Input
- $1.25
- Cached
- $0.13
- Output
- $3.75
- togetherCogito v2.1 671B (Together)Cogito
- Input
- $1.25
- Cached
- —
- Output
- $1.25
- togetherGLM-5.1 (Together)GLM-5
- Input
- $1.40
- Cached
- —
- Output
- $4.40
- Gemini 3.5 FlashGemini 3.5
- Input
- $1.50
- Cached
- $0.15
- Output
- $9.00
- MistralMistral Medium 3.5Mistral Medium
- Input
- $1.50
- Cached
- —
- Output
- $7.50
- cohereRerank v3Rerank
- Input
- $2.00
- Cached
- —
- Output
- $0.00
- MistralPixtral LargeMistral
- Input
- $2.00
- Cached
- —
- Output
- $6.00
- perplexitySonar Reasoning ProSonar
- Input
- $2.00
- Cached
- —
- Output
- $8.00
- togetherDeepSeek V4 Pro (Together)DeepSeek V4
- Input
- $2.10
- Cached
- $0.20
- Output
- $4.40
- Gemini 2.5 Pro (>200k tokens)Gemini 2.5
- Input
- $2.50
- Cached
- $0.25
- Output
- $15.00
- cohereCommand ACommand A
- Input
- $2.50
- Cached
- —
- Output
- $10.00
- AnthropicClaude Sonnet 4.6Claude 4.6
- Input
- $3.00
- Cached
- $0.30
- Output
- $15.00
- AnthropicClaude Opus 4.8Claude 4.8
- Input
- $5.00
- Cached
- $0.50
- Output
- $25.00
- AnthropicClaude Fable 5Claude Fable 5
- Input
- $10.00
- Cached
- $1.00
- Output
- $50.00
- OpenAIo3-proo-series
- Input
- $20.00
- Cached
- —
- Output
- $80.00
- OpenAIGPT-5.2 ProGPT-5
- Input
- $21.00
- Cached
- —
- Output
- $168.00
- OpenAIGPT-5.4 ProGPT-5.4
- Input
- $30.00
- Cached
- —
- Output
- $180.00
- OpenAIGPT-5.5 ProGPT-5.5
- Input
- $30.00
- Cached
- —
- Output
- $180.00
Frequently asked questions
Quick answers about API token pricing, freshness, and how to compare providers.
What does this AI API pricing comparison cover?
This page tracks pay-as-you-go API token pricing for 112 AI models across 11 providers — OpenAI, Anthropic, Google, DeepSeek, Mistral, xAI, Cohere, Groq, Together AI, Perplexity, and Meta Llama hosts. Each row shows input price, output price, cached-input price (where the provider offers prompt caching), and batch-discounted rates per million tokens. The data snapshot shown is from 2026-06-11.
How often is the pricing data updated?
Prices are verified and reconciled daily by an automated pipeline that pulls from each provider's official pricing page and cross-checks against at least two independent sources before publishing. New models and price changes typically appear here within hours of the provider's announcement. If a provider hasn't moved its prices, the snapshot date stays the same — the current dataset is from 2026-06-11.
Which AI API is the cheapest right now?
As of 2026-06-11, the absolute cheapest production model we track is LFM2 24B A2B (Together) at $0.03 per million input tokens. The cheapest flagship-class model — meaning a top-tier model from a major lab, not a small or distilled variant — is Mistral Large 3 at $0.50 per million input tokens. The most expensive model we track is GPT-5.4 Pro at $30.00 input / 180.00 output per 1M tokens.
How do I compare two specific AI providers like OpenAI and Anthropic?
Use the table search and filter controls above to narrow to a single provider, then sort by input or output price. For deeper provider-level pages with FAQs, methodology, and historical pricing, see /openai-pricing/, /anthropic-pricing/, /google-pricing/, /deepseek-pricing/, /mistral-ai-pricing/, /cohere-pricing/, or any other provider listed. For workload-level math, the token cost calculator at /calculators/token-cost/ runs your input/output volume against every model side-by-side.
Should I pay per token via API or subscribe to ChatGPT Plus or Claude Pro?
It depends on volume and access pattern. Monthly subscriptions like ChatGPT Plus, Claude Pro, and Gemini Advanced at roughly $20/month are usually cheaper than API access for chat-style usage under about 5 million input tokens per month. Production workloads, agents, and anything that needs API access almost always cost less pay-as-you-go on the API. The /subscription-vs-api/ tool computes the exact break-even point for your specific volume.
Can I download the AI API pricing data as JSON?
Yes. The full live dataset is published at /api/pricing.json and updated whenever this page is. The schema is { id, name, family, provider, pricing: { inputPerM, cachedInputPerM, outputPerM }, status } for all 112 tracked models. The AI Pricing Guru dataset is provided for informational use. You may use it for personal, editorial, research, educational, and internal business purposes with appropriate attribution to AI Pricing Guru. Commercial redistribution, resale, republishing at scale, inclusion in a competing public dataset/API, or use as the primary data source for a commercial pricing-comparison product requires prior written permission.