AI API Token Pricing Comparison

Pay-as-you-go API prices in USD per 1 million tokens. For monthly ChatGPT, Claude, Gemini, and Copilot plans, see subscription pricing instead. Data last updated: 2026-06-11. Providers can change prices without notice, so verify directly with the provider before making purchase decisions.

Which AI API is cheapest right now? We track 112 models across 11 providers. The cheapest flagship is Mistral Large 3 at $0.50 per 1M input tokens; the absolute cheapest production model is LFM2 24B A2B (Together) at $0.03 per 1M input tokens. The most expensive we track is GPT-5.4 Pro at $30.00 input / $180.00 output per 1M tokens (GPT-5.5 Pro is listed at the same rates). The raw data is available as JSON at /api/pricing.json.

Showing 42 current models.
Prices are USD per 1M tokens. A dash in the Cached column means no cached-input price is listed.

| Model | Family | Provider | Input | Cached | Output |
|---|---|---|---|---|---|
| LFM2 24B A2B (Together) | LFM2 | Together AI | $0.03 | - | $0.12 |
| Command R7B | Command R | Cohere | $0.0375 | - | $0.15 |
| Gemma 3n E4B Instruct (Together) | Gemma 3n | Together AI | $0.06 | - | $0.12 |
| Llama 4 Scout | Llama 4 | Meta | $0.08 | - | $0.30 |
| Embed v3 Multilingual | Embed | Cohere | $0.10 | - | $0.00 |
| Mistral Small 4 | Mistral Small | Mistral | $0.10 | - | $0.30 |
| Devstral Small 2 | Devstral | Mistral | $0.10 | - | $0.30 |
| openai/gpt-oss-120b | GPT-OSS | Groq | $0.15 | $0.075 | $0.60 |
| GPT-4o mini | GPT-4o | OpenAI | $0.15 | $0.075 | $0.60 |
| Mistral NeMo | Mistral Open | Mistral | $0.15 | - | $0.15 |
| GPT-OSS 120B (Together) | GPT-OSS | Together AI | $0.15 | - | $0.60 |
| Rnj-1 Instruct (Together) | Rnj | Together AI | $0.15 | - | $0.15 |
| Ministral 14B | Ministral | Mistral | $0.20 | - | $0.20 |
| MiniMax M2.7 (Together) | MiniMax M2 | Together AI | $0.30 | $0.06 | $1.20 |
| GPT-4.1 mini | GPT-4.1 | OpenAI | $0.40 | $0.10 | $1.60 |
| DeepSeek V4 Pro | DeepSeek V4 | DeepSeek | $0.435 | $0.0036 | $0.87 |
| Magistral Small | Magistral | Mistral | $0.50 | - | $1.50 |
| Qwen3.5 397B A17B (Together) | Qwen3.5 | Together AI | $0.60 | - | $3.60 |
| Mixtral 8x7B | Mixtral | Mistral | $0.70 | - | $0.70 |
| Claude Haiku 4.5 | Claude 4.5 | Anthropic | $1.00 | $0.10 | $5.00 |
| GLM-5 (Together) | GLM | Together AI | $1.00 | - | $3.20 |
| Llama 3.3 70B (Together) | Llama 3.3 | Together AI | $1.04 | - | $1.04 |
| Kimi K2.6 (Together) | Kimi K2 | Together AI | $1.20 | $0.20 | $4.50 |
| Grok 4.3 | Grok 4.3 | xAI | $1.25 | - | $2.50 |
| Qwen3.7-Max (Together) | Qwen3.7 | Together AI | $1.25 | $0.13 | $3.75 |
| Cogito v2.1 671B (Together) | Cogito | Together AI | $1.25 | - | $1.25 |
| GLM-5.1 (Together) | GLM-5 | Together AI | $1.40 | - | $4.40 |
| Gemini 3.5 Flash | Gemini 3.5 | Google | $1.50 | $0.15 | $9.00 |
| Mistral Medium 3.5 | Mistral Medium | Mistral | $1.50 | - | $7.50 |
| Rerank v3 | Rerank | Cohere | $2.00 | - | $0.00 |
| Pixtral Large | Mistral | Mistral | $2.00 | - | $6.00 |
| Sonar Reasoning Pro | Sonar | Perplexity | $2.00 | - | $8.00 |
| DeepSeek V4 Pro (Together) | DeepSeek V4 | Together AI | $2.10 | $0.20 | $4.40 |
| Gemini 2.5 Pro (>200k tokens) | Gemini 2.5 | Google | $2.50 | $0.25 | $15.00 |
| Command A | Command A | Cohere | $2.50 | - | $10.00 |
| Claude Sonnet 4.6 | Claude 4.6 | Anthropic | $3.00 | $0.30 | $15.00 |
| Claude Opus 4.8 | Claude 4.8 | Anthropic | $5.00 | $0.50 | $25.00 |
| Claude Fable 5 | Claude Fable 5 | Anthropic | $10.00 | $1.00 | $50.00 |
| o3-pro | o-series | OpenAI | $20.00 | - | $80.00 |
| GPT-5.2 Pro | GPT-5 | OpenAI | $21.00 | - | $168.00 |
| GPT-5.4 Pro | GPT-5.4 | OpenAI | $30.00 | - | $180.00 |
| GPT-5.5 Pro | GPT-5.5 | OpenAI | $30.00 | - | $180.00 |

Showing 42 of 112 models · USD per 1M tokens

Frequently asked questions

Quick answers about API token pricing, freshness, and how to compare providers.

What does this AI API pricing comparison cover?

This page tracks pay-as-you-go API token pricing for 112 AI models across 11 providers — OpenAI, Anthropic, Google, DeepSeek, Mistral, xAI, Cohere, Groq, Together AI, Perplexity, and Meta Llama hosts. Each row shows input price, output price, cached-input price (where the provider offers prompt caching), and batch-discounted rates per million tokens. The data snapshot shown is from 2026-06-11.

How often is the pricing data updated?

Prices are verified and reconciled daily by an automated pipeline that pulls from each provider's official pricing page and cross-checks against at least two independent sources before publishing. New models and price changes typically appear here within hours of the provider's announcement. If a provider hasn't moved its prices, the snapshot date stays the same — the current dataset is from 2026-06-11.

Which AI API is the cheapest right now?

As of 2026-06-11, the absolute cheapest production model we track is LFM2 24B A2B (Together) at $0.03 per million input tokens. The cheapest flagship-class model (meaning a top-tier model from a major lab, not a small or distilled variant) is Mistral Large 3 at $0.50 per million input tokens. The most expensive model we track is GPT-5.4 Pro at $30.00 input / $180.00 output per 1M tokens; GPT-5.5 Pro is listed at the same rates.

How do I compare two specific AI providers like OpenAI and Anthropic?

Use the table search and filter controls above to narrow to a single provider, then sort by input or output price. For deeper provider-level pages with FAQs, methodology, and historical pricing, see /openai-pricing/, /anthropic-pricing/, /google-pricing/, /deepseek-pricing/, /mistral-ai-pricing/, /cohere-pricing/, or any other provider listed. For workload-level math, the token cost calculator at /calculators/token-cost/ runs your input/output volume against every model side-by-side.
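For readers who want to check the workload math by hand, the per-million pricing above reduces to a simple formula. The sketch below is an illustration only, not the calculator's actual implementation: the `Price` class, the `workload_cost` and `compare` helpers, and the `cached_fraction` parameter are hypothetical names, and the three sample prices are copied from the table on this page.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Price:
    """USD per 1M tokens. cached_per_m is None when no cached rate is listed."""
    input_per_m: float
    output_per_m: float
    cached_per_m: Optional[float] = None


def workload_cost(price: Price, input_tokens: int, output_tokens: int,
                  cached_fraction: float = 0.0) -> float:
    """Estimate the USD cost of a workload.

    cached_fraction is the ASSUMED share of input tokens billed at the
    cached-input rate. It is ignored when the model lists no cached rate.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    cached_rate = price.cached_per_m
    if cached_rate is None:
        # No cached rate listed: bill every input token at the normal rate.
        cached_fraction = 0.0
        cached_rate = price.input_per_m
    cached_tokens = input_tokens * cached_fraction
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * price.input_per_m
             + cached_tokens * cached_rate
             + output_tokens * price.output_per_m)
    return total / 1_000_000


# Sample rows copied from the table on this page (USD per 1M tokens).
PRICES: Dict[str, Price] = {
    "GPT-4o mini": Price(0.15, 0.60, 0.075),
    "Claude Haiku 4.5": Price(1.00, 5.00, 0.10),
    "Mistral Small 4": Price(0.10, 0.30),
}


def compare(prices: Dict[str, Price], input_tokens: int, output_tokens: int,
            cached_fraction: float = 0.0) -> List[Tuple[str, float]]:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    rows = [(name, workload_cost(p, input_tokens, output_tokens, cached_fraction))
            for name, p in prices.items()]
    return sorted(rows, key=lambda row: row[1])


if __name__ == "__main__":
    # 10M input tokens and 2M output tokens per month, no caching assumed.
    for name, cost in compare(PRICES, 10_000_000, 2_000_000):
        print(f"{name}: ${cost:.2f}")
```

Because output tokens are usually priced several times higher than input tokens, the ranking can change with the input/output mix, so it is worth running the comparison with your own volumes rather than sorting by input price alone.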

Should I pay per token via API or subscribe to ChatGPT Plus or Claude Pro?

It depends on volume and access pattern. Monthly subscriptions like ChatGPT Plus, Claude Pro, and Gemini Advanced at roughly $20/month are usually cheaper than API access for chat-style usage under about 5 million input tokens per month. Production workloads, agents, and anything that needs API access almost always cost less pay-as-you-go on the API. The /subscription-vs-api/ tool computes the exact break-even point for your specific volume.
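The break-even idea can be sketched in a few lines. This is a rough illustration under stated assumptions, not the method the /subscription-vs-api/ tool uses: the function name `break_even_input_tokens` and the `output_per_input` ratio (output tokens generated per input token) are hypothetical, and the result depends heavily on which model and ratio you plug in.

```python
def break_even_input_tokens(monthly_fee_usd: float,
                            input_per_m: float,
                            output_per_m: float,
                            output_per_input: float = 0.25) -> float:
    """Monthly input-token volume at which API spend equals a flat subscription.

    input_per_m and output_per_m are USD per 1M tokens.
    output_per_input is an ASSUMED ratio of output tokens to input tokens.
    """
    if monthly_fee_usd < 0 or output_per_input < 0:
        raise ValueError("fee and ratio must be non-negative")
    # Cost of 1M input tokens plus the output tokens they are assumed to produce.
    blended_per_m_input = input_per_m + output_per_input * output_per_m
    if blended_per_m_input <= 0:
        raise ValueError("blended price must be positive")
    return monthly_fee_usd / blended_per_m_input * 1_000_000


if __name__ == "__main__":
    # Claude Sonnet 4.6 rates from the table: $3.00 input, $15.00 output.
    tokens = break_even_input_tokens(20.0, 3.00, 15.00, output_per_input=0.25)
    print(f"Break-even at about {tokens / 1_000_000:.2f}M input tokens per month")
```

With these particular assumptions the break-even lands at roughly 2.96 million input tokens per month; a cheaper model or a lower output ratio pushes it higher.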

Can I download the AI API pricing data as JSON?

Yes. The full live dataset is published at /api/pricing.json and updated whenever this page is. The schema is { id, name, family, provider, pricing: { inputPerM, cachedInputPerM, outputPerM }, status } for all 112 tracked models.

The AI Pricing Guru dataset is provided for informational use. You may use it for personal, editorial, research, educational, and internal business purposes with appropriate attribution to AI Pricing Guru. Commercial redistribution, resale, republishing at scale, inclusion in a competing public dataset/API, or use as the primary data source for a commercial pricing-comparison product requires prior written permission.
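As a minimal sketch of working with records in the schema described above, the snippet below parses an inline sample rather than fetching the live file. Several details are assumptions and may not match the published dataset: that the top level is a JSON array, that `cachedInputPerM` is null when no cached rate exists, that `status` uses the value "current", and the `id` strings themselves. The helper names are hypothetical; the prices are copied from the table on this page.

```python
import json
from typing import Any, Dict, List, Optional

# Inline sample in the documented schema. The id values, the "current"
# status value, and the top-level array shape are ASSUMPTIONS.
SAMPLE = """
[
  {"id": "lfm2-24b-a2b-together", "name": "LFM2 24B A2B (Together)",
   "family": "LFM2", "provider": "together",
   "pricing": {"inputPerM": 0.03, "cachedInputPerM": null, "outputPerM": 0.12},
   "status": "current"},
  {"id": "gpt-4o-mini", "name": "GPT-4o mini",
   "family": "GPT-4o", "provider": "OpenAI",
   "pricing": {"inputPerM": 0.15, "cachedInputPerM": 0.075, "outputPerM": 0.60},
   "status": "current"},
  {"id": "gpt-5-4-pro", "name": "GPT-5.4 Pro",
   "family": "GPT-5.4", "provider": "OpenAI",
   "pricing": {"inputPerM": 30.00, "cachedInputPerM": null, "outputPerM": 180.00},
   "status": "current"}
]
"""


def load_models(text: str) -> List[Dict[str, Any]]:
    """Parse a JSON array of model records."""
    return json.loads(text)


def cheapest_by_input(models: List[Dict[str, Any]],
                      provider: Optional[str] = None,
                      status: str = "current") -> Optional[Dict[str, Any]]:
    """Return the record with the lowest input price, optionally per provider."""
    candidates = [
        m for m in models
        if m["status"] == status
        and (provider is None or m["provider"].lower() == provider.lower())
    ]
    if not candidates:
        return None
    return min(candidates, key=lambda m: m["pricing"]["inputPerM"])


if __name__ == "__main__":
    models = load_models(SAMPLE)
    overall = cheapest_by_input(models)
    openai = cheapest_by_input(models, provider="openai")
    print(overall["name"], overall["pricing"]["inputPerM"])
    print(openai["name"], openai["pricing"]["inputPerM"])
```

To use the real data, replace `SAMPLE` with the contents of /api/pricing.json and adjust the field handling if the live file differs from the assumptions listed here.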