AI API Token Pricing Comparison

Pay-as-you-go API prices in USD per 1 million tokens. For monthly ChatGPT, Claude, Gemini, and Copilot plans, see subscription pricing. Data last updated: 2026-05-15. Providers can change prices without notice, so verify with the provider directly before making purchase decisions.

Which AI API is cheapest right now? We track 89 models across 11 providers. The cheapest flagship is Mistral Large 3 at $0.50 per 1M input tokens; the absolute cheapest production model is Command R7B at $0.0375 (about $0.04) per 1M input tokens. The most expensive we track is GPT-5.4 Pro at $30.00 input / $180.00 output per 1M tokens, a price GPT-5.5 Pro matches. The raw data is available as JSON at /api/pricing.json.

Showing 34 current models.
| Model | Family | Provider | Input | Cached input | Output |
|---|---|---|---|---|---|
| Command R7B | Command R | Cohere | $0.0375 | n/a | $0.15 |
| Llama 4 Scout | Llama 4 | Meta | $0.08 | n/a | $0.30 |
| Embed v3 Multilingual | Embed | Cohere | $0.10 | n/a | $0.00 |
| GPT-4.1 nano | GPT-4.1 | OpenAI | $0.10 | $0.025 | $0.40 |
| Devstral Small 2 | Devstral | Mistral | $0.10 | n/a | $0.30 |
| openai/gpt-oss-120b | Groq | Groq | $0.15 | $0.075 | $0.60 |
| Mistral Small 4 | Mistral Small | Mistral | $0.15 | n/a | $0.60 |
| GPT-4o mini | GPT-4o | OpenAI | $0.15 | $0.075 | $0.60 |
| Mistral NeMo | Mistral Open | Mistral | $0.15 | n/a | $0.15 |
| GPT-OSS 120B (Together) | GPT-OSS | Together AI | $0.15 | n/a | $0.60 |
| Ministral 14B | Ministral | Mistral | $0.20 | n/a | $0.20 |
| MiniMax M2.7 (Together) | MiniMax M2 | Together AI | $0.30 | $0.06 | $1.20 |
| Magistral Small | Magistral | Mistral | $0.50 | n/a | $1.50 |
| Qwen3.6-Plus (Together) | Qwen3.6 | Together AI | $0.50 | n/a | $3.00 |
| DeepSeek V3.1 (Together) | DeepSeek | Together AI | $0.60 | n/a | $1.70 |
| Mixtral 8x7B | Mixtral | Mistral | $0.70 | n/a | $0.70 |
| Llama 3.3 70B (Together) | Llama 3.3 | Together AI | $0.88 | n/a | $0.88 |
| Claude Haiku 4.5 | Claude 4.5 | Anthropic | $1.00 | $0.10 | $5.00 |
| o4-mini | o-series | OpenAI | $1.10 | $0.275 | $4.40 |
| Kimi K2.6 (Together) | Kimi K2 | Together AI | $1.20 | $0.20 | $4.50 |
| Grok 4.3 | Grok 4.3 | xAI | $1.25 | n/a | $2.50 |
| GLM-5.1 (Together) | GLM-5 | Together AI | $1.40 | n/a | $4.40 |
| Mistral Medium 3.5 | Mistral Medium | Mistral | $1.50 | n/a | $7.50 |
| DeepSeek V4 Pro | DeepSeek V4 | DeepSeek | $1.74 | $0.0145 | $3.48 |
| Rerank v3 | Rerank | Cohere | $2.00 | n/a | $0.00 |
| Pixtral Large | Mistral | Mistral | $2.00 | n/a | $6.00 |
| Sonar Reasoning Pro | Sonar | Perplexity | $2.00 | n/a | $8.00 |
| DeepSeek V4 Pro (Together) | DeepSeek V4 | Together AI | $2.10 | $0.20 | $4.40 |
| Gemini 2.5 Pro (>200k tokens) | Gemini 2.5 | Google | $2.50 | $0.25 | $15.00 |
| Command A | Command A | Cohere | $2.50 | n/a | $10.00 |
| Claude Sonnet 4.6 | Claude 4.6 | Anthropic | $3.00 | $0.30 | $15.00 |
| Claude Opus 4.7 | Claude 4.7 | Anthropic | $5.00 | $0.50 | $25.00 |
| GPT-5.4 Pro | GPT-5.4 | OpenAI | $30.00 | n/a | $180.00 |
| GPT-5.5 Pro | GPT-5.5 | OpenAI | $30.00 | n/a | $180.00 |
Showing 34 of 89 models · USD per 1M tokens

Frequently asked questions

Quick answers about API token pricing, freshness, and how to compare providers.

What does this AI API pricing comparison cover?

This page tracks pay-as-you-go API token pricing for 89 AI models across 11 providers — OpenAI, Anthropic, Google, DeepSeek, Mistral, xAI, Cohere, Groq, Together AI, Perplexity, and Meta Llama hosts. Each row shows input price, output price, cached-input price (where the provider offers prompt caching), and batch-discounted rates per million tokens. The data snapshot shown is from 2026-05-15.
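
To illustrate how the cached-input column changes what you actually pay, here is a minimal Python sketch that blends the normal and cached input rates by a cache hit rate. The function name `effective_input_price` and the 80% hit rate are illustrative assumptions, and the sketch ignores any cache-write surcharge or minimum prompt length a provider may apply.

```python
def effective_input_price(input_per_m: float, cached_per_m: float,
                          cache_hit_rate: float) -> float:
    """Blended USD price per 1M input tokens for a given cache hit rate.

    Assumption: cached tokens are billed at cached_per_m and all other
    input tokens at input_per_m, with no extra cache-write fee.
    """
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    return (1.0 - cache_hit_rate) * input_per_m + cache_hit_rate * cached_per_m


# Claude Haiku 4.5 as listed on this page: $1.00 input, $0.10 cached input.
blended = effective_input_price(1.00, 0.10, cache_hit_rate=0.8)
print(f"${blended:.2f} per 1M input tokens")  # prints "$0.28 per 1M input tokens"
```

Under these assumptions, a workload that reuses most of its prompt pays far less per input token than the headline input price suggests.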

How often is the pricing data updated?

Prices are verified and reconciled daily by an automated pipeline that pulls from each provider's official pricing page and cross-checks against at least two independent sources before publishing. New models and price changes typically appear here within hours of the provider's announcement. If a provider hasn't moved its prices, the snapshot date stays the same — the current dataset is from 2026-05-15.

Which AI API is the cheapest right now?

As of 2026-05-15, the absolute cheapest production model we track is Command R7B at $0.0375 (about $0.04) per million input tokens. The cheapest flagship-class model (meaning a top-tier model from a major lab, not a small or distilled variant) is Mistral Large 3 at $0.50 per million input tokens. The most expensive model we track is GPT-5.4 Pro at $30.00 input / $180.00 output per 1M tokens.

How do I compare two specific AI providers like OpenAI and Anthropic?

Use the table search and filter controls above to narrow to a single provider, then sort by input or output price. For deeper provider-level pages with FAQs, methodology, and historical pricing, see /openai-pricing/, /anthropic-pricing/, /google-pricing/, /deepseek-pricing/, /mistral-ai-pricing/, /cohere-pricing/, or any other provider listed. For workload-level math, the token cost calculator at /calculators/token-cost/ runs your input/output volume against every model side-by-side.
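
The side-by-side workload math can be approximated by hand. The sketch below is not the calculator's actual implementation; it simply applies the per-million prices from this page to an assumed monthly volume (20M input and 4M output tokens) for three models, and the names `Price`, `workload_cost`, and `rank_by_cost` are hypothetical.

```python
from typing import NamedTuple


class Price(NamedTuple):
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens


# Prices copied from the table on this page (snapshot 2026-05-15).
PRICES = {
    "GPT-4o mini": Price(0.15, 0.60),
    "Claude Haiku 4.5": Price(1.00, 5.00),
    "Claude Sonnet 4.6": Price(3.00, 15.00),
}


def workload_cost(price: Price, input_tokens: int, output_tokens: int) -> float:
    """USD cost of a workload at the given per-million prices."""
    return (input_tokens / 1_000_000 * price.input_per_m
            + output_tokens / 1_000_000 * price.output_per_m)


def rank_by_cost(prices, input_tokens, output_tokens):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {name: workload_cost(p, input_tokens, output_tokens)
             for name, p in prices.items()}
    return sorted(costs.items(), key=lambda item: item[1])


# Assumed monthly volume: 20M input tokens and 4M output tokens.
ranking = rank_by_cost(PRICES, 20_000_000, 4_000_000)
for name, cost in ranking:
    print(f"{name}: ${cost:.2f}")
```

Because output tokens are priced several times higher than input tokens on most rows, the ranking can shift when a workload is output-heavy.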

Should I pay per token via API or subscribe to ChatGPT Plus or Claude Pro?

It depends on volume and access pattern. Monthly subscriptions such as ChatGPT Plus, Claude Pro, and Gemini Advanced, at roughly $20/month, are usually cheaper than API access for chat-style usage under about 5 million input tokens per month. Production workloads, agents, and anything else that needs programmatic access almost always cost less on pay-as-you-go API pricing. The /calculators/subscription-vs-api/ tool computes the exact break-even point for your specific volume.
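
As a rough illustration of what a break-even calculation involves, the sketch below finds the monthly input volume at which API spend equals a flat subscription fee. It is not the calculator's actual logic: the function name `break_even_input_tokens`, the 0.2 output-to-input ratio, and the choice of Claude Sonnet 4.6 as the comparison model are all assumptions, and the result moves substantially with each of them.

```python
def break_even_input_tokens(subscription_fee: float, input_per_m: float,
                            output_per_m: float, output_ratio: float) -> float:
    """Monthly input tokens at which API spend equals the subscription fee.

    output_ratio is the assumed number of output tokens generated per
    input token (for example 0.2 means 200k output per 1M input).
    """
    if output_ratio < 0:
        raise ValueError("output_ratio must be non-negative")
    # API cost of 1M input tokens plus the output they are assumed to produce.
    cost_per_m_input = input_per_m + output_ratio * output_per_m
    return subscription_fee / cost_per_m_input * 1_000_000


# $20/month plan vs. Claude Sonnet 4.6 at $3.00 input / $15.00 output.
tokens = break_even_input_tokens(20.0, 3.00, 15.00, output_ratio=0.2)
print(f"API spend reaches $20 at about {tokens / 1_000_000:.2f}M input tokens")
```

Swapping in a cheaper or more expensive model from the table shifts this volume by an order of magnitude, which is why the comparison needs your own numbers.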

Can I download the AI API pricing data as JSON?

Yes. The full live dataset is published at /api/pricing.json under a CC BY 4.0 license, updated whenever this page is. The schema is { id, name, family, provider, pricing: { inputPerM, cachedInputPerM, outputPerM }, status } for all 89 tracked models. Use it in apps, dashboards, monitoring, or research without scraping HTML.
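
A minimal sketch of consuming data in this schema follows. It parses an inline sample rather than fetching the live file, so several details are assumptions: that the top level is an array, the `id` values, the `"current"` status string, and `null` for a missing cached price. The helper `cheapest_by_input` is hypothetical.

```python
import json

# Hypothetical sample in the schema described above. The top-level array,
# the ids, the "current" status value, and null for a missing cached price
# are assumptions, not confirmed details of /api/pricing.json.
SAMPLE_JSON = """
[
  {"id": "command-r7b", "name": "Command R7B", "family": "Command R",
   "provider": "cohere", "status": "current",
   "pricing": {"inputPerM": 0.0375, "cachedInputPerM": null, "outputPerM": 0.15}},
  {"id": "gpt-4.1-nano", "name": "GPT-4.1 nano", "family": "GPT-4.1",
   "provider": "OpenAI", "status": "current",
   "pricing": {"inputPerM": 0.10, "cachedInputPerM": 0.025, "outputPerM": 0.40}},
  {"id": "claude-haiku-4.5", "name": "Claude Haiku 4.5", "family": "Claude 4.5",
   "provider": "Anthropic", "status": "current",
   "pricing": {"inputPerM": 1.00, "cachedInputPerM": 0.10, "outputPerM": 5.00}}
]
"""


def cheapest_by_input(models, provider=None):
    """Return the model with the lowest input price, optionally for one provider."""
    candidates = [
        m for m in models
        if provider is None or m["provider"].lower() == provider.lower()
    ]
    if not candidates:
        raise ValueError(f"no models found for provider {provider!r}")
    return min(candidates, key=lambda m: m["pricing"]["inputPerM"])


models = json.loads(SAMPLE_JSON)
overall = cheapest_by_input(models)
openai_only = cheapest_by_input(models, provider="openai")
print(overall["name"], overall["pricing"]["inputPerM"])
print(openai_only["name"], openai_only["pricing"]["inputPerM"])
```

The provider comparison is case-insensitive because provider names appear in mixed case on this page (for example `cohere` and `OpenAI`).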