AI Token Cost Calculator

Enter your expected token usage and instantly see costs across all major AI providers. Last updated: 2026-06-02

How do I calculate AI API costs? Multiply your input tokens by the provider's input rate and your output tokens by the output rate, then divide by 1 million. For example, 500,000 input + 100,000 output on GPT-5.4 ($2.50/$15.00 per 1M) = $1.25 + $1.50 = $2.75 per request cycle. This calculator runs that math across all 103 tracked models so you can spot the cheapest option in one glance.

Input tokens

Output tokens · auto (1:3)

Cached input0%

Output ratioOutput auto-calculates from input — pick a workload or switch to Manual.

Tip: type 1M, 500k or 10,000. Cached-input tokens are billed at each model's discounted cached rate.

Show legacy

24 legacy models hidden —.

Provider	Model	Input cost ↕	Output cost ↕	Total ↑
cohere	Embed v3 English Embed	$0.10	$0.00	$0.10
cohere	Embed v3 Multilingual Embed	$0.10	$0.00	$0.10
groq	Llama 3.1 8b Instant groq	$0.05	$0.08	$0.13
cohere	Command R7B Command R	$0.0375	$0.15	$0.1875
Mistral	Ministral 3B Ministral	$0.10	$0.10	$0.20
together	GPT-OSS 20B (Together) GPT-OSS	$0.05	$0.20	$0.25
Mistral	Ministral 8B Ministral	$0.15	$0.15	$0.30
Mistral	Mistral NeMo Mistral Open	$0.15	$0.15	$0.30
Mistral	Pixtral 12B Mistral	$0.15	$0.15	$0.30
groq	GPT OSS Safeguard 20B GPT OSS	$0.075	$0.30	$0.375
groq	Openai/gpt Oss 20b groq	$0.075	$0.30	$0.375
Meta	Llama 4 Scout Llama 4	$0.08	$0.30	$0.38
Mistral	Devstral Small 2 Devstral	$0.10	$0.30	$0.40
Mistral	Ministral 14B Ministral	$0.20	$0.20	$0.40
Mistral	Mistral Small 4 Mistral Small	$0.10	$0.30	$0.40
DeepSeek	DeepSeek V4 Flash DeepSeek V4	$0.14	$0.28	$0.42
OpenAI	GPT-5 nano GPT-5	$0.05	$0.40	$0.45
groq	Llama 4 Scout 17B 16E Instruct Llama 4	$0.11	$0.34	$0.45
Google	Gemini 2.5 Flash-Lite Gemini 2.5	$0.10	$0.40	$0.50
Mistral	Mistral 7B Mistral Open	$0.25	$0.25	$0.50
cohere	Command R 08-2024 Command R	$0.15	$0.60	$0.75
OpenAI	GPT-4o mini GPT-4o	$0.15	$0.60	$0.75
together	GPT-OSS 120B (Together) GPT-OSS	$0.15	$0.60	$0.75
Meta	Llama 4 Maverick Llama 4	$0.15	$0.60	$0.75
groq	Openai/gpt Oss 120b groq	$0.15	$0.60	$0.75
groq	Qwen3 32B Qwen3	$0.29	$0.59	$0.88
Mistral	Codestral Mistral	$0.30	$0.90	$1.20
groq	Llama 3.3 70b Versatile groq	$0.59	$0.79	$1.38
Mistral	Mixtral 8x7B Mixtral	$0.70	$0.70	$1.40
OpenAI	GPT-5.4 nano GPT-5.4	$0.20	$1.25	$1.45
together	MiniMax M2.7 (Together) MiniMax M2	$0.30	$1.20	$1.50
Google	Gemini 3.1 Flash-Lite Gemini 3	$0.25	$1.50	$1.75
together	Llama 3.3 70B (Together) Llama 3.3	$0.88	$0.88	$1.76
OpenAI	GPT-4.1 mini GPT-4.1	$0.40	$1.60	$2.00
Mistral	Magistral Small Magistral	$0.50	$1.50	$2.00
Mistral	Mistral Large 3 Mistral	$0.50	$1.50	$2.00
cohere	Rerank v3 Rerank	$2.00	$0.00	$2.00
perplexity	Sonar Sonar	$1.00	$1.00	$2.00
OpenAI	GPT-5 mini GPT-5	$0.25	$2.00	$2.25
Mistral	Devstral Medium 2 Devstral	$0.40	$2.00	$2.40
Google	Gemini 2.5 Flash Gemini 2.5	$0.30	$2.50	$2.80
Google	Gemini 3 Flash Gemini 3	$0.50	$3.00	$3.50
xAI	Grok 4.3 Grok 4.3	$1.25	$2.50	$3.75
together	Qwen3.7-Max (Together) Qwen3.7	$1.25	$3.75	$5.00
DeepSeek	DeepSeek V4 Pro DeepSeek V4	$1.74	$3.48	$5.22
OpenAI	GPT-5.4 mini GPT-5.4	$0.75	$4.50	$5.25
together	Kimi K2.6 (Together) Kimi K2	$1.20	$4.50	$5.70
together	GLM-5.1 (Together) GLM-5	$1.40	$4.40	$5.80
Anthropic	Claude Haiku 4.5 Claude 4.5	$1.00	$5.00	$6.00
together	DeepSeek V4 Pro (Together) DeepSeek V4	$2.10	$4.40	$6.50
Mistral	Magistral Medium Magistral	$2.00	$5.00	$7.00
Mistral	Mixtral 8x22B Mistral	$2.00	$6.00	$8.00
Mistral	Pixtral Large Mistral	$2.00	$6.00	$8.00
Mistral	Mistral Medium 3.5 Mistral Medium	$1.50	$7.50	$9.00
OpenAI	GPT-4.1 GPT-4.1	$2.00	$8.00	$10.00
OpenAI	o3 o-series	$2.00	$8.00	$10.00
perplexity	Sonar Deep Research Sonar	$2.00	$8.00	$10.00
perplexity	Sonar Reasoning Pro Sonar	$2.00	$8.00	$10.00
Google	Gemini 3.5 Flash Gemini 3.5	$1.50	$9.00	$10.50
Google	Gemini 2.5 Pro Gemini 2.5	$1.25	$10.00	$11.25
OpenAI	GPT-5 GPT-5	$1.25	$10.00	$11.25
OpenAI	GPT-5.1 GPT-5	$1.25	$10.00	$11.25
cohere	Command A Command A	$2.50	$10.00	$12.50
cohere	Command R+ 08-2024 Command R	$2.50	$10.00	$12.50
OpenAI	GPT-4o GPT-4o	$2.50	$10.00	$12.50
Google	Gemini 3 Pro Gemini 3	$2.00	$12.00	$14.00
Google	Gemini 3.1 Pro Gemini 3	$2.00	$12.00	$14.00
OpenAI	GPT-5.2 GPT-5	$1.75	$14.00	$15.75
Google	Gemini 2.5 Pro (>200k tokens) Gemini 2.5	$2.50	$15.00	$17.50
OpenAI	GPT-5.4 GPT-5.4	$2.50	$15.00	$17.50
Anthropic	Claude Sonnet 4.6 Claude 4.6	$3.00	$15.00	$18.00
perplexity	Sonar Pro Sonar	$3.00	$15.00	$18.00
Anthropic	Claude Opus 4.8 Claude 4.8	$5.00	$25.00	$30.00
OpenAI	GPT-5.5 GPT-5.5	$5.00	$30.00	$35.00
OpenAI	o3-pro o-series	$20.00	$80.00	$100.00
OpenAI	GPT-5 Pro GPT-5	$15.00	$120.00	$135.00
OpenAI	GPT-5.2 Pro GPT-5	$21.00	$168.00	$189.00
OpenAI	GPT-5.4 Pro GPT-5.4	$30.00	$180.00	$210.00
OpenAI	GPT-5.5 Pro GPT-5.5	$30.00	$180.00	$210.00

Embed v3 English
cohere
Total
$0.10
Input$0.10
Output$0.00
Embed v3 Multilingual
cohere
Total
$0.10
Input$0.10
Output$0.00
Llama 3.1 8b Instant
groq
Total
$0.13
Input$0.05
Output$0.08
Command R7B
cohere
Total
$0.1875
Input$0.0375
Output$0.15
Ministral 3B
Mistral
Total
$0.20
Input$0.10
Output$0.10
GPT-OSS 20B (Together)
together
Total
$0.25
Input$0.05
Output$0.20
Ministral 8B
Mistral
Total
$0.30
Input$0.15
Output$0.15
Mistral NeMo
Mistral
Total
$0.30
Input$0.15
Output$0.15
Pixtral 12B
Mistral
Total
$0.30
Input$0.15
Output$0.15
GPT OSS Safeguard 20B
groq
Total
$0.375
Input$0.075
Output$0.30
Openai/gpt Oss 20b
groq
Total
$0.375
Input$0.075
Output$0.30
Llama 4 Scout
Meta
Total
$0.38
Input$0.08
Output$0.30
Devstral Small 2
Mistral
Total
$0.40
Input$0.10
Output$0.30
Ministral 14B
Mistral
Total
$0.40
Input$0.20
Output$0.20
Mistral Small 4
Mistral
Total
$0.40
Input$0.10
Output$0.30
DeepSeek V4 Flash
DeepSeek
Total
$0.42
Input$0.14
Output$0.28
GPT-5 nano
OpenAI
Total
$0.45
Input$0.05
Output$0.40
Llama 4 Scout 17B 16E Instruct
groq
Total
$0.45
Input$0.11
Output$0.34
Gemini 2.5 Flash-Lite
Google
Total
$0.50
Input$0.10
Output$0.40
Mistral 7B
Mistral
Total
$0.50
Input$0.25
Output$0.25
Command R 08-2024
cohere
Total
$0.75
Input$0.15
Output$0.60
GPT-4o mini
OpenAI
Total
$0.75
Input$0.15
Output$0.60
GPT-OSS 120B (Together)
together
Total
$0.75
Input$0.15
Output$0.60
Llama 4 Maverick
Meta
Total
$0.75
Input$0.15
Output$0.60
Openai/gpt Oss 120b
groq
Total
$0.75
Input$0.15
Output$0.60
Qwen3 32B
groq
Total
$0.88
Input$0.29
Output$0.59
Codestral
Mistral
Total
$1.20
Input$0.30
Output$0.90
Llama 3.3 70b Versatile
groq
Total
$1.38
Input$0.59
Output$0.79
Mixtral 8x7B
Mistral
Total
$1.40
Input$0.70
Output$0.70
GPT-5.4 nano
OpenAI
Total
$1.45
Input$0.20
Output$1.25
MiniMax M2.7 (Together)
together
Total
$1.50
Input$0.30
Output$1.20
Gemini 3.1 Flash-Lite
Google
Total
$1.75
Input$0.25
Output$1.50
Llama 3.3 70B (Together)
together
Total
$1.76
Input$0.88
Output$0.88
GPT-4.1 mini
OpenAI
Total
$2.00
Input$0.40
Output$1.60
Magistral Small
Mistral
Total
$2.00
Input$0.50
Output$1.50
Mistral Large 3
Mistral
Total
$2.00
Input$0.50
Output$1.50
Rerank v3
cohere
Total
$2.00
Input$2.00
Output$0.00
Sonar
perplexity
Total
$2.00
Input$1.00
Output$1.00
GPT-5 mini
OpenAI
Total
$2.25
Input$0.25
Output$2.00
Devstral Medium 2
Mistral
Total
$2.40
Input$0.40
Output$2.00
Gemini 2.5 Flash
Google
Total
$2.80
Input$0.30
Output$2.50
Gemini 3 Flash
Google
Total
$3.50
Input$0.50
Output$3.00
Grok 4.3
xAI
Total
$3.75
Input$1.25
Output$2.50
Qwen3.7-Max (Together)
together
Total
$5.00
Input$1.25
Output$3.75
DeepSeek V4 Pro
DeepSeek
Total
$5.22
Input$1.74
Output$3.48
GPT-5.4 mini
OpenAI
Total
$5.25
Input$0.75
Output$4.50
Kimi K2.6 (Together)
together
Total
$5.70
Input$1.20
Output$4.50
GLM-5.1 (Together)
together
Total
$5.80
Input$1.40
Output$4.40
Claude Haiku 4.5
Anthropic
Total
$6.00
Input$1.00
Output$5.00
DeepSeek V4 Pro (Together)
together
Total
$6.50
Input$2.10
Output$4.40
Magistral Medium
Mistral
Total
$7.00
Input$2.00
Output$5.00
Mixtral 8x22B
Mistral
Total
$8.00
Input$2.00
Output$6.00
Pixtral Large
Mistral
Total
$8.00
Input$2.00
Output$6.00
Mistral Medium 3.5
Mistral
Total
$9.00
Input$1.50
Output$7.50
GPT-4.1
OpenAI
Total
$10.00
Input$2.00
Output$8.00
o3
OpenAI
Total
$10.00
Input$2.00
Output$8.00
Sonar Deep Research
perplexity
Total
$10.00
Input$2.00
Output$8.00
Sonar Reasoning Pro
perplexity
Total
$10.00
Input$2.00
Output$8.00
Gemini 3.5 Flash
Google
Total
$10.50
Input$1.50
Output$9.00
Gemini 2.5 Pro
Google
Total
$11.25
Input$1.25
Output$10.00
GPT-5
OpenAI
Total
$11.25
Input$1.25
Output$10.00
GPT-5.1
OpenAI
Total
$11.25
Input$1.25
Output$10.00
Command A
cohere
Total
$12.50
Input$2.50
Output$10.00
Command R+ 08-2024
cohere
Total
$12.50
Input$2.50
Output$10.00
GPT-4o
OpenAI
Total
$12.50
Input$2.50
Output$10.00
Gemini 3 Pro
Google
Total
$14.00
Input$2.00
Output$12.00
Gemini 3.1 Pro
Google
Total
$14.00
Input$2.00
Output$12.00
GPT-5.2
OpenAI
Total
$15.75
Input$1.75
Output$14.00
Gemini 2.5 Pro (>200k tokens)
Google
Total
$17.50
Input$2.50
Output$15.00
GPT-5.4
OpenAI
Total
$17.50
Input$2.50
Output$15.00
Claude Sonnet 4.6
Anthropic
Total
$18.00
Input$3.00
Output$15.00
Sonar Pro
perplexity
Total
$18.00
Input$3.00
Output$15.00
Claude Opus 4.8
Anthropic
Total
$30.00
Input$5.00
Output$25.00
GPT-5.5
OpenAI
Total
$35.00
Input$5.00
Output$30.00
o3-pro
OpenAI
Total
$100.00
Input$20.00
Output$80.00
GPT-5 Pro
OpenAI
Total
$135.00
Input$15.00
Output$120.00
GPT-5.2 Pro
OpenAI
Total
$189.00
Input$21.00
Output$168.00
GPT-5.4 Pro
OpenAI
Total
$210.00
Input$30.00
Output$180.00
GPT-5.5 Pro
OpenAI
Total
$210.00
Input$30.00
Output$180.00

79 models · 1,000,000 in · 1,000,000 out

How do I use the AI token cost calculator?

Enter expected input tokens — roughly 0.75 words or 4 characters per token. A 2,000-word prompt is ~2,700 tokens.
Enter expected output tokens — model responses are usually 200–2,000 tokens unless you explicitly set max_tokens.
Set monthly request volume — multiplies the single-request cost to estimate monthly spend.
Compare rows — the table sorts cheapest-first. Cached-input rates drop many providers by 75–90%.
Click a model to jump to its provider page for context, FAQ, and rate-limit details.

Methodology

All prices come from the official API pricing pages of each provider, checked daily. The formula for a single request is:

cost = (input_tokens / 1,000,000 * input_rate) + (output_tokens / 1,000,000 * output_rate)

When the cached-input slider is above 0%, the input portion splits into cached and non-cached fractions, each multiplied by the respective rate. Models without a published cached rate use the standard input rate for both.

AI Token Cost Calculator

Count tokens from text

How do I use the AI token cost calculator?

Methodology