AI Token Cost Calculator
Enter your expected token usage and instantly see costs across all major AI providers. Last updated: . Provider prices can change without notice, so verify directly before committing spend.
How do I calculate AI API costs? Multiply your input tokens by the provider's input rate and your output tokens by the output rate, then divide by 1 million. For example, 500,000 input + 100,000 output on GPT-5.4 ($2.50/$15.00 per 1M) = $1.25 + $1.50 = $2.75 per request cycle. This calculator runs that math across all 89 tracked models so you can spot the cheapest option in one glance.
0%
Output auto-calculates from input — pick a workload or switch to Manual.
Tip: type 1M, 500k or 10,000. Cached-input tokens are billed at each model's discounted cached rate.
16 legacy models hidden —.
| Provider | Model | Input cost ↕ | Output cost ↕ | Total ↑ | |
|---|---|---|---|---|---|
| Mistral | Ministral 3B Ministral | $0.04 | $0.04 | $0.08 | |
| cohere | Embed v3 English Embed | $0.10 | $0.00 | $0.10 | |
| cohere | Embed v3 Multilingual Embed | $0.10 | $0.00 | $0.10 | |
| groq | Llama 3.1 8b Instant groq | $0.05 | $0.08 | $0.13 | |
| cohere | Command R7B Command R | $0.0375 | $0.15 | $0.1875 | |
| Mistral | Ministral 8B Ministral | $0.10 | $0.10 | $0.20 | |
| together | GPT-OSS 20B (Together) GPT-OSS | $0.05 | $0.20 | $0.25 | |
| Mistral | Mistral NeMo Mistral Open | $0.15 | $0.15 | $0.30 | |
| Mistral | Pixtral 12B Mistral | $0.15 | $0.15 | $0.30 | |
| groq | GPT OSS Safeguard 20B GPT OSS | $0.075 | $0.30 | $0.375 | |
| groq | Openai/gpt Oss 20b groq | $0.075 | $0.30 | $0.375 | |
| Meta | Llama 4 Scout Llama 4 | $0.08 | $0.30 | $0.38 | |
| Mistral | Devstral Small 2 Devstral | $0.10 | $0.30 | $0.40 | |
| Mistral | Ministral 14B Ministral | $0.20 | $0.20 | $0.40 | |
| DeepSeek | DeepSeek V4 Flash DeepSeek V4 | $0.14 | $0.28 | $0.42 | |
| groq | Llama 4 Scout 17B 16E Instruct Llama 4 | $0.11 | $0.34 | $0.45 | |
| OpenAI | GPT-4.1 nano GPT-4.1 | $0.10 | $0.40 | $0.50 | |
| Mistral | Mistral 7B Mistral Open | $0.25 | $0.25 | $0.50 | |
| cohere | Command R 08-2024 Command R | $0.15 | $0.60 | $0.75 | |
| OpenAI | GPT-4o mini GPT-4o | $0.15 | $0.60 | $0.75 | |
| together | GPT-OSS 120B (Together) GPT-OSS | $0.15 | $0.60 | $0.75 | |
| Mistral | Mistral Small 4 Mistral Small | $0.15 | $0.60 | $0.75 | |
| groq | Openai/gpt Oss 120b groq | $0.15 | $0.60 | $0.75 | |
| Meta | Llama 4 Maverick Llama 4 | $0.20 | $0.60 | $0.80 | |
| groq | Qwen3 32B Qwen3 | $0.29 | $0.59 | $0.88 | |
| Mistral | Codestral Mistral | $0.30 | $0.90 | $1.20 | |
| groq | Llama 3.3 70b Versatile groq | $0.59 | $0.79 | $1.38 | |
| Mistral | Mixtral 8x7B Mixtral | $0.70 | $0.70 | $1.40 | |
| OpenAI | GPT-5.4 nano GPT-5.4 | $0.20 | $1.25 | $1.45 | |
| together | MiniMax M2.7 (Together) MiniMax M2 | $0.30 | $1.20 | $1.50 | |
Gemini 3.1 Flash-Lite Gemini 3 | $0.25 | $1.50 | $1.75 | ||
| together | Llama 3.3 70B (Together) Llama 3.3 | $0.88 | $0.88 | $1.76 | |
| OpenAI | GPT-4.1 mini GPT-4.1 | $0.40 | $1.60 | $2.00 | |
| Mistral | Magistral Small Magistral | $0.50 | $1.50 | $2.00 | |
| Mistral | Mistral Large 3 Mistral | $0.50 | $1.50 | $2.00 | |
| cohere | Rerank v3 Rerank | $2.00 | $0.00 | $2.00 | |
| perplexity | Sonar Sonar | $1.00 | $1.00 | $2.00 | |
| together | DeepSeek V3.1 (Together) DeepSeek | $0.60 | $1.70 | $2.30 | |
| Mistral | Devstral Medium 2 Devstral | $0.40 | $2.00 | $2.40 | |
Gemini 2.5 Flash Gemini 2.5 | $0.30 | $2.50 | $2.80 | ||
Gemini 3 Flash Gemini 3 | $0.50 | $3.00 | $3.50 | ||
| together | Qwen3.6-Plus (Together) Qwen3.6 | $0.50 | $3.00 | $3.50 | |
| xAI | Grok 4.3 Grok 4.3 | $1.25 | $2.50 | $3.75 | |
| DeepSeek | DeepSeek V4 Pro DeepSeek V4 | $1.74 | $3.48 | $5.22 | |
| OpenAI | GPT-5.4 mini GPT-5.4 | $0.75 | $4.50 | $5.25 | |
| OpenAI | o4-mini o-series | $1.10 | $4.40 | $5.50 | |
| together | Kimi K2.6 (Together) Kimi K2 | $1.20 | $4.50 | $5.70 | |
| together | GLM-5.1 (Together) GLM-5 | $1.40 | $4.40 | $5.80 | |
| Anthropic | Claude Haiku 4.5 Claude 4.5 | $1.00 | $5.00 | $6.00 | |
| together | DeepSeek V4 Pro (Together) DeepSeek V4 | $2.10 | $4.40 | $6.50 | |
| Mistral | Magistral Medium Magistral | $2.00 | $5.00 | $7.00 | |
| Mistral | Mixtral 8x22B Mistral | $2.00 | $6.00 | $8.00 | |
| Mistral | Pixtral Large Mistral | $2.00 | $6.00 | $8.00 | |
| Mistral | Mistral Medium 3.5 Mistral Medium | $1.50 | $7.50 | $9.00 | |
| together | DeepSeek R1 (Together) DeepSeek | $3.00 | $7.00 | $10.00 | |
| OpenAI | GPT-4.1 GPT-4.1 | $2.00 | $8.00 | $10.00 | |
| OpenAI | o3 o-series | $2.00 | $8.00 | $10.00 | |
| perplexity | Sonar Deep Research Sonar | $2.00 | $8.00 | $10.00 | |
| perplexity | Sonar Reasoning Pro Sonar | $2.00 | $8.00 | $10.00 | |
Gemini 2.5 Pro Gemini 2.5 | $1.25 | $10.00 | $11.25 | ||
| cohere | Command A Command A | $2.50 | $10.00 | $12.50 | |
| cohere | Command R+ 08-2024 Command R | $2.50 | $10.00 | $12.50 | |
| OpenAI | GPT-4o GPT-4o | $2.50 | $10.00 | $12.50 | |
Gemini 3 Pro Gemini 3 | $2.00 | $12.00 | $14.00 | ||
Gemini 3.1 Pro Gemini 3 | $2.00 | $12.00 | $14.00 | ||
Gemini 2.5 Pro (>200k tokens) Gemini 2.5 | $2.50 | $15.00 | $17.50 | ||
| OpenAI | GPT-5.4 GPT-5.4 | $2.50 | $15.00 | $17.50 | |
| Anthropic | Claude Sonnet 4.6 Claude 4.6 | $3.00 | $15.00 | $18.00 | |
| perplexity | Sonar Pro Sonar | $3.00 | $15.00 | $18.00 | |
| Anthropic | Claude Opus 4.7 Claude 4.7 | $5.00 | $25.00 | $30.00 | |
| OpenAI | GPT-5.5 GPT-5.5 | $5.00 | $30.00 | $35.00 | |
| OpenAI | GPT-5.4 Pro GPT-5.4 | $30.00 | $180.00 | $210.00 | |
| OpenAI | GPT-5.5 Pro GPT-5.5 | $30.00 | $180.00 | $210.00 |
- Ministral 3BMistralTotal$0.08Input$0.04Output$0.04
- Embed v3 EnglishcohereTotal$0.10Input$0.10Output$0.00
- Embed v3 MultilingualcohereTotal$0.10Input$0.10Output$0.00
- Llama 3.1 8b InstantgroqTotal$0.13Input$0.05Output$0.08
- Command R7BcohereTotal$0.1875Input$0.0375Output$0.15
- Ministral 8BMistralTotal$0.20Input$0.10Output$0.10
- GPT-OSS 20B (Together)togetherTotal$0.25Input$0.05Output$0.20
- Mistral NeMoMistralTotal$0.30Input$0.15Output$0.15
- Pixtral 12BMistralTotal$0.30Input$0.15Output$0.15
- GPT OSS Safeguard 20BgroqTotal$0.375Input$0.075Output$0.30
- Openai/gpt Oss 20bgroqTotal$0.375Input$0.075Output$0.30
- Llama 4 ScoutMetaTotal$0.38Input$0.08Output$0.30
- Devstral Small 2MistralTotal$0.40Input$0.10Output$0.30
- Ministral 14BMistralTotal$0.40Input$0.20Output$0.20
- DeepSeek V4 FlashDeepSeekTotal$0.42Input$0.14Output$0.28
- Llama 4 Scout 17B 16E InstructgroqTotal$0.45Input$0.11Output$0.34
- GPT-4.1 nanoOpenAITotal$0.50Input$0.10Output$0.40
- Mistral 7BMistralTotal$0.50Input$0.25Output$0.25
- Command R 08-2024cohereTotal$0.75Input$0.15Output$0.60
- GPT-4o miniOpenAITotal$0.75Input$0.15Output$0.60
- GPT-OSS 120B (Together)togetherTotal$0.75Input$0.15Output$0.60
- Mistral Small 4MistralTotal$0.75Input$0.15Output$0.60
- Openai/gpt Oss 120bgroqTotal$0.75Input$0.15Output$0.60
- Llama 4 MaverickMetaTotal$0.80Input$0.20Output$0.60
- Qwen3 32BgroqTotal$0.88Input$0.29Output$0.59
- CodestralMistralTotal$1.20Input$0.30Output$0.90
- Llama 3.3 70b VersatilegroqTotal$1.38Input$0.59Output$0.79
- Mixtral 8x7BMistralTotal$1.40Input$0.70Output$0.70
- GPT-5.4 nanoOpenAITotal$1.45Input$0.20Output$1.25
- MiniMax M2.7 (Together)togetherTotal$1.50Input$0.30Output$1.20
- Gemini 3.1 Flash-LiteGoogleTotal$1.75Input$0.25Output$1.50
- Llama 3.3 70B (Together)togetherTotal$1.76Input$0.88Output$0.88
- GPT-4.1 miniOpenAITotal$2.00Input$0.40Output$1.60
- Magistral SmallMistralTotal$2.00Input$0.50Output$1.50
- Mistral Large 3MistralTotal$2.00Input$0.50Output$1.50
- Rerank v3cohereTotal$2.00Input$2.00Output$0.00
- SonarperplexityTotal$2.00Input$1.00Output$1.00
- DeepSeek V3.1 (Together)togetherTotal$2.30Input$0.60Output$1.70
- Devstral Medium 2MistralTotal$2.40Input$0.40Output$2.00
- Gemini 2.5 FlashGoogleTotal$2.80Input$0.30Output$2.50
- Gemini 3 FlashGoogleTotal$3.50Input$0.50Output$3.00
- Qwen3.6-Plus (Together)togetherTotal$3.50Input$0.50Output$3.00
- Grok 4.3xAITotal$3.75Input$1.25Output$2.50
- DeepSeek V4 ProDeepSeekTotal$5.22Input$1.74Output$3.48
- GPT-5.4 miniOpenAITotal$5.25Input$0.75Output$4.50
- o4-miniOpenAITotal$5.50Input$1.10Output$4.40
- Kimi K2.6 (Together)togetherTotal$5.70Input$1.20Output$4.50
- GLM-5.1 (Together)togetherTotal$5.80Input$1.40Output$4.40
- Claude Haiku 4.5AnthropicTotal$6.00Input$1.00Output$5.00
- DeepSeek V4 Pro (Together)togetherTotal$6.50Input$2.10Output$4.40
- Magistral MediumMistralTotal$7.00Input$2.00Output$5.00
- Mixtral 8x22BMistralTotal$8.00Input$2.00Output$6.00
- Pixtral LargeMistralTotal$8.00Input$2.00Output$6.00
- Mistral Medium 3.5MistralTotal$9.00Input$1.50Output$7.50
- DeepSeek R1 (Together)togetherTotal$10.00Input$3.00Output$7.00
- GPT-4.1OpenAITotal$10.00Input$2.00Output$8.00
- o3OpenAITotal$10.00Input$2.00Output$8.00
- Sonar Deep ResearchperplexityTotal$10.00Input$2.00Output$8.00
- Sonar Reasoning ProperplexityTotal$10.00Input$2.00Output$8.00
- Gemini 2.5 ProGoogleTotal$11.25Input$1.25Output$10.00
- Command AcohereTotal$12.50Input$2.50Output$10.00
- Command R+ 08-2024cohereTotal$12.50Input$2.50Output$10.00
- GPT-4oOpenAITotal$12.50Input$2.50Output$10.00
- Gemini 3 ProGoogleTotal$14.00Input$2.00Output$12.00
- Gemini 3.1 ProGoogleTotal$14.00Input$2.00Output$12.00
- Gemini 2.5 Pro (>200k tokens)GoogleTotal$17.50Input$2.50Output$15.00
- GPT-5.4OpenAITotal$17.50Input$2.50Output$15.00
- Claude Sonnet 4.6AnthropicTotal$18.00Input$3.00Output$15.00
- Sonar ProperplexityTotal$18.00Input$3.00Output$15.00
- Claude Opus 4.7AnthropicTotal$30.00Input$5.00Output$25.00
- GPT-5.5OpenAITotal$35.00Input$5.00Output$30.00
- GPT-5.4 ProOpenAITotal$210.00Input$30.00Output$180.00
- GPT-5.5 ProOpenAITotal$210.00Input$30.00Output$180.00
73 models · 1,000,000 in · 1,000,000 out
How do I use the AI token cost calculator?
- Enter expected input tokens — roughly 0.75 words or 4 characters per token. A 2,000-word prompt is ~2,700 tokens.
- Enter expected output tokens — model responses are usually 200–2,000 tokens unless you explicitly set
max_tokens. - Set monthly request volume — multiplies the single-request cost to estimate monthly spend.
- Compare rows — the table sorts cheapest-first. Cached-input rates drop many providers by 75–90%.
- Click a model to jump to its provider page for context, FAQ, and rate-limit details.