How Many AI Tokens Can I Get for My Budget?
Enter a monthly dollar amount and instantly see how many tokens each AI model gives you. Last updated:
How far does my AI budget go? Different models have wildly different per-token rates. A $50 monthly budget buys you about 1,333 million input tokens on Command R7B but only about 2 million on GPT-5.4 Pro. This calculator divides your budget by each model's input and output rates and ranks every model by total token allowance, so you can find the best bang for your buck.
Your Budget
Token Allowance per Model — $50.00/mo Budget
Sorted by total tokens (most to least). Split: 50% input / 50% output.
| # | Model | Provider | Input Tokens | Output Tokens | Total Tokens |
|---|---|---|---|---|---|
| 1 | Embed v3 English | cohere | 250.0M | InfinityB | InfinityB |
| 2 | Embed v3 Multilingual | cohere | 250.0M | InfinityB | InfinityB |
| 3 | Rerank v3 | cohere | 12.5M | InfinityB | InfinityB |
| 4 | Command R7B | cohere | 666.7M | 166.7M | 833.3M |
| 5 | Llama 3.1 8b Instant | groq | 500.0M | 312.5M | 812.5M |
| 6 | GPT-OSS 20B (Together) | together | 500.0M | 125.0M | 625.0M |
| 7 | GPT-5 nano | openai | 500.0M | 62.5M | 562.5M |
| 8 | Ministral 3B | mistral | 250.0M | 250.0M | 500.0M |
| 9 | Gemini 2.0 Flash-Lite | 333.3M | 83.3M | 416.7M | |
| 10 | GPT OSS Safeguard 20B | groq | 333.3M | 83.3M | 416.7M |
| 11 | Openai/gpt Oss 20b | groq | 333.3M | 83.3M | 416.7M |
| 12 | Llama 4 Scout | meta | 312.5M | 83.3M | 395.8M |
| 13 | Devstral Small 2 | mistral | 250.0M | 83.3M | 333.3M |
| 14 | Ministral 8B | mistral | 166.7M | 166.7M | 333.3M |
| 15 | Mistral NeMo | mistral | 166.7M | 166.7M | 333.3M |
| 16 | Mistral Small 4 | mistral | 250.0M | 83.3M | 333.3M |
| 17 | Pixtral 12B | mistral | 166.7M | 166.7M | 333.3M |
| 18 | Gemini 2.0 Flash | 250.0M | 62.5M | 312.5M | |
| 19 | Gemini 2.5 Flash-Lite | 250.0M | 62.5M | 312.5M | |
| 20 | GPT-4.1 nano | openai | 250.0M | 62.5M | 312.5M |
| 21 | Llama 4 Scout 17B 16E Instruct | groq | 227.3M | 73.5M | 300.8M |
| 22 | DeepSeek V4 Flash | deepseek | 178.6M | 89.3M | 267.9M |
| 23 | Ministral 14B | mistral | 125.0M | 125.0M | 250.0M |
| 24 | Command R 08-2024 | cohere | 166.7M | 41.7M | 208.3M |
| 25 | GPT-4o mini | openai | 166.7M | 41.7M | 208.3M |
| 26 | GPT-OSS 120B (Together) | together | 166.7M | 41.7M | 208.3M |
| 27 | Llama 4 Maverick | meta | 166.7M | 41.7M | 208.3M |
| 28 | Openai/gpt Oss 120b | groq | 166.7M | 41.7M | 208.3M |
| 29 | Mistral 7B | mistral | 100.0M | 100.0M | 200.0M |
| 30 | Grok 4 1 Fast Non Reasoning | xai | 125.0M | 50.0M | 175.0M |
| 31 | Grok 4 1 Fast Reasoning | xai | 125.0M | 50.0M | 175.0M |
| 32 | Grok 4.1 Fast | xai | 125.0M | 50.0M | 175.0M |
| 33 | GPT-5.4 nano | openai | 125.0M | 20.0M | 145.0M |
| 34 | Qwen3 32B | groq | 86.2M | 42.4M | 128.6M |
| 35 | Gemini 3.1 Flash-Lite | 100.0M | 16.7M | 116.7M | |
| 36 | GPT-5 mini | openai | 100.0M | 12.5M | 112.5M |
| 37 | Codestral | mistral | 83.3M | 27.8M | 111.1M |
| 38 | MiniMax M2.7 (Together) | together | 83.3M | 20.8M | 104.2M |
| 39 | Gemini 2.5 Flash | 83.3M | 10.0M | 93.3M | |
| 40 | GPT-4.1 mini | openai | 62.5M | 15.6M | 78.1M |
| 41 | Devstral Medium 2 | mistral | 62.5M | 12.5M | 75.0M |
| 42 | Mistral Medium 3 | mistral | 62.5M | 12.5M | 75.0M |
| 43 | Llama 3.3 70b Versatile | groq | 42.4M | 31.6M | 74.0M |
| 44 | Mixtral 8x7B | mistral | 35.7M | 35.7M | 71.4M |
| 45 | Magistral Small | mistral | 50.0M | 16.7M | 66.7M |
| 46 | Mistral Large 3 | mistral | 50.0M | 16.7M | 66.7M |
| 47 | Gemini 3 Flash | 50.0M | 8.3M | 58.3M | |
| 48 | Qwen3.6-Plus (Together) | together | 50.0M | 8.3M | 58.3M |
| 49 | Llama 3.3 70B (Together) | together | 28.4M | 28.4M | 56.8M |
| 50 | DeepSeek V3.1 (Together) | together | 41.7M | 14.7M | 56.4M |
| 51 | Sonar | perplexity | 25.0M | 25.0M | 50.0M |
| 52 | Mixtral 8x22B (Together) | together | 20.8M | 20.8M | 41.7M |
| 53 | Qwen 2.5 72B (Together) | together | 20.8M | 20.8M | 41.7M |
| 54 | DeepSeek V3 (Together) | together | 20.0M | 20.0M | 40.0M |
| 55 | GPT-5.4 mini | openai | 33.3M | 5.6M | 38.9M |
| 56 | Claude Haiku 4.5 | anthropic | 25.0M | 5.0M | 30.0M |
| 57 | Grok 4.20 | xai | 20.0M | 10.0M | 30.0M |
| 58 | Grok 4.3 | xai | 20.0M | 10.0M | 30.0M |
| 59 | o4-mini | openai | 22.7M | 5.7M | 28.4M |
| 60 | Qwen3.7-Max (Together) | together | 20.0M | 6.7M | 26.7M |
| 61 | Kimi K2.6 (Together) | together | 20.8M | 5.6M | 26.4M |
| 62 | GLM-5.1 (Together) | together | 17.9M | 5.7M | 23.5M |
| 63 | Gemini 2.5 Pro | 20.0M | 2.5M | 22.5M | |
| 64 | GPT-5 | openai | 20.0M | 2.5M | 22.5M |
| 65 | GPT-5.1 | openai | 20.0M | 2.5M | 22.5M |
| 66 | DeepSeek V4 Pro | deepseek | 14.4M | 7.2M | 21.6M |
| 67 | Mistral Medium 3.5 | mistral | 16.7M | 3.3M | 20.0M |
| 68 | Gemini 3.5 Flash | 16.7M | 2.8M | 19.4M | |
| 69 | DeepSeek V4 Pro (Together) | together | 11.9M | 5.7M | 17.6M |
| 70 | Magistral Medium | mistral | 12.5M | 5.0M | 17.5M |
| 71 | Mixtral 8x22B | mistral | 12.5M | 4.2M | 16.7M |
| 72 | Pixtral Large | mistral | 12.5M | 4.2M | 16.7M |
| 73 | GPT-5.2 | openai | 14.3M | 1.8M | 16.1M |
| 74 | GPT-4.1 | openai | 12.5M | 3.1M | 15.6M |
| 75 | o3 | openai | 12.5M | 3.1M | 15.6M |
| 76 | Sonar Deep Research | perplexity | 12.5M | 3.1M | 15.6M |
| 77 | Sonar Reasoning Pro | perplexity | 12.5M | 3.1M | 15.6M |
| 78 | Gemini 3 Pro | 12.5M | 2.1M | 14.6M | |
| 79 | Gemini 3.1 Pro | 12.5M | 2.1M | 14.6M | |
| 80 | Llama 3.1 405B (Together) | together | 7.1M | 7.1M | 14.3M |
| 81 | Command A | cohere | 10.0M | 2.5M | 12.5M |
| 82 | Command R+ 08-2024 | cohere | 10.0M | 2.5M | 12.5M |
| 83 | GPT-4o | openai | 10.0M | 2.5M | 12.5M |
| 84 | DeepSeek R1 (Together) | together | 8.3M | 3.6M | 11.9M |
| 85 | Gemini 2.5 Pro (>200k tokens) | 10.0M | 1.7M | 11.7M | |
| 86 | GPT-5.4 | openai | 10.0M | 1.7M | 11.7M |
| 87 | Mistral Large (Together) | together | 8.3M | 2.8M | 11.1M |
| 88 | Claude Sonnet 4 | anthropic | 8.3M | 1.7M | 10.0M |
| 89 | Claude Sonnet 4.5 | anthropic | 8.3M | 1.7M | 10.0M |
| 90 | Claude Sonnet 4.6 | anthropic | 8.3M | 1.7M | 10.0M |
| 91 | Sonar Pro | perplexity | 8.3M | 1.7M | 10.0M |
| 92 | Claude Opus 4.5 | anthropic | 5.0M | 1.0M | 6.0M |
| 93 | Claude Opus 4.6 | anthropic | 5.0M | 1.0M | 6.0M |
| 94 | Claude Opus 4.7 | anthropic | 5.0M | 1.0M | 6.0M |
| 95 | Claude Opus 4.8 | anthropic | 5.0M | 1.0M | 6.0M |
| 96 | GPT-5.5 | openai | 5.0M | 833.3K | 5.8M |
| 97 | Claude Opus 4 | anthropic | 1.7M | 333.3K | 2.0M |
| 98 | Claude Opus 4.1 | anthropic | 1.7M | 333.3K | 2.0M |
| 99 | GPT-5 Pro | openai | 1.7M | 208.3K | 1.9M |
| 100 | o3-pro | openai | 1.3M | 312.5K | 1.6M |
| 101 | GPT-5.2 Pro | openai | 1.2M | 148.8K | 1.3M |
| 102 | GPT-5.4 Pro | openai | 833.3K | 138.9K | 972.2K |
| 103 | GPT-5.5 Pro | openai | 833.3K | 138.9K | 972.2K |
How does this calculator work?
- Enter your monthly budget — the total dollar amount you want to spend on AI API calls per month.
- Optionally select a focus model — highlights that model in the table and shows a detailed breakdown.
- Choose a budget split — decide how to allocate between input and output tokens (50/50, 80/20, or 20/80).
- Compare the table — models are ranked from most tokens to least, so the best-value models appear first.
Methodology
Token allowance is calculated by inverting the standard cost formula:
input_tokens = (budget × split_ratio / input_rate_per_M) × 1,000,000
output_tokens = (budget × (1 - split_ratio) / output_rate_per_M) × 1,000,000
The budget split controls what fraction of your monthly spend goes toward input vs output tokens. For most chat use cases, 50/50 is a reasonable default. If you send long prompts with short replies, use 80/20. If you request long-form content generation, use 20/80.
All rates come from our daily-updated pricing database. Models with $0 rates (free tiers) are excluded from ranking.
Need AI-powered writing on a budget? Writesonic gives you unlimited AI writing features at a flat monthly rate — great if you want predictable costs without counting tokens.