How Many AI Tokens Can I Get for My Budget?

Enter a monthly dollar amount and instantly see how many tokens each AI model gives you. Last updated:

How far does my AI budget go? Different models have wildly different per-token rates. A $50 monthly budget buys you about 1,667 million input tokens on LFM2 24B A2B (Together) but only about 2 million on GPT-5.4 Pro. This calculator divides your budget by each model's input and output rates and ranks every model by total token allowance, so you can find the best bang for your buck.

Your Budget

$25.00 input / $25.00 output

Token Allowance per Model — $50.00/mo Budget

Sorted by total tokens (most to least). Split: 50% input / 50% output.

#ModelProviderInput TokensOutput TokensTotal Tokens
1Embed v3 Englishcohere250.0MInfinityBInfinityB
2Embed v3 Multilingualcohere250.0MInfinityBInfinityB
3Rerank v3cohere12.5MInfinityBInfinityB
4LFM2 24B A2B (Together)together833.3M208.3M1.0B
5Command R7Bcohere666.7M166.7M833.3M
6Llama 3.1 8b Instantgroq500.0M312.5M812.5M
7Gemma 3n E4B Instruct (Together)together416.7M208.3M625.0M
8GPT-OSS 20B (Together)together500.0M125.0M625.0M
9GPT-5 nanoopenai500.0M62.5M562.5M
10Ministral 3Bmistral250.0M250.0M500.0M
11Gemini 2.0 Flash-Litegoogle333.3M83.3M416.7M
12GPT OSS Safeguard 20Bgroq333.3M83.3M416.7M
13Openai/gpt Oss 20bgroq333.3M83.3M416.7M
14Llama 4 Scoutmeta312.5M83.3M395.8M
15Devstral Small 2mistral250.0M83.3M333.3M
16Ministral 8Bmistral166.7M166.7M333.3M
17Mistral NeMomistral166.7M166.7M333.3M
18Mistral Small 4mistral250.0M83.3M333.3M
19Pixtral 12Bmistral166.7M166.7M333.3M
20Rnj-1 Instruct (Together)together166.7M166.7M333.3M
21Gemini 2.0 Flashgoogle250.0M62.5M312.5M
22Gemini 2.5 Flash-Litegoogle250.0M62.5M312.5M
23GPT-4.1 nanoopenai250.0M62.5M312.5M
24Llama 4 Scout 17B 16E Instructgroq227.3M73.5M300.8M
25DeepSeek V4 Flashdeepseek178.6M89.3M267.9M
26Ministral 14Bmistral125.0M125.0M250.0M
27Qwen3.5 9B (Together)together147.1M100.0M247.1M
28Command R 08-2024cohere166.7M41.7M208.3M
29GPT-4o miniopenai166.7M41.7M208.3M
30GPT-OSS 120B (Together)together166.7M41.7M208.3M
31Llama 4 Maverickmeta166.7M41.7M208.3M
32Openai/gpt Oss 120bgroq166.7M41.7M208.3M
33Mistral 7Bmistral100.0M100.0M200.0M
34Grok 4 1 Fast Non Reasoningxai125.0M50.0M175.0M
35Grok 4 1 Fast Reasoningxai125.0M50.0M175.0M
36Grok 4.1 Fastxai125.0M50.0M175.0M
37GPT-5.4 nanoopenai125.0M20.0M145.0M
38Qwen3 32Bgroq86.2M42.4M128.6M
39Gemini 3.1 Flash-Litegoogle100.0M16.7M116.7M
40GPT-5 miniopenai100.0M12.5M112.5M
41Codestralmistral83.3M27.8M111.1M
42MiniMax M2.7 (Together)together83.3M20.8M104.2M
43Gemini 2.5 Flashgoogle83.3M10.0M93.3M
44DeepSeek V4 Prodeepseek57.5M28.7M86.2M
45GPT-4.1 miniopenai62.5M15.6M78.1M
46Devstral Medium 2mistral62.5M12.5M75.0M
47Mistral Medium 3mistral62.5M12.5M75.0M
48Llama 3.3 70b Versatilegroq42.4M31.6M74.0M
49Mixtral 8x7Bmistral35.7M35.7M71.4M
50Magistral Smallmistral50.0M16.7M66.7M
51Mistral Large 3mistral50.0M16.7M66.7M
52Gemini 3 Flashgoogle50.0M8.3M58.3M
53Qwen3.6-Plus (Together)together50.0M8.3M58.3M
54DeepSeek V3.1 (Together)together41.7M14.7M56.4M
55Sonarperplexity25.0M25.0M50.0M
56Qwen3.5 397B A17B (Together)together41.7M6.9M48.6M
57Llama 3.3 70B (Together)together24.0M24.0M48.1M
58Mixtral 8x22B (Together)together20.8M20.8M41.7M
59Qwen 2.5 72B (Together)together20.8M20.8M41.7M
60Cogito v2.1 671B (Together)together20.0M20.0M40.0M
61DeepSeek V3 (Together)together20.0M20.0M40.0M
62GPT-5.4 miniopenai33.3M5.6M38.9M
63GLM-5 (Together)together25.0M7.8M32.8M
64Claude Haiku 4.5anthropic25.0M5.0M30.0M
65Grok 4.20xai20.0M10.0M30.0M
66Grok 4.3xai20.0M10.0M30.0M
67o4-miniopenai22.7M5.7M28.4M
68Qwen3.7-Max (Together)together20.0M6.7M26.7M
69Kimi K2.6 (Together)together20.8M5.6M26.4M
70GLM-5.1 (Together)together17.9M5.7M23.5M
71Gemini 2.5 Progoogle20.0M2.5M22.5M
72GPT-5openai20.0M2.5M22.5M
73GPT-5.1openai20.0M2.5M22.5M
74Mistral Medium 3.5mistral16.7M3.3M20.0M
75Gemini 3.5 Flashgoogle16.7M2.8M19.4M
76DeepSeek V4 Pro (Together)together11.9M5.7M17.6M
77Magistral Mediummistral12.5M5.0M17.5M
78Mixtral 8x22Bmistral12.5M4.2M16.7M
79Pixtral Largemistral12.5M4.2M16.7M
80GPT-5.2openai14.3M1.8M16.1M
81GPT-4.1openai12.5M3.1M15.6M
82o3openai12.5M3.1M15.6M
83Sonar Deep Researchperplexity12.5M3.1M15.6M
84Sonar Reasoning Properplexity12.5M3.1M15.6M
85Gemini 3 Progoogle12.5M2.1M14.6M
86Gemini 3.1 Progoogle12.5M2.1M14.6M
87Llama 3.1 405B (Together)together7.1M7.1M14.3M
88Command Acohere10.0M2.5M12.5M
89Command R+ 08-2024cohere10.0M2.5M12.5M
90GPT-4oopenai10.0M2.5M12.5M
91DeepSeek R1 (Together)together8.3M3.6M11.9M
92Gemini 2.5 Pro (>200k tokens)google10.0M1.7M11.7M
93GPT-5.4openai10.0M1.7M11.7M
94Mistral Large (Together)together8.3M2.8M11.1M
95Claude Sonnet 4anthropic8.3M1.7M10.0M
96Claude Sonnet 4.5anthropic8.3M1.7M10.0M
97Claude Sonnet 4.6anthropic8.3M1.7M10.0M
98Sonar Properplexity8.3M1.7M10.0M
99Claude Opus 4.5anthropic5.0M1.0M6.0M
100Claude Opus 4.6anthropic5.0M1.0M6.0M
101Claude Opus 4.7anthropic5.0M1.0M6.0M
102Claude Opus 4.8anthropic5.0M1.0M6.0M
103GPT-5.5openai5.0M833.3K5.8M
104Claude Fable 5anthropic2.5M500.0K3.0M
105Claude Mythos 5anthropic2.5M500.0K3.0M
106Claude Opus 4anthropic1.7M333.3K2.0M
107Claude Opus 4.1anthropic1.7M333.3K2.0M
108GPT-5 Proopenai1.7M208.3K1.9M
109o3-proopenai1.3M312.5K1.6M
110GPT-5.2 Proopenai1.2M148.8K1.3M
111GPT-5.4 Proopenai833.3K138.9K972.2K
112GPT-5.5 Proopenai833.3K138.9K972.2K
112 models compared at $50.00/mo budget

How does this calculator work?

  1. Enter your monthly budget — the total dollar amount you want to spend on AI API calls per month.
  2. Optionally select a focus model — highlights that model in the table and shows a detailed breakdown.
  3. Choose a budget split — decide how to allocate between input and output tokens (50/50, 80/20, or 20/80).
  4. Compare the table — models are ranked from most tokens to least, so the best-value models appear first.

Methodology

Token allowance is calculated by inverting the standard cost formula:

input_tokens = (budget × split_ratio / input_rate_per_M) × 1,000,000

output_tokens = (budget × (1 - split_ratio) / output_rate_per_M) × 1,000,000

The budget split controls what fraction of your monthly spend goes toward input vs output tokens. For most chat use cases, 50/50 is a reasonable default. If you send long prompts with short replies, use 80/20. If you request long-form content generation, use 20/80.

All rates come from our daily-updated pricing database. Models with $0 rates (free tiers) are excluded from ranking.

Need AI-powered writing on a budget? Writesonic gives you unlimited AI writing features at a flat monthly rate — great if you want predictable costs without counting tokens.