How Many AI Tokens Can I Get for My Budget?

Enter a monthly dollar amount and instantly see how many tokens each AI model gives you. Last updated: 2026-07-17

How far does my AI budget go? Different models have wildly different per-token rates. A $50 monthly budget buys you about 1,333 million input tokens on Command R7B but only about 5 million on Claude Fable 5. This calculator divides your budget by each model's input and output rates and ranks every model by total token allowance, so you can find the best bang for your buck.

Your Budget

Monthly Budget ($)

Focus Model (optional)

Budget Split

$25.00 input / $25.00 output

Token Allowance per Model — $50.00/mo Budget

Sorted by total tokens (most to least). Split: 50% input / 50% output.

#	Model	Provider	Input Tokens	Output Tokens	Total Tokens
1	Embed v3 English	cohere	250.0M	InfinityB	InfinityB
2	Embed v3 Multilingual	cohere	250.0M	InfinityB	InfinityB
3	Rerank v3	cohere	12.5M	InfinityB	InfinityB
4	LFM2 24B A2B (Together)	together	833.3M	208.3M	1.0B
5	Command R7B	cohere	666.7M	166.7M	833.3M
6	Llama 3.1 8b Instant	groq	500.0M	312.5M	812.5M
7	Gemma 3n E4B Instruct (Together)	together	416.7M	208.3M	625.0M
8	GPT-OSS 20B (Together)	together	500.0M	125.0M	625.0M
9	GPT-5 nano	openai	500.0M	62.5M	562.5M
10	Ministral 3B	mistral	250.0M	250.0M	500.0M
11	Gemini 2.0 Flash-Lite	google	333.3M	83.3M	416.7M
12	GPT OSS Safeguard 20B	groq	333.3M	83.3M	416.7M
13	Openai/gpt Oss 20b	groq	333.3M	83.3M	416.7M
14	Together Llama 3 8B Instruct Lite	together	178.6M	178.6M	357.1M
15	Devstral Small 2	mistral	250.0M	83.3M	333.3M
16	Llama 4 Scout	meta	250.0M	83.3M	333.3M
17	Ministral 8B	mistral	166.7M	166.7M	333.3M
18	Mistral NeMo	mistral	166.7M	166.7M	333.3M
19	Mistral Small 4	mistral	250.0M	83.3M	333.3M
20	Pixtral 12B	mistral	166.7M	166.7M	333.3M
21	Rnj-1 Instruct (Together)	together	166.7M	166.7M	333.3M
22	Gemini 2.0 Flash	google	250.0M	62.5M	312.5M
23	Gemini 2.5 Flash-Lite	google	250.0M	62.5M	312.5M
24	GPT-4.1 nano	openai	250.0M	62.5M	312.5M
25	Llama 4 Scout 17B 16E Instruct	groq	227.3M	73.5M	300.8M
26	DeepSeek V4 Flash	deepseek	178.6M	89.3M	267.9M
27	Ministral 14B	mistral	125.0M	125.0M	250.0M
28	Qwen3.5 9B (Together)	together	147.1M	100.0M	247.1M
29	Command R 08-2024	cohere	166.7M	41.7M	208.3M
30	GPT-4o mini	openai	166.7M	41.7M	208.3M
31	GPT-OSS 120B (Together)	together	166.7M	41.7M	208.3M
32	Llama 4 Maverick	meta	166.7M	41.7M	208.3M
33	Openai/gpt Oss 120b	groq	166.7M	41.7M	208.3M
34	Mistral 7B	mistral	100.0M	100.0M	200.0M
35	Grok 4 1 Fast Non Reasoning	xai	125.0M	50.0M	175.0M
36	Grok 4 1 Fast Reasoning	xai	125.0M	50.0M	175.0M
37	Grok 4.1 Fast	xai	125.0M	50.0M	175.0M
38	Together Qwen2.5 7B Instruct Turbo	together	83.3M	83.3M	166.7M
39	Together Qwen3 235B A22B Instruct 2507	together	125.0M	41.7M	166.7M
40	GPT-5.4 nano	openai	125.0M	20.0M	145.0M
41	Qwen3 32B	groq	86.2M	42.4M	128.6M
42	Together Gemma 4 31B IT Pearl	together	89.3M	29.1M	118.4M
43	Gemini 3.1 Flash-Lite	google	100.0M	16.7M	116.7M
44	GPT-5 mini	openai	100.0M	12.5M	112.5M
45	Codestral	mistral	83.3M	27.8M	111.1M
46	MiniMax M2.7 (Together)	together	83.3M	20.8M	104.2M
47	Together MiniMax M2.5	together	83.3M	20.8M	104.2M
48	Together MiniMax M3	together	83.3M	20.8M	104.2M
49	Together Qwen3.7 Plus	together	78.1M	19.5M	97.7M
50	Gemini 2.5 Flash	google	83.3M	10.0M	93.3M
51	Together Gemma 4 31B IT	together	64.1M	25.8M	89.9M
52	DeepSeek V4 Pro	deepseek	57.5M	28.7M	86.2M
53	GPT-4.1 mini	openai	62.5M	15.6M	78.1M
54	Devstral Medium 2	mistral	62.5M	12.5M	75.0M
55	Mistral Medium 3	mistral	62.5M	12.5M	75.0M
56	Llama 3.3 70b Versatile	groq	42.4M	31.6M	74.0M
57	Mixtral 8x7B	mistral	35.7M	35.7M	71.4M
58	Magistral Small	mistral	50.0M	16.7M	66.7M
59	Mistral Large 3	mistral	50.0M	16.7M	66.7M
60	Gemini 3 Flash	google	50.0M	8.3M	58.3M
61	Qwen3.6-Plus (Together)	together	50.0M	8.3M	58.3M
62	DeepSeek V3.1 (Together)	together	41.7M	14.7M	56.4M
63	Sonar	perplexity	25.0M	25.0M	50.0M
64	Qwen3.5 397B A17B (Together)	together	41.7M	6.9M	48.6M
65	Together Nemotron 3 Ultra 550B A55B	together	41.7M	6.9M	48.6M
66	Llama 3.3 70B (Together)	together	24.0M	24.0M	48.1M
67	Mixtral 8x22B (Together)	together	20.8M	20.8M	41.7M
68	Qwen 2.5 72B (Together)	together	20.8M	20.8M	41.7M
69	Cogito v2.1 671B (Together)	together	20.0M	20.0M	40.0M
70	DeepSeek V3 (Together)	together	20.0M	20.0M	40.0M
71	GPT-5.4 mini	openai	33.3M	5.6M	38.9M
72	GLM-5 (Together)	together	25.0M	7.8M	32.8M
73	Together Kimi K2.7 Code	together	26.3M	6.3M	32.6M
74	Claude Haiku 4.5	anthropic	25.0M	5.0M	30.0M
75	Grok 4.3	xai	20.0M	10.0M	30.0M
76	GPT-5.6 Luna	openai	25.0M	4.2M	29.2M
77	o4-mini	openai	22.7M	5.7M	28.4M
78	Qwen3.7-Max (Together)	together	20.0M	6.7M	26.7M
79	Kimi K2.6 (Together)	together	20.8M	5.6M	26.4M
80	GLM-5.1 (Together)	together	17.9M	5.7M	23.5M
81	GLM-5.2	zai	17.9M	5.7M	23.5M
82	Gemini 2.5 Pro	google	20.0M	2.5M	22.5M
83	GPT-5	openai	20.0M	2.5M	22.5M
84	GPT-5.1	openai	20.0M	2.5M	22.5M
85	Together DeepSeek V4 Pro	together	14.4M	7.2M	21.6M
86	Mistral Medium 3.5	mistral	16.7M	3.3M	20.0M
87	Gemini 3.5 Flash	google	16.7M	2.8M	19.4M
88	Magistral Medium	mistral	12.5M	5.0M	17.5M
89	Grok 4.20	xai	12.5M	4.2M	16.7M
90	Grok 4.5	xai	12.5M	4.2M	16.7M
91	Mixtral 8x22B	mistral	12.5M	4.2M	16.7M
92	Pixtral Large	mistral	12.5M	4.2M	16.7M
93	GPT-5.2	openai	14.3M	1.8M	16.1M
94	GPT-4.1	openai	12.5M	3.1M	15.6M
95	o3	openai	12.5M	3.1M	15.6M
96	Sonar Deep Research	perplexity	12.5M	3.1M	15.6M
97	Sonar Reasoning Pro	perplexity	12.5M	3.1M	15.6M
98	Claude Sonnet 5	anthropic	12.5M	2.5M	15.0M
99	Gemini 3 Pro	google	12.5M	2.1M	14.6M
100	Gemini 3.1 Pro	google	12.5M	2.1M	14.6M
101	Llama 3.1 405B (Together)	together	7.1M	7.1M	14.3M
102	Command A	cohere	10.0M	2.5M	12.5M
103	Command R+ 08-2024	cohere	10.0M	2.5M	12.5M
104	GPT-4o	openai	10.0M	2.5M	12.5M
105	DeepSeek R1 (Together)	together	8.3M	3.6M	11.9M
106	Gemini 2.5 Pro (>200k tokens)	google	10.0M	1.7M	11.7M
107	GPT-5.4	openai	10.0M	1.7M	11.7M
108	GPT-5.6 Terra	openai	10.0M	1.7M	11.7M
109	Mistral Large (Together)	together	8.3M	2.8M	11.1M
110	Claude Sonnet 4	anthropic	8.3M	1.7M	10.0M
111	Claude Sonnet 4.5	anthropic	8.3M	1.7M	10.0M
112	Claude Sonnet 4.6	anthropic	8.3M	1.7M	10.0M
113	Kimi K3	moonshot	8.3M	1.7M	10.0M
114	Sonar Pro	perplexity	8.3M	1.7M	10.0M
115	Claude Opus 4.5	anthropic	5.0M	1.0M	6.0M
116	Claude Opus 4.6	anthropic	5.0M	1.0M	6.0M
117	Claude Opus 4.7	anthropic	5.0M	1.0M	6.0M
118	Claude Opus 4.8	anthropic	5.0M	1.0M	6.0M
119	GPT-5.5	openai	5.0M	833.3K	5.8M
120	GPT-5.6 Sol	openai	5.0M	833.3K	5.8M
121	Claude Fable 5	anthropic	2.5M	500.0K	3.0M
122	Claude Mythos 5	anthropic	2.5M	500.0K	3.0M
123	Claude Opus 4	anthropic	1.7M	333.3K	2.0M
124	Claude Opus 4.1	anthropic	1.7M	333.3K	2.0M
125	GPT-5 Pro	openai	1.7M	208.3K	1.9M
126	o3-pro	openai	1.3M	312.5K	1.6M
127	GPT-5.2 Pro	openai	1.2M	148.8K	1.3M
128	GPT-5.4 Pro	openai	833.3K	138.9K	972.2K
129	GPT-5.5 Pro	openai	833.3K	138.9K	972.2K

129 models compared at $50.00/mo budget

How does this calculator work?

Enter your monthly budget — the total dollar amount you want to spend on AI API calls per month.
Optionally select a focus model — highlights that model in the table and shows a detailed breakdown.
Choose a budget split — decide how to allocate between input and output tokens (50/50, 80/20, or 20/80).
Compare the table — models are ranked from most tokens to least, so the best-value models appear first.

Methodology

Token allowance is calculated by inverting the standard cost formula:

input_tokens = (budget × split_ratio / input_rate_per_M) × 1,000,000

output_tokens = (budget × (1 - split_ratio) / output_rate_per_M) × 1,000,000

The budget split controls what fraction of your monthly spend goes toward input vs output tokens. For most chat use cases, 50/50 is a reasonable default. If you send long prompts with short replies, use 80/20. If you request long-form content generation, use 20/80.

All rates come from our daily-updated pricing database. Models with $0 rates (free tiers) are excluded from ranking.

Does the budget only work at huge volume? If your workload is steady enough to keep GPUs busy, compare API spend with the GPU break-even guide, then test RunPod pricing against your real throughput.

The RunPod route may show a $5 referral credit after the first $10 added, but verify current terms and normal hourly pricing before treating it as part of your budget.

Affiliate disclosure: this link may earn us a commission at no extra cost to you. It does not affect token allowance rankings.