AI Model Cost Benchmarks

Cost efficiency rankings for 79 active models across 11 providers. Last updated:

Budget Tier

35 models

Under $0.50/M input tokens

Mid Tier

35 models

$0.50 – $3.00/M input tokens

Premium Tier

9 models

$3.00+/M input tokens

Cost Ranking: Cheapest to Most Expensive

# Model Provider Input $/1M Cached $/1M Output $/1M Tier
1 LFM2 24B A2B (Together) together $0.03 $0.12 Budget
2 Command R7B cohere $0.04 $0.15 Budget
3 Llama 3.1 8b Instant groq $0.05 $0.08 Budget
4 GPT-5 nano openai $0.05 $0.005 $0.40 Budget
5 GPT-OSS 20B (Together) together $0.05 $0.20 Budget
6 Gemma 3n E4B Instruct (Together) together $0.06 $0.12 Budget
7 Openai/gpt Oss 20b groq $0.07 $0.037 $0.30 Budget
8 Llama 4 Scout meta $0.08 $0.30 Budget
9 Embed v3 English cohere $0.10 $0.00 Budget
10 Embed v3 Multilingual cohere $0.10 $0.00 Budget
11 Gemini 2.5 Flash-Lite google $0.10 $0.010 $0.40 Budget
12 Ministral 3B mistral $0.10 $0.10 Budget
13 Mistral Small 4 mistral $0.10 $0.30 Budget
14 Devstral Small 2 mistral $0.10 $0.30 Budget
15 DeepSeek V4 Flash deepseek $0.14 $0.003 $0.28 Budget
16 Command R 08-2024 cohere $0.15 $0.60 Budget
17 Openai/gpt Oss 120b groq $0.15 $0.075 $0.60 Budget
18 Llama 4 Maverick meta $0.15 $0.60 Budget
19 Ministral 8B mistral $0.15 $0.15 Budget
20 Pixtral 12B mistral $0.15 $0.15 Budget
21 GPT-4o mini openai $0.15 $0.075 $0.60 Budget
22 Mistral NeMo mistral $0.15 $0.15 Budget
23 GPT-OSS 120B (Together) together $0.15 $0.60 Budget
24 Rnj-1 Instruct (Together) together $0.15 $0.15 Budget
25 Qwen3.5 9B (Together) together $0.17 $0.25 Budget
26 GPT-5.4 nano openai $0.20 $0.020 $1.25 Budget
27 Ministral 14B mistral $0.20 $0.20 Budget
28 GPT-5 mini openai $0.25 $0.025 $2.00 Budget
29 Mistral 7B mistral $0.25 $0.25 Budget
30 Gemini 2.5 Flash google $0.30 $0.030 $2.50 Budget
31 Codestral mistral $0.30 $0.90 Budget
32 MiniMax M2.7 (Together) together $0.30 $0.060 $1.20 Budget
33 GPT-4.1 mini openai $0.40 $0.100 $1.60 Budget
34 Devstral Medium 2 mistral $0.40 $2.00 Budget
35 DeepSeek V4 Pro deepseek $0.43 $0.004 $0.87 Budget
36 Magistral Small mistral $0.50 $1.50 Mid
37 Mistral Large 3 mistral $0.50 $1.50 Mid
38 Llama 3.3 70b Versatile groq $0.59 $0.79 Mid
39 Qwen3.5 397B A17B (Together) together $0.60 $3.60 Mid
40 Mixtral 8x7B mistral $0.70 $0.70 Mid
41 GPT-5.4 mini openai $0.75 $0.075 $4.50 Mid
42 Claude Haiku 4.5 anthropic $1.00 $0.100 $5.00 Mid
43 Sonar perplexity $1.00 $1.00 Mid
44 GLM-5 (Together) together $1.00 $3.20 Mid
45 Llama 3.3 70B (Together) together $1.04 $1.04 Mid
46 Kimi K2.6 (Together) together $1.20 $0.200 $4.50 Mid
47 Gemini 2.5 Pro google $1.25 $0.125 $10.00 Mid
48 GPT-5.1 openai $1.25 $0.125 $10.00 Mid
49 GPT-5 openai $1.25 $0.125 $10.00 Mid
50 Grok 4.3 xai $1.25 $2.50 Mid
51 Qwen3.7-Max (Together) together $1.25 $0.130 $3.75 Mid
52 Cogito v2.1 671B (Together) together $1.25 $1.25 Mid
53 GLM-5.1 (Together) together $1.40 $4.40 Mid
54 Gemini 3.5 Flash google $1.50 $0.150 $9.00 Mid
55 Mistral Medium 3.5 mistral $1.50 $7.50 Mid
56 GPT-5.2 openai $1.75 $0.175 $14.00 Mid
57 Rerank v3 cohere $2.00 $0.00 Mid
58 Magistral Medium mistral $2.00 $5.00 Mid
59 Mixtral 8x22B mistral $2.00 $6.00 Mid
60 Pixtral Large mistral $2.00 $6.00 Mid
61 GPT-4.1 openai $2.00 $0.500 $8.00 Mid
62 o3 openai $2.00 $0.500 $8.00 Mid
63 Sonar Deep Research perplexity $2.00 $8.00 Mid
64 Sonar Reasoning Pro perplexity $2.00 $8.00 Mid
65 DeepSeek V4 Pro (Together) together $2.10 $0.200 $4.40 Mid
66 Command R+ 08-2024 cohere $2.50 $10.00 Mid
67 Gemini 2.5 Pro (>200k tokens) google $2.50 $0.250 $15.00 Mid
68 GPT-4o openai $2.50 $1.250 $10.00 Mid
69 GPT-5.4 openai $2.50 $0.250 $15.00 Mid
70 Command A cohere $2.50 $10.00 Mid
71 Claude Sonnet 4.6 anthropic $3.00 $0.300 $15.00 Premium
72 Sonar Pro perplexity $3.00 $15.00 Premium
73 Claude Opus 4.8 anthropic $5.00 $0.500 $25.00 Premium
74 GPT-5.5 openai $5.00 $0.500 $30.00 Premium
75 GPT-5 Pro openai $15.00 $120.00 Premium
76 o3-pro openai $20.00 $80.00 Premium
77 GPT-5.2 Pro openai $21.00 $168.00 Premium
78 GPT-5.4 Pro openai $30.00 $180.00 Premium
79 GPT-5.5 Pro openai $30.00 $180.00 Premium

Models per Provider

anthropic

12 models

cohere

7 models

deepseek

2 models

google

11 models

groq

7 models

meta

2 models

mistral

18 models

openai

21 models

perplexity

4 models

together

23 models

xai

5 models

Note: These are cost benchmarks based on per-token pricing. Performance benchmarks (speed, latency, quality scores) are coming soon. Pricing data is updated daily from official provider pages.