AI Model Cost Benchmarks
Cost efficiency rankings for 79 active models across 11 providers. Last updated:
Budget Tier
35 models
Under $0.50/M input tokens
Mid Tier
35 models
$0.50 – $3.00/M input tokens
Premium Tier
9 models
$3.00+/M input tokens
Cost Ranking: Cheapest to Most Expensive
| # | Model | Provider | Input $/1M | Cached $/1M | Output $/1M | Tier |
|---|---|---|---|---|---|---|
| 1 | LFM2 24B A2B (Together) | together | $0.03 | — | $0.12 | Budget |
| 2 | Command R7B | cohere | $0.04 | — | $0.15 | Budget |
| 3 | Llama 3.1 8b Instant | groq | $0.05 | — | $0.08 | Budget |
| 4 | GPT-5 nano | openai | $0.05 | $0.005 | $0.40 | Budget |
| 5 | GPT-OSS 20B (Together) | together | $0.05 | — | $0.20 | Budget |
| 6 | Gemma 3n E4B Instruct (Together) | together | $0.06 | — | $0.12 | Budget |
| 7 | Openai/gpt Oss 20b | groq | $0.07 | $0.037 | $0.30 | Budget |
| 8 | Llama 4 Scout | meta | $0.08 | — | $0.30 | Budget |
| 9 | Embed v3 English | cohere | $0.10 | — | $0.00 | Budget |
| 10 | Embed v3 Multilingual | cohere | $0.10 | — | $0.00 | Budget |
| 11 | Gemini 2.5 Flash-Lite | $0.10 | $0.010 | $0.40 | Budget | |
| 12 | Ministral 3B | mistral | $0.10 | — | $0.10 | Budget |
| 13 | Mistral Small 4 | mistral | $0.10 | — | $0.30 | Budget |
| 14 | Devstral Small 2 | mistral | $0.10 | — | $0.30 | Budget |
| 15 | DeepSeek V4 Flash | deepseek | $0.14 | $0.003 | $0.28 | Budget |
| 16 | Command R 08-2024 | cohere | $0.15 | — | $0.60 | Budget |
| 17 | Openai/gpt Oss 120b | groq | $0.15 | $0.075 | $0.60 | Budget |
| 18 | Llama 4 Maverick | meta | $0.15 | — | $0.60 | Budget |
| 19 | Ministral 8B | mistral | $0.15 | — | $0.15 | Budget |
| 20 | Pixtral 12B | mistral | $0.15 | — | $0.15 | Budget |
| 21 | GPT-4o mini | openai | $0.15 | $0.075 | $0.60 | Budget |
| 22 | Mistral NeMo | mistral | $0.15 | — | $0.15 | Budget |
| 23 | GPT-OSS 120B (Together) | together | $0.15 | — | $0.60 | Budget |
| 24 | Rnj-1 Instruct (Together) | together | $0.15 | — | $0.15 | Budget |
| 25 | Qwen3.5 9B (Together) | together | $0.17 | — | $0.25 | Budget |
| 26 | GPT-5.4 nano | openai | $0.20 | $0.020 | $1.25 | Budget |
| 27 | Ministral 14B | mistral | $0.20 | — | $0.20 | Budget |
| 28 | GPT-5 mini | openai | $0.25 | $0.025 | $2.00 | Budget |
| 29 | Mistral 7B | mistral | $0.25 | — | $0.25 | Budget |
| 30 | Gemini 2.5 Flash | $0.30 | $0.030 | $2.50 | Budget | |
| 31 | Codestral | mistral | $0.30 | — | $0.90 | Budget |
| 32 | MiniMax M2.7 (Together) | together | $0.30 | $0.060 | $1.20 | Budget |
| 33 | GPT-4.1 mini | openai | $0.40 | $0.100 | $1.60 | Budget |
| 34 | Devstral Medium 2 | mistral | $0.40 | — | $2.00 | Budget |
| 35 | DeepSeek V4 Pro | deepseek | $0.43 | $0.004 | $0.87 | Budget |
| 36 | Magistral Small | mistral | $0.50 | — | $1.50 | Mid |
| 37 | Mistral Large 3 | mistral | $0.50 | — | $1.50 | Mid |
| 38 | Llama 3.3 70b Versatile | groq | $0.59 | — | $0.79 | Mid |
| 39 | Qwen3.5 397B A17B (Together) | together | $0.60 | — | $3.60 | Mid |
| 40 | Mixtral 8x7B | mistral | $0.70 | — | $0.70 | Mid |
| 41 | GPT-5.4 mini | openai | $0.75 | $0.075 | $4.50 | Mid |
| 42 | Claude Haiku 4.5 | anthropic | $1.00 | $0.100 | $5.00 | Mid |
| 43 | Sonar | perplexity | $1.00 | — | $1.00 | Mid |
| 44 | GLM-5 (Together) | together | $1.00 | — | $3.20 | Mid |
| 45 | Llama 3.3 70B (Together) | together | $1.04 | — | $1.04 | Mid |
| 46 | Kimi K2.6 (Together) | together | $1.20 | $0.200 | $4.50 | Mid |
| 47 | Gemini 2.5 Pro | $1.25 | $0.125 | $10.00 | Mid | |
| 48 | GPT-5.1 | openai | $1.25 | $0.125 | $10.00 | Mid |
| 49 | GPT-5 | openai | $1.25 | $0.125 | $10.00 | Mid |
| 50 | Grok 4.3 | xai | $1.25 | — | $2.50 | Mid |
| 51 | Qwen3.7-Max (Together) | together | $1.25 | $0.130 | $3.75 | Mid |
| 52 | Cogito v2.1 671B (Together) | together | $1.25 | — | $1.25 | Mid |
| 53 | GLM-5.1 (Together) | together | $1.40 | — | $4.40 | Mid |
| 54 | Gemini 3.5 Flash | $1.50 | $0.150 | $9.00 | Mid | |
| 55 | Mistral Medium 3.5 | mistral | $1.50 | — | $7.50 | Mid |
| 56 | GPT-5.2 | openai | $1.75 | $0.175 | $14.00 | Mid |
| 57 | Rerank v3 | cohere | $2.00 | — | $0.00 | Mid |
| 58 | Magistral Medium | mistral | $2.00 | — | $5.00 | Mid |
| 59 | Mixtral 8x22B | mistral | $2.00 | — | $6.00 | Mid |
| 60 | Pixtral Large | mistral | $2.00 | — | $6.00 | Mid |
| 61 | GPT-4.1 | openai | $2.00 | $0.500 | $8.00 | Mid |
| 62 | o3 | openai | $2.00 | $0.500 | $8.00 | Mid |
| 63 | Sonar Deep Research | perplexity | $2.00 | — | $8.00 | Mid |
| 64 | Sonar Reasoning Pro | perplexity | $2.00 | — | $8.00 | Mid |
| 65 | DeepSeek V4 Pro (Together) | together | $2.10 | $0.200 | $4.40 | Mid |
| 66 | Command R+ 08-2024 | cohere | $2.50 | — | $10.00 | Mid |
| 67 | Gemini 2.5 Pro (>200k tokens) | $2.50 | $0.250 | $15.00 | Mid | |
| 68 | GPT-4o | openai | $2.50 | $1.250 | $10.00 | Mid |
| 69 | GPT-5.4 | openai | $2.50 | $0.250 | $15.00 | Mid |
| 70 | Command A | cohere | $2.50 | — | $10.00 | Mid |
| 71 | Claude Sonnet 4.6 | anthropic | $3.00 | $0.300 | $15.00 | Premium |
| 72 | Sonar Pro | perplexity | $3.00 | — | $15.00 | Premium |
| 73 | Claude Opus 4.8 | anthropic | $5.00 | $0.500 | $25.00 | Premium |
| 74 | GPT-5.5 | openai | $5.00 | $0.500 | $30.00 | Premium |
| 75 | GPT-5 Pro | openai | $15.00 | — | $120.00 | Premium |
| 76 | o3-pro | openai | $20.00 | — | $80.00 | Premium |
| 77 | GPT-5.2 Pro | openai | $21.00 | — | $168.00 | Premium |
| 78 | GPT-5.4 Pro | openai | $30.00 | — | $180.00 | Premium |
| 79 | GPT-5.5 Pro | openai | $30.00 | — | $180.00 | Premium |
Models per Provider
anthropic
12 models
cohere
7 models
deepseek
2 models
11 models
groq
7 models
meta
2 models
mistral
18 models
openai
21 models
perplexity
4 models
together
23 models
xai
5 models
Note: These are cost benchmarks based on per-token pricing. Performance benchmarks (speed, latency, quality scores) are coming soon. Pricing data is updated daily from official provider pages.