DeepSeek active budget

DeepSeek V4 Flash API Pricing

DeepSeek V4 Flash costs $0.14 per 1M input tokens and $0.28 per 1M output tokens. Prices were last refreshed on 2026-06-29 from the AI Pricing Guru daily pricing pipeline.

Calculate cost All API prices

Cost examples

These examples use published list prices only. They exclude taxes, enterprise discounts, minimum charges, retries, batch discounts, and provider-specific billing rules.

1M input + 1M output

$0.42

A direct per-million comparison against every other text model.

100K input + 25K output

$0.021

A compact chat, summarization, or analysis workload.

10M input + 2M output

$1.96

A monthly production estimate for heavier RAG or agent traffic.

75% cached input + 250K output

$0.1071

Cached input is about 98% cheaper than standard input at list price.

Calculator for DeepSeek V4 Flash

Enter your own input, output, and cached-token assumptions. This calculator is preloaded with only DeepSeek V4 Flash, so the result stays focused on this model.

Input tokensOutput tokensCached input %

Input cost

$0.14

Output cost

$0.28

Total

$0.42

Price history

No list-price movement across 66 daily snapshots from 2026-04-24 to 2026-06-29.

When to use DeepSeek V4 Flash

Good fit

reasoning-heavy prompts, planning, analysis, and multi-step tool use
high-volume routing, classification, extraction, summaries, and fallback traffic

Be careful with

decisions that depend on unpublished context-window limits without checking the provider docs first

Alternatives to compare

DeepSeek V4 Pro

DeepSeek

active

$0.435 input / $0.87 output

Llama 4 Scout 17B 16E Instruct

Groq

preview

$0.11 input / $0.34 output

LFM2 24B A2B (Together)

Together AI

active

$0.03 input / $0.12 output

Together Llama 3 8B Instruct Lite

Together AI

active

$0.14 input / $0.14 output