DeepSeek V4 Flash API Pricing
DeepSeek V4 Flash costs $0.14 per 1M input tokens and $0.28 per 1M output tokens. Prices were last refreshed on from the AI Pricing Guru daily pricing pipeline.
Cost examples
These examples use published list prices only. They exclude taxes, enterprise discounts, minimum charges, retries, batch discounts, and provider-specific billing rules.
1M input + 1M output
$0.42
A direct per-million comparison against every other text model.
100K input + 25K output
$0.021
A compact chat, summarization, or analysis workload.
10M input + 2M output
$1.96
A monthly production estimate for heavier RAG or agent traffic.
75% cached input + 250K output
$0.1071
Cached input is about 98% cheaper than standard input at list price.
Calculator for DeepSeek V4 Flash
Enter your own input, output, and cached-token assumptions. This calculator is preloaded with only DeepSeek V4 Flash, so the result stays focused on this model.
Input cost
$0.14
Output cost
$0.28
Total
$0.42
Price history
No list-price movement across 66 daily snapshots from 2026-04-24 to 2026-06-29.
When to use DeepSeek V4 Flash
Good fit
- reasoning-heavy prompts, planning, analysis, and multi-step tool use
- high-volume routing, classification, extraction, summaries, and fallback traffic
Be careful with
- decisions that depend on unpublished context-window limits without checking the provider docs first