Groq, preview, low cost

Qwen3 32B API Pricing

Qwen3 32B costs $0.29 per 1M input tokens and $0.59 per 1M output tokens. Prices come from the AI Pricing Guru daily pricing pipeline.

Cost examples

These examples use published list prices only. They exclude taxes, enterprise discounts, minimum charges, retries, batch discounts, and provider-specific billing rules.

1M input + 1M output

$0.88

A per-million baseline for direct comparison against other text models.

100K input + 25K output

$0.0438

A compact chat, summarization, or analysis workload.

10M input + 2M output

$4.08

A monthly production estimate for heavier RAG or agent traffic.

75% cached input + 250K output

Not listed

Groq does not publish a cached-input rate for this model in our dataset.
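
The three priced examples above follow from one linear formula applied to the list prices. The worked arithmetic below is a sketch of that calculation; the symbols N_in and N_out (input and output token counts) are notation introduced here for illustration, not taken from the provider's billing documentation.

```latex
\[
\text{cost} = \frac{N_{\text{in}}}{10^{6}} \times 0.29 + \frac{N_{\text{out}}}{10^{6}} \times 0.59
\]
\begin{align*}
\text{1M input + 1M output:}    \quad & 1 \times 0.29 + 1 \times 0.59 = 0.29 + 0.59 = 0.88 \\
\text{100K input + 25K output:} \quad & 0.1 \times 0.29 + 0.025 \times 0.59 = 0.029 + 0.01475 = 0.04375 \approx 0.0438 \\
\text{10M input + 2M output:}   \quad & 10 \times 0.29 + 2 \times 0.59 = 2.90 + 1.18 = 4.08
\end{align*}
```

The cached-input example cannot be priced this way because the formula would need a third rate that is not listed.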

Calculator for Qwen3 32B

Enter your own input, output, and cached-token assumptions. This calculator is preloaded with only Qwen3 32B, so the result stays focused on this model.

Input cost

$0.29

Output cost

$0.59

Total

$0.88
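
For readers without access to the interactive calculator, the same estimate can be approximated offline. The snippet below is a minimal sketch using only the list prices on this page; the function name `estimate_cost`, its parameters, and the choice to raise an error when a cached share is requested without a cached rate are all assumptions made for illustration, not the calculator's actual implementation.

```python
from typing import Optional

# List prices from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.29
OUTPUT_PRICE_PER_M = 0.59
# No cached-input rate is listed for this model, so it stays unset.
CACHED_INPUT_PRICE_PER_M: Optional[float] = None

TOKENS_PER_MILLION = 1_000_000


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    cached_fraction: float = 0.0,
    cached_price_per_m: Optional[float] = CACHED_INPUT_PRICE_PER_M,
) -> dict:
    """Estimate list-price cost in USD (hypothetical helper).

    cached_fraction is the share of input tokens billed at the cached rate.
    Excludes taxes, discounts, minimum charges, and retries.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    if cached_fraction > 0 and cached_price_per_m is None:
        # Assumption: refuse to guess a rate rather than bill at full price.
        raise ValueError("no cached-input rate is listed for this model")

    cached_tokens = input_tokens * cached_fraction
    fresh_tokens = input_tokens - cached_tokens

    input_cost = fresh_tokens / TOKENS_PER_MILLION * INPUT_PRICE_PER_M
    if cached_tokens:
        input_cost += cached_tokens / TOKENS_PER_MILLION * cached_price_per_m
    output_cost = output_tokens / TOKENS_PER_MILLION * OUTPUT_PRICE_PER_M

    return {
        "input": input_cost,
        "output": output_cost,
        "total": input_cost + output_cost,
    }


if __name__ == "__main__":
    default = estimate_cost(1_000_000, 1_000_000)
    print(f"Total: ${default['total']:.2f}")  # prints "Total: $0.88"
```

Passing `cached_fraction=0.75` without a `cached_price_per_m` raises a `ValueError`, which mirrors the "Not listed" result in the cost examples. If the provider publishes a cached rate, it can be supplied through that parameter.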

Price history

No list-price movement across 52 daily snapshots from 2026-05-09 to 2026-06-29.

When to use Qwen3 32B

Good fit

  • high-volume routing, classification, extraction, summaries, and fallback traffic

Be careful with

  • decisions that depend on context-window limits, which are not published here; check the provider docs first

Alternatives to compare