Groq preview low cost

Qwen3 32B API Pricing

Qwen3 32B costs $0.29 per 1M input tokens and $0.59 per 1M output tokens. Prices were last refreshed on 2026-06-29 from the AI Pricing Guru daily pricing pipeline.

Calculate cost All API prices

Cost examples

These examples use published list prices only. They exclude taxes, enterprise discounts, minimum charges, retries, batch discounts, and provider-specific billing rules.

1M input + 1M output

$0.88

A direct per-million comparison against every other text model.

100K input + 25K output

$0.0438

A compact chat, summarization, or analysis workload.

10M input + 2M output

$4.08

A monthly production estimate for heavier RAG or agent traffic.

75% cached input + 250K output

Not listed

Groq does not publish a cached-input rate for this model in our dataset.

Calculator for Qwen3 32B

Enter your own input, output, and cached-token assumptions. This calculator is preloaded with only Qwen3 32B, so the result stays focused on this model.

Input tokensOutput tokensCached input %

Input cost

$0.29

Output cost

$0.59

Total

$0.88

Price history

No list-price movement across 52 daily snapshots from 2026-05-09 to 2026-06-29.

When to use Qwen3 32B

Good fit

high-volume routing, classification, extraction, summaries, and fallback traffic

Be careful with

decisions that depend on unpublished context-window limits without checking the provider docs first

Alternatives to compare

Llama 3.1 8b Instant

Groq

active

$0.05 input / $0.08 output

Together Gemma 4 31B IT Pearl

Together AI

active

$0.28 input / $0.86 output

LFM2 24B A2B (Together)

Together AI

active

$0.03 input / $0.12 output

Gemini 2.5 Flash

Google Gemini

active

$0.30 input / $2.50 output