Question 1

How much does Cohere Command R+ cost?

Accepted Answer

Command R+ 08-2024 is priced at $2.50 per million input tokens and $10.00 per million output tokens. That is roughly the same as GPT-5.4 on input and 33% cheaper on output, which positions Command R+ as a competitive flagship option, especially for retrieval-augmented generation, the workload Cohere's models are built for.
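To make the per-token prices concrete, here is a minimal sketch that turns them into a per-request cost. The function name and the example token counts are illustrative assumptions, not figures from Cohere.

```python
# Prices quoted above for Command R+ 08-2024, in USD per 1M tokens.
INPUT_PRICE_PER_M = 2.50
OUTPUT_PRICE_PER_M = 10.00


def chat_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a single chat request."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical RAG request: 8,000 prompt tokens (query plus retrieved
# passages) and 500 generated tokens.
print(f"${chat_cost_usd(8_000, 500):.4f}")  # prints $0.0250
```

In a RAG workload the prompt usually dominates the token count, so the input price tends to matter more than the output price.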

Question 2

Does Cohere have a free tier?

Accepted Answer

Yes. Cohere offers a free trial tier with rate-limited access to the Command, Embed, and Rerank endpoints for prototyping (trial keys are capped at 1,000 API calls per month). Pay-as-you-go production pricing applies once you exceed trial limits or request production keys, and there are no monthly minimums.

Question 3

How does Cohere compare to OpenAI?

Accepted Answer

Cohere is narrower in focus than OpenAI but often better at what it does. Command R+ at $2.50 / $10.00 per 1M tokens matches GPT-5.4 on input cost and undercuts it by 33% on output. Embed v3 at $0.10 per 1M tokens is about 23% cheaper than OpenAI text-embedding-3-large ($0.13). Rerank v3 has no direct counterpart: OpenAI does not ship a dedicated reranker.
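Relative price differences like these are easy to get wrong, so a small helper is useful for checking them. The sketch below uses only the two embedding prices quoted above, for which (0.13 - 0.10) / 0.13 comes to roughly 23%. The function name is an illustrative assumption.

```python
def percent_cheaper(price: float, reference: float) -> float:
    """Return how much cheaper `price` is than `reference`, as a percentage."""
    if reference <= 0:
        raise ValueError("reference price must be positive")
    return (reference - price) / reference * 100


# Embedding prices quoted above, in USD per 1M tokens:
# Embed v3 at 0.10 versus text-embedding-3-large at 0.13.
embed_saving = percent_cheaper(0.10, 0.13)
print(f"{embed_saving:.1f}% cheaper")  # prints 23.1% cheaper
```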

Question 4

What are Cohere's embedding model prices?

Accepted Answer

Embed v3 English and Embed v3 Multilingual are both priced at $0.10 per million input tokens. The multilingual variant supports over 100 languages with near-English quality. Embeddings are input-only, so there is no output token cost.

Question 5

What's Rerank v3 priced at?

Accepted Answer

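Rerank v3 costs $2.00 per million tokens of search queries + documents combined. Reranking is typically used after vector retrieval to boost top-K precision: you send 100-1000 candidate documents per query and Rerank scores them. At $2 / 1M tokens, $0.02 buys 10,000 tokens, so a 10-query / 100-doc-each workload costs about $0.02 only if its queries and documents total roughly 10,000 tokens (about 10 tokens per document); longer documents cost proportionally more.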
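Because the quoted price is per token, the cost of a reranking workload depends on document length as well as on the number of queries. The sketch below estimates it under two assumptions that are mine, not Cohere's: each query is billed once per request, and every document has the same token count. The token counts in the example are hypothetical.

```python
RERANK_PRICE_PER_M = 2.00  # USD per 1M tokens (queries + documents), as quoted above


def rerank_cost_usd(queries: int, docs_per_query: int,
                    query_tokens: int, doc_tokens: int) -> float:
    """Estimate the rerank cost of a workload with uniform query and document sizes.

    Assumes each query's tokens are counted once per request.
    """
    tokens_per_request = query_tokens + docs_per_query * doc_tokens
    total_tokens = queries * tokens_per_request
    return total_tokens * RERANK_PRICE_PER_M / 1_000_000


# Hypothetical workload: 10 queries of 20 tokens, each scored against
# 100 candidate passages of 300 tokens.
print(f"${rerank_cost_usd(10, 100, 20, 300):.4f}")  # prints $0.6004
```

Chunking documents into shorter passages before reranking lowers `doc_tokens` and therefore the cost per query.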

Question 6

What's Command R+'s context window?

Accepted Answer

Command R+ 08-2024 supports a 128K token context window, matching GPT-5.4. Command R 08-2024 also ships with 128K context. Cohere tunes these models aggressively for long-document RAG, with strong needle-in-a-haystack recall at depths up to 100K tokens.
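For long-document RAG, the practical question is how many retrieved chunks fit in the window. The sketch below is a rough budget calculation under stated assumptions: "128K" is read as 128,000 tokens, generated output counts against the same window, and the chunk size, prompt overhead, and output reservation are hypothetical values.

```python
CONTEXT_WINDOW = 128_000  # assumption: "128K" read as 128,000 tokens


def max_chunks(chunk_tokens: int, prompt_overhead: int,
               reserved_output: int, window: int = CONTEXT_WINDOW) -> int:
    """Return how many equally sized chunks fit in the remaining token budget."""
    if chunk_tokens <= 0:
        raise ValueError("chunk_tokens must be positive")
    budget = window - prompt_overhead - reserved_output
    return max(budget, 0) // chunk_tokens


# Hypothetical setup: 512-token chunks, 500 tokens of instructions and
# query, and 4,000 tokens reserved for the model's answer.
print(max_chunks(chunk_tokens=512, prompt_overhead=500, reserved_output=4_000))  # prints 241
```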

Question 7

What are Cohere's API rate limits for the free trial and production tiers?

Accepted Answer

Trial (evaluation) keys are capped at 1,000 API calls per month total, with per-endpoint per-minute limits: Chat (Command A, R+, R, R7B) 20 req/min, Rerank 10 req/min, Embed 2,000 inputs/min, EmbedJob 5 req/min, Audio Transcriptions 5 req/min. Production keys lift Chat to 500 req/min, Rerank to 1,000 req/min, EmbedJob to 50 req/min, and remove the monthly cap. Newer models like Command A Reasoning, Command A Translate, and Command A Vision require contacting sales@cohere.com for production access — trial keys work for evaluation at 20 req/min.
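One simple way to stay under a per-minute limit is to space requests evenly on the client side. The sketch below is a minimal single-threaded throttle, not part of any Cohere SDK. The class name is hypothetical, and the clock and sleep functions are injectable only so the behaviour can be tested without waiting.

```python
import time


class MinIntervalThrottle:
    """Space calls at least 60 / per_minute seconds apart."""

    def __init__(self, per_minute: int, clock=time.monotonic, sleep=time.sleep):
        if per_minute <= 0:
            raise ValueError("per_minute must be positive")
        self.interval = 60.0 / per_minute
        self.clock = clock
        self.sleep = sleep
        self.next_allowed = 0.0  # earliest time the next call may start

    def wait(self) -> None:
        """Block until the next call is allowed, then reserve the following slot."""
        now = self.clock()
        if now < self.next_allowed:
            self.sleep(self.next_allowed - now)
            now = self.next_allowed
        self.next_allowed = now + self.interval


# Trial-key Chat limit quoted above: 20 requests per minute,
# which works out to one request every 3 seconds.
chat_throttle = MinIntervalThrottle(per_minute=20)
```

Calling `chat_throttle.wait()` before each request keeps a single-threaded client at or below the limit. It does not track the 1,000-calls-per-month trial cap, which would need a separate counter.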

Question 8

When was Command R+ last updated and is it still the flagship?

Accepted Answer

The current Command R+ generation is the 08-2024 release (Command R+ 08-2024 at $2.50/$10 per 1M tokens), still actively served. Cohere has since shipped Command A as a higher-end variant with reasoning, vision, and translation specializations — those are gated to evaluation use until you contact sales for production keys. For general-purpose RAG and chat workloads, Command R+ 08-2024 remains the flagship priced for self-serve production use.

Question 9

What's changed in Cohere's pricing and lineup in 2026?

Accepted Answer

The active 2026 Cohere lineup on self-serve production keys is Command R+ ($2.50/$10), Command R ($0.15/$0.60), Command R7B ($0.0375/$0.15 — Cohere's cheapest chat model), Embed v3 English and Multilingual ($0.10 input-only), and Rerank v3 ($2.00 per 1M tokens of query + documents). Newer Command A variants (Reasoning, Translate, Vision) are evaluation-only at the moment. Audio Transcriptions joined the rate-limit table in early 2026 at 5 req/min on trial keys. Prices on Command R+/R/R7B have held steady since the 08-2024 release — Cohere's competitive lever has been adding capabilities (Command A reasoning, image inputs, audio) rather than cutting per-token prices.
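To see what the price gaps between the three chat models mean in practice, the sketch below applies the prices listed above to one monthly workload. The token volumes are hypothetical, and the dictionary and function names are illustrative.

```python
# (input, output) prices in USD per 1M tokens, as listed above.
PRICES_PER_M = {
    "Command R+ 08-2024": (2.50, 10.00),
    "Command R 08-2024": (0.15, 0.60),
    "Command R7B": (0.0375, 0.15),
}


def monthly_cost_usd(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated monthly USD cost for a model at the given token volume."""
    input_price, output_price = PRICES_PER_M[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 100M input tokens and 20M output tokens per month.
for model in PRICES_PER_M:
    print(f"{model}: ${monthly_cost_usd(model, 100_000_000, 20_000_000):,.2f}")
```

For this workload the estimate is $450.00 on Command R+, $27.00 on Command R, and $6.75 on Command R7B, so routing simpler requests to a smaller model is the main cost lever while list prices stay flat.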

Cohere API Pricing 2026: Command R+, Command R, R7B, Embed v3, Rerank v3

Key facts about Cohere pricing

How much does each Cohere model cost per million tokens?

Why choose Cohere over OpenAI or Anthropic?

When Cohere is not the right choice

Price History

Command R 08-2024

Command R+ 08-2024

Command R7B

Embed v3 English

Embed v3 Multilingual

Rerank v3

Command A

Methodology

Compare Cohere to other providers

Model	Family	Provider	Tier	Input $/1M	Cached $/1M	Output $/1M
Command R7B	Command R	cohere	Low	$0.0375	n/a	$0.15
Embed v3 English	Embed	cohere	Low	$0.10	n/a	$0.00
Embed v3 Multilingual	Embed	cohere	Low	$0.10	n/a	$0.00
Command R 08-2024	Command R	cohere	Low	$0.15	n/a	$0.60
Rerank v3	Rerank	cohere	Low	$2.00	n/a	$0.00
Command R+ 08-2024	Command R	cohere	Mid	$2.50	n/a	$10.00