Retrieval-Augmented Generation
Grounds model responses in external knowledge sources by retrieving and citing relevant documents, reducing hallucinations in enterprise workflows.
Command R is an instruction-following conversational model developed by Cohere, designed for enterprise language tasks with a focus on reliability and scalability. It is available through Amazon Bedrock and carries a knowledge cutoff of March 2024. The model is purpose-built for retrieval-augmented generation (RAG) and tool use, making it well-suited for workflows that require grounding responses in external data sources or integrating with external APIs and functions. One of Command R's defining characteristics is its 128,000-token context window, which allows it to process long documents, extended multi-turn conversations, and complex inputs in a single pass. It also supports multilingual tasks and is tagged for low-latency performance, making it a practical choice for organizations building scalable AI applications where response speed and contextual accuracy matter. It is best suited for enterprise use cases such as document analysis, agentic pipelines, and knowledge-grounded question answering.
High-signal model metadata in a structured two-column overview table.
The entity that provides this model.
The routed model identifier exposed by upstream providers.
The number of tokens supported by the input context window.
The number of tokens that can be generated by the model in a single request.
Whether the model's code is available for public use.
When the model was first released.
When the model's knowledge was last updated.
The providers that offer this model. This is not an exhaustive list.
Types of data this model can process.
A fuller summary of positioning, capabilities, and source-specific details for Command R.
Command R is an instruction-following conversational model developed by Cohere, designed for enterprise language tasks with a focus on reliability and scalability. It is available through Amazon Bedrock and carries a knowledge cutoff of March 2024. The model is purpose-built for retrieval-augmented generation (RAG) and tool use, making it well-suited for workflows that require grounding responses in external data sources or integrating with external APIs and functions.
One of Command R's defining characteristics is its 128,000-token context window, which allows it to process long documents, extended multi-turn conversations, and complex inputs in a single pass. It also supports multilingual tasks and is tagged for low-latency performance, making it a practical choice for organizations building scalable AI applications where response speed and contextual accuracy matter. It is best suited for enterprise use cases such as document analysis, agentic pipelines, and knowledge-grounded question answering.
Grounds model responses in external knowledge sources by retrieving and citing relevant documents, reducing hallucinations in enterprise workflows.
Enables agentic workflows by allowing the model to call external tools and APIs, supporting multi-step task execution.
Processes up to 128,000 tokens in a single pass, enabling analysis of long documents and extended multi-turn conversations.
Handles tasks across multiple languages, making it suitable for globally distributed enterprise applications.
Optimized for speed in production environments, supporting real-time or near-real-time application requirements.
Follows detailed natural language instructions reliably, supporting structured enterprise use cases such as summarization, classification, and Q&A.
Primary API pricing shown in the same “quick compare” spirit as the reference page.
Additional usage-cost dimensions synced into the project for this model.
Places where this model is available, based on the synced detail-page metadata.
Endpoint-level provider data currently available for this model.
Benchmark scores synced from the current model source and normalized into the local catalog.
| Benchmark | Score |
|---|---|
|
AIME 2024
American math olympiad problems
|
|
|
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
|
|
|
HLE
Questions that challenge frontier models across many domains
|
|
|
LiveCodeBench
Real-world coding tasks from recent competitions
|
|
|
MATH-500
Undergraduate and competition-level math problems
|
|
|
MMLU-Pro
Expert knowledge across 14 academic disciplines
|
|
|
SciCode
Scientific research coding and numerical methods
|
Official model cards, release notes, docs, and other references synced from the source page.
Command R discussions are most active in r/LocalLLaMA. The strongest match in this snapshot has 479 upvotes and 213 comments.
What's new in 1.5:
* Up to 50% higher throughput and 25% lower latency
* Cut hardware requirements in half for Command R 1.5
* Enhanced multilingual capabilities with improved retrieval-augmented generation
* Better tool selection and usage
* Increased strengths in data analysis and creation
* More robustness to non-semantic prompt changes
* Declines to answer unsolvable questions
* Introducing configurable Safety Modes for nuanced content filtering
* Command R+ 1.5 priced at $2.50/M input tokens, $10/M output tokens
* Command R 1.5 priced at $0.15/M input tokens, $0.60/M output tokens
Blog link: [https://docs.cohere.com/changelog/command-gets-refreshed](https://docs.cohere.com/changelog/command-gets-refreshed)
Huggingface links:
Command R: [https://huggingface.co/CohereForAI/c4ai-command-r-08-2024](https://huggingface.co/CohereForAI/c4ai-command-r-08-2024)
Command R+: [https://huggingface.co/CohereForAI/c4ai-command-r-plus-08-2024](https://huggingface.co/CohereForAI/c4ai-command-r-plus-08-2024)
Command R supports a context window of 128,000 tokens, allowing it to process long documents and extended conversations in a single pass.
Command R has a training knowledge cutoff of March 2024, as noted in the model metadata.
Command R is available through Amazon Bedrock, and pricing is determined by AWS. You can find current pricing details on the Amazon Bedrock Pricing page.
Command R is purpose-built for retrieval-augmented generation (RAG) and tool use, making it well-suited for enterprise workflows that require grounding responses in external knowledge sources or building agentic pipelines.
Yes, Command R is tagged as multilingual and is designed to handle tasks across multiple languages, supporting globally distributed enterprise applications.
Continue browsing adjacent models from the same provider.