Retrieval-Augmented Generation
Purpose-built to integrate with external knowledge sources, allowing responses to be grounded in real retrieved data rather than model memory alone.
Command R+ is a large language model developed by Cohere, positioned as the company's flagship text generation model for enterprise use. It is available through Amazon Bedrock, allowing organizations to deploy it within AWS's managed cloud infrastructure. The model supports a 128,000-token context window and was trained on data up to January 2023. It is designed specifically for demanding enterprise workloads that require high accuracy and reliability. What distinguishes Command R+ is its purpose-built support for retrieval-augmented generation, enabling it to ground responses in external knowledge sources rather than relying solely on parametric memory. It also supports multi-step tool use and agentic workflows, allowing it to interact with APIs, databases, and other external systems. The model handles multiple languages, making it applicable for global deployments. It is best suited for production applications such as intelligent search, document summarization, customer support automation, and complex data analysis pipelines.
High-signal model metadata in a structured two-column overview table.
The entity that provides this model.
The routed model identifier exposed by upstream providers.
The number of tokens supported by the input context window.
The number of tokens that can be generated by the model in a single request.
Whether the model's code is available for public use.
When the model was first released.
When the model's knowledge was last updated.
The providers that offer this model. This is not an exhaustive list.
Types of data this model can process.
A fuller summary of positioning, capabilities, and source-specific details for Command R+.
Command R+ is a large language model developed by Cohere, positioned as the company's flagship text generation model for enterprise use. It is available through Amazon Bedrock, allowing organizations to deploy it within AWS's managed cloud infrastructure. The model supports a 128,000-token context window and was trained on data up to January 2023. It is designed specifically for demanding enterprise workloads that require high accuracy and reliability.
What distinguishes Command R+ is its purpose-built support for retrieval-augmented generation, enabling it to ground responses in external knowledge sources rather than relying solely on parametric memory. It also supports multi-step tool use and agentic workflows, allowing it to interact with APIs, databases, and other external systems. The model handles multiple languages, making it applicable for global deployments. It is best suited for production applications such as intelligent search, document summarization, customer support automation, and complex data analysis pipelines.
Purpose-built to integrate with external knowledge sources, allowing responses to be grounded in real retrieved data rather than model memory alone.
Supports agentic workflows where the model can call APIs, query databases, and chain multiple tool interactions autonomously across a single task.
Handles complex multi-step reasoning tasks including question answering, analysis, and decision support across long-form inputs.
Processes up to 128,000 tokens in a single context, enabling analysis of lengthy documents, transcripts, or multi-document inputs.
Understands and generates text across multiple languages, supporting enterprise deployments that serve international users.
Generates structured and unstructured text for use cases such as summarization, drafting, classification, and data extraction.
Primary API pricing shown in the same “quick compare” spirit as the reference page.
Additional usage-cost dimensions synced into the project for this model.
Places where this model is available, based on the synced detail-page metadata.
Endpoint-level provider data currently available for this model.
Benchmark scores synced from the current model source and normalized into the local catalog.
| Benchmark | Score |
|---|---|
|
AIME 2024
American math olympiad problems
|
|
|
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
|
|
|
HLE
Questions that challenge frontier models across many domains
|
|
|
LiveCodeBench
Real-world coding tasks from recent competitions
|
|
|
MATH-500
Undergraduate and competition-level math problems
|
|
|
MMLU-Pro
Expert knowledge across 14 academic disciplines
|
|
|
SciCode
Scientific research coding and numerical methods
|
Official model cards, release notes, docs, and other references synced from the source page.
Command R+ discussions are most active in r/LocalLLaMA, r/SillyTavernAI. The strongest match in this snapshot has 482 upvotes and 213 comments.
What's new in 1.5:
* Up to 50% higher throughput and 25% lower latency
* Cut hardware requirements in half for Command R 1.5
* Enhanced multilingual capabilities with improved retrieval-augmented generation
* Better tool selection and usage
* Increased strengths in data analysis and creation
* More robustness to non-semantic prompt changes
* Declines to answer unsolvable questions
* Introducing configurable Safety Modes for nuanced content filtering
* Command R+ 1.5 priced at $2.50/M input tokens, $10/M output tokens
* Command R 1.5 priced at $0.15/M input tokens, $0.60/M output tokens
Blog link: [https://docs.cohere.com/changelog/command-gets-refreshed](https://docs.cohere.com/changelog/command-gets-refreshed)
Huggingface links:
Command R: [https://huggingface.co/CohereForAI/c4ai-command-r-08-2024](https://huggingface.co/CohereForAI/c4ai-command-r-08-2024)
Command R+: [https://huggingface.co/CohereForAI/c4ai-command-r-plus-08-2024](https://huggingface.co/CohereForAI/c4ai-command-r-plus-08-2024)
The team at Hugging Face recently refreshed the list of models available on HuggingChat. You can now try out the following models for free and use them to create assistants:
* [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/chat/models/Qwen/Qwen2.5-72B-Instruct)
* [ meta-llama/Llama-3.2-11B-Vision-Instruct](https://huggingface.co/chat/models/meta-llama/Llama-3.2-11B-Vision-Instruct) (with vision enabled!)
* [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/chat/models/mistralai/Mistral-Nemo-Instruct-2407)
* [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/chat/models/NousResearch/Hermes-3-Llama-3.1-8B)
* [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/chat/models/microsoft/Phi-3.5-mini-instruct)
We also have the following models that have [tool calling](https://huggingface.co/chat/tools) enabled:
* [meta-llama/Meta-Llama-3.1-70B-Instruct](https://huggingface.co/chat/models/meta-llama/Meta-Llama-3.1-70B-Instruct)
* [CohereForAI/c4ai-command-r-plus-08-2024](https://huggingface.co/chat/models/CohereForAI/c4ai-command-r-plus-08-2024)
Are there any other models you would like to see on HuggingChat? Feel free to let us know, we're always trying to showcase the models the community is most interested in!
Command R+ supports a context window of 128,000 tokens, allowing it to process large documents or extended conversation histories in a single request.
According to the model metadata, the training data cutoff is January 2023. The model does not have knowledge of events after that date unless provided via retrieval or context.
Command R+ is available on MindStudio via Amazon Bedrock. No separate API key setup is required to use it within MindStudio.
Command R+ is designed for enterprise workloads including retrieval-augmented generation, multi-step tool use, intelligent search, document summarization, customer support automation, and complex data analysis pipelines.
Yes. Command R+ supports multi-step tool calling, enabling it to interact with external APIs, databases, and other systems as part of agentic workflows.
Continue browsing adjacent models from the same provider.