Chain-of-Thought Reasoning
Generates step-by-step reasoning traces for complex, multi-step problems. Reasoning mode is enabled by default in this variant of the model.
Grok 4.1 Fast Reasoning is a text generation model developed by xAI, the AI division of X. It is designed specifically for agentic and tool-calling workflows, trained through reinforcement learning in simulated environments across dozens of tool-use domains. The model supports a 2-million-token context window, accepts both text and image inputs, and produces text outputs with chain-of-thought reasoning enabled. The model is best suited for developers building autonomous agents, enterprise automation pipelines, and multi-step research or customer support applications. It supports structured outputs, function calling, and a range of tool integrations including web search, X search, code execution, file retrieval, and MCP tool integrations via the Agent Tools API. Its training cutoff is November 2025, and it is available through the xAI API as well as third-party cloud providers such as Oracle Cloud.
High-signal model metadata in a structured two-column overview table.
The entity that provides this model.
The number of tokens supported by the input context window.
The number of tokens that can be generated by the model in a single request.
Whether the model's code is available for public use.
When the model was first released.
When the model's knowledge was last updated.
The providers that offer this model. This is not an exhaustive list.
Types of data this model can process.
A fuller summary of positioning, capabilities, and source-specific details for Grok 4.1 Fast Reasoning.
Grok 4.1 Fast Reasoning is a text generation model developed by xAI, the AI division of X. It is designed specifically for agentic and tool-calling workflows, trained through reinforcement learning in simulated environments across dozens of tool-use domains. The model supports a 2-million-token context window, accepts both text and image inputs, and produces text outputs with chain-of-thought reasoning enabled.
The model is best suited for developers building autonomous agents, enterprise automation pipelines, and multi-step research or customer support applications. It supports structured outputs, function calling, and a range of tool integrations including web search, X search, code execution, file retrieval, and MCP tool integrations via the Agent Tools API. Its training cutoff is November 2025, and it is available through the xAI API as well as third-party cloud providers such as Oracle Cloud.
Generates step-by-step reasoning traces for complex, multi-step problems. Reasoning mode is enabled by default in this variant of the model.
Supports a context window of up to 2 million tokens, enabling processing of very long documents or extended multi-turn agent sessions.
Supports web search, X search, code execution, file retrieval, and MCP tool integrations through the Agent Tools API for autonomous task completion.
Returns structured JSON outputs and supports function calling via API, making it suitable for integration into typed application workflows.
Accepts both text and image inputs, producing text outputs, which allows it to handle tasks that involve visual content alongside written instructions.
Optimized for low-latency responses relative to full reasoning models, making it practical for real-time or high-throughput agentic applications.
Primary API pricing shown in the same “quick compare” spirit as the reference page.
Additional usage-cost dimensions synced into the project for this model.
Places where this model is available, based on the synced detail-page metadata.
Benchmark scores synced from the current model source and normalized into the local catalog.
| Benchmark | Score |
|---|---|
|
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
|
|
|
HLE
Questions that challenge frontier models across many domains
|
|
|
LiveCodeBench
Real-world coding tasks from recent competitions
|
|
|
MMLU-Pro
Expert knowledge across 14 academic disciplines
|
|
|
SciCode
Scientific research coding and numerical methods
|
Official model cards, release notes, docs, and other references synced from the source page.
Grok 4.1 Fast Reasoning supports a context window of up to 2 million tokens, which allows it to process very long documents or maintain extended multi-turn conversations within a single request.
The model's training data cutoff is November 2025, meaning it does not have knowledge of events or information published after that date unless provided via tool use or in-context retrieval.
Grok 4.1 Fast Reasoning generates chain-of-thought reasoning traces as part of its response process, working through complex or multi-step problems before producing a final answer. This is the defining difference between this variant and the non-reasoning version of Grok 4.1 Fast.
The model supports web search, X (Twitter) search, code execution, file retrieval, and MCP tool integrations through xAI's Agent Tools API. It also supports structured outputs and function calling via the standard API.
The model is available through the xAI API and is also accessible via third-party cloud providers, including Oracle Cloud Infrastructure's Generative AI service.
Continue browsing adjacent models from the same provider.