X.ai

Grok 3 Mini Fast

Grok 3 Mini Fast Beta is a compact text generation model developed by xAI, the AI division of X. It belongs to the Grok 3 model family and is designed to deliver faster response times compared to the full Grok 3 models, making it suitable for latency-sensitive applications. The model supports extended thinking, function calling, and real-time web search, and operates with a 131,072-token context window. Grok 3 Mini Fast Beta is well-suited for developers and businesses building high-throughput applications that require reasoning capability without the overhead of a larger model. Practical use cases include question answering, document summarization, data extraction, and tool-augmented agentic workflows. Its combination of speed, extended context, and tool integration makes it a practical option for production environments where response time is a priority.

April 2025 131,072 context 8,192 tokens output
Extended Thinking Function Calling Web Search Large Context Window Fast Inference Text Generation

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

X.ai

Input Context Window

The number of tokens supported by the input context window.

131,072 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

8,192 tokens tokens

Open Source

Whether the model's code is available for public use.

No

Release Date

When the model was first released.

April 2025

Knowledge Cut-off Date

When the model's knowledge was last updated.

April 2025

API Providers

The providers that offer this model. This is not an exhaustive list.

xAI API, OpenAI API

Modalities

Types of data this model can process.

Text

What is Grok 3 Mini Fast

A fuller summary of positioning, capabilities, and source-specific details for Grok 3 Mini Fast.

Grok 3 Mini Fast Beta is a compact text generation model developed by xAI, the AI division of X. It belongs to the Grok 3 model family and is designed to deliver faster response times compared to the full Grok 3 models, making it suitable for latency-sensitive applications. The model supports extended thinking, function calling, and real-time web search, and operates with a 131,072-token context window.

Grok 3 Mini Fast Beta is well-suited for developers and businesses building high-throughput applications that require reasoning capability without the overhead of a larger model. Practical use cases include question answering, document summarization, data extraction, and tool-augmented agentic workflows. Its combination of speed, extended context, and tool integration makes it a practical option for production environments where response time is a priority.

Capabilities

What Grok 3 Mini Fast supports

AI

Extended Thinking

Supports step-by-step reasoning before producing a final response, allowing the model to work through multi-step or complex problems more carefully.

AI

Function Calling

Enables the model to invoke external functions and tools, supporting integration into agentic and automated workflows.

AI

Web Search

Allows the model to retrieve real-time information from the web, enabling responses that reflect current events and up-to-date data.

CTX

Large Context Window

Handles up to 131,072 tokens in a single session, accommodating long documents, extended conversations, and complex multi-part inputs.

AI

Fast Inference

Optimized for low-latency responses within the Grok 3 family, making it suitable for high-throughput production applications.

AI

Text Generation

Generates coherent, contextually relevant text for tasks including summarization, question answering, and data extraction.

Pricing for Grok 3 Mini Fast

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Price Comparison

Additional usage-cost dimensions synced into the project for this model.

maxTemperature 1
maxResponseSize 8,192 tokens

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

xAI API OpenAI API

Model Performance

Benchmark scores synced from the current model source and normalized into the local catalog.

Benchmark Score
AIME 2024
American math olympiad problems
93.3%
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
79.1%
HLE
Questions that challenge frontier models across many domains
11.1%
LiveCodeBench
Real-world coding tasks from recent competitions
69.6%
MATH-500
Undergraduate and competition-level math problems
99.2%
MMLU-Pro
Expert knowledge across 14 academic disciplines
82.8%
SciCode
Scientific research coding and numerical methods
40.6%

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

FAQ

Common questions about Grok 3 Mini Fast

What is the context window size for Grok 3 Mini Fast Beta?

Grok 3 Mini Fast Beta supports a context window of 131,072 tokens, allowing it to process long documents and extended multi-turn conversations in a single session.

Where can I find pricing information for this model?

Pricing details are available on the xAI Models and Pricing page at https://docs.x.ai/developers/models.

What is the training data cutoff for Grok 3 Mini Fast Beta?

Based on available metadata, the model's training data has a cutoff of April 2025.

Does Grok 3 Mini Fast Beta support function calling and tool use?

Yes, the model supports function calling, enabling developers to integrate it into agentic workflows where it can invoke external tools and APIs.

How does Grok 3 Mini Fast Beta differ from other Grok 3 models?

Grok 3 Mini Fast Beta is a smaller, faster variant within the Grok 3 family, prioritizing lower latency and efficiency while retaining reasoning capabilities such as extended thinking and web search.

More models from X.ai

Continue browsing adjacent models from the same provider.

← All AI Models