X.ai

Grok 3 Mini Fast

Grok 3 Mini Fast Beta is a compact text generation model developed by xAI, the AI division of X. It belongs to the Grok 3 model family and is designed to deliver faster response times compared to the full Grok 3 models, making it suitable for latency-sensitive applications. The model supports extended thinking, function calling, and real-time web search, and operates with a 131,072-token context window. Grok 3 Mini Fast Beta is well-suited for developers and businesses building high-throughput applications that require reasoning capability without the overhead of a larger model. Practical use cases include question answering, document summarization, data extraction, and tool-augmented agentic workflows. Its combination of speed, extended context, and tool integration makes it a practical option for production environments where response time is a priority.

April 2025 131,072 context 8,192 tokens output

Extended Thinking Function Calling Web Search Large Context Window Fast Inference Text Generation

Overview ↓ About ↓ Capabilities ↓ Pricing ↓ Price Comparison ↓ Benchmarks ↓ Tools ↓ Resources ↓ FAQ ↓

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

X.ai

Input Context Window

The number of tokens supported by the input context window.

131,072 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

8,192 tokens tokens

Open Source

Whether the model's code is available for public use.

Release Date

When the model was first released.

April 2025

Knowledge Cut-off Date

When the model's knowledge was last updated.

April 2025

API Providers

The providers that offer this model. This is not an exhaustive list.

xAI API, OpenAI API

Modalities

Types of data this model can process.

Text

What is Grok 3 Mini Fast

A fuller summary of positioning, capabilities, and source-specific details for Grok 3 Mini Fast.

Grok 3 Mini Fast Beta is a compact text generation model developed by xAI, the AI division of X. It belongs to the Grok 3 model family and is designed to deliver faster response times compared to the full Grok 3 models, making it suitable for latency-sensitive applications. The model supports extended thinking, function calling, and real-time web search, and operates with a 131,072-token context window.

Grok 3 Mini Fast Beta is well-suited for developers and businesses building high-throughput applications that require reasoning capability without the overhead of a larger model. Practical use cases include question answering, document summarization, data extraction, and tool-augmented agentic workflows. Its combination of speed, extended context, and tool integration makes it a practical option for production environments where response time is a priority.

Capabilities

What Grok 3 Mini Fast supports

Extended Thinking

Supports step-by-step reasoning before producing a final response, allowing the model to work through multi-step or complex problems more carefully.

Function Calling

Enables the model to invoke external functions and tools, supporting integration into agentic and automated workflows.

Web Search

Allows the model to retrieve real-time information from the web, enabling responses that reflect current events and up-to-date data.

CTX

Large Context Window

Handles up to 131,072 tokens in a single session, accommodating long documents, extended conversations, and complex multi-part inputs.

Fast Inference

Optimized for low-latency responses within the Grok 3 family, making it suitable for high-throughput production applications.

Text Generation

Generates coherent, contextually relevant text for tasks including summarization, question answering, and data extraction.

Pricing for Grok 3 Mini Fast

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Input tokens $0.60 Per million tokens

Output tokens N/A Per million tokens

Price Comparison

Additional usage-cost dimensions synced into the project for this model.

maxTemperature 1

maxResponseSize 8,192 tokens

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

xAI API OpenAI API

Model Performance

Benchmark scores synced from the current model source and normalized into the local catalog.

Benchmark	Score
AIME 2024 American math olympiad problems	93.3%
GPQA Diamond PhD-level science questions (biology, physics, chemistry)	79.1%
HLE Questions that challenge frontier models across many domains	11.1%
LiveCodeBench Real-world coding tasks from recent competitions	69.6%
MATH-500 Undergraduate and competition-level math problems	99.2%
MMLU-Pro Expert knowledge across 14 academic disciplines	82.8%
SciCode Scientific research coding and numerical methods	40.6%

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Official Documentation Documentation

→

Models and Pricing Documentation

→

Model Card (Inworld AI) Other

→

xAI API Playground Playground

→

xAI Developer Docs Documentation

→

FAQ

Common questions about Grok 3 Mini Fast

What is the context window size for Grok 3 Mini Fast Beta?

Grok 3 Mini Fast Beta supports a context window of 131,072 tokens, allowing it to process long documents and extended multi-turn conversations in a single session.

Where can I find pricing information for this model?

Pricing details are available on the xAI Models and Pricing page at https://docs.x.ai/developers/models.

What is the training data cutoff for Grok 3 Mini Fast Beta?

Based on available metadata, the model's training data has a cutoff of April 2025.

Does Grok 3 Mini Fast Beta support function calling and tool use?

Yes, the model supports function calling, enabling developers to integrate it into agentic workflows where it can invoke external tools and APIs.

How does Grok 3 Mini Fast Beta differ from other Grok 3 models?

Grok 3 Mini Fast Beta is a smaller, faster variant within the Grok 3 family, prioritizing lower latency and efficiency while retaining reasoning capabilities such as extended thinking and web search.

More models from X.ai

Continue browsing adjacent models from the same provider.

← All AI Models