Mistral

Mistral Medium 3

Mistral Medium 3 is a text generation model released on May 7, 2025 by Mistral, a French AI company. It is designed to balance performance with cost efficiency, priced at $0.40 per million input tokens and $2.00 per million output tokens. The model supports a 128,000-token context window and was trained on data through early 2025. It is available through Mistral La Plateforme and Amazon SageMaker, with additional platform support planned. Mistral Medium 3 is built with enterprise deployment in mind, supporting self-hosted setups with a minimum of four GPUs as well as any cloud environment. It can be customized through continuous pre-training, fine-tuning, and integration with enterprise knowledge bases, making it applicable to domain-specific workflows in sectors such as financial services, energy, and healthcare. The model is noted for its strengths in coding tasks and multimodal understanding, and is suited for use cases including customer service automation, business process personalization, and complex dataset analysis.

May 07, 2025 128,000 context 16,000 tokens output

Long Context Window Code Generation Multimodal Understanding Fine-Tuning Support Enterprise Deployment Cost-Efficient Pricing

Overview ↓ About ↓ Capabilities ↓ Pricing ↓ Price Comparison ↓ Providers ↓ Benchmarks ↓ Compare ↓ Tools ↓ Daily ↓ Resources ↓ Community ↓ FAQ ↓

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

Mistral

Model ID

The routed model identifier exposed by upstream providers.

mistralai/mistral-medium-3

Input Context Window

The number of tokens supported by the input context window.

128,000 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

16,000 tokens tokens

Open Source

Whether the model's code is available for public use.

Release Date

When the model was first released.

May 07, 2025 1 year ago

Knowledge Cut-off Date

When the model's knowledge was last updated.

2025

API Providers

The providers that offer this model. This is not an exhaustive list.

Mistral

Modalities

Types of data this model can process.

Text Image File

What is Mistral Medium 3

A fuller summary of positioning, capabilities, and source-specific details for Mistral Medium 3.

Mistral Medium 3 is a text generation model released on May 7, 2025 by Mistral, a French AI company. It is designed to balance performance with cost efficiency, priced at $0.40 per million input tokens and $2.00 per million output tokens. The model supports a 128,000-token context window and was trained on data through early 2025. It is available through Mistral La Plateforme and Amazon SageMaker, with additional platform support planned.

Mistral Medium 3 is built with enterprise deployment in mind, supporting self-hosted setups with a minimum of four GPUs as well as any cloud environment. It can be customized through continuous pre-training, fine-tuning, and integration with enterprise knowledge bases, making it applicable to domain-specific workflows in sectors such as financial services, energy, and healthcare. The model is noted for its strengths in coding tasks and multimodal understanding, and is suited for use cases including customer service automation, business process personalization, and complex dataset analysis.

Capabilities

What Mistral Medium 3 supports

CTX

Long Context Window

Processes up to 128,000 tokens in a single request, enabling analysis of long documents, codebases, or extended conversations without truncation.

</>

Code Generation

Generates, explains, and debugs code across common programming languages, with coding identified as one of the model's primary strengths.

Multimodal Understanding

Handles tasks requiring multimodal comprehension, supporting analysis that goes beyond plain text inputs as noted in the model's official overview.

Fine-Tuning Support

Supports continuous pre-training and comprehensive fine-tuning, allowing organizations to adapt the model to domain-specific datasets and workflows.

Enterprise Deployment

Can be deployed on any cloud environment or self-hosted on a minimum of four GPUs, with integration options for enterprise knowledge bases.

Cost-Efficient Pricing

Priced at $0.40 per million input tokens and $2.00 per million output tokens, positioning it as an accessible option for organizations managing AI inference costs.

Pricing for Mistral Medium 3

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Input tokens $0.40 Per million tokens

Output tokens $2.00 Per million tokens

Price Comparison

Additional usage-cost dimensions synced into the project for this model.

Cache read $0.04

maxTemperature 1

maxResponseSize 16,000 tokens

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

Mistral

Provider Endpoints

Endpoint-level provider data currently available for this model.

Mistral

1d uptime: 99.9% Supported params: 11 Implicit caching: No

Model Performance

Benchmark scores synced from the current model source and normalized into the local catalog.

Benchmark	Score
AIME 2024 American math olympiad problems	44.0%
GPQA Diamond PhD-level science questions (biology, physics, chemistry)	57.8%
HLE Questions that challenge frontier models across many domains	4.3%
LiveCodeBench Real-world coding tasks from recent competitions	40.0%
MATH-500 Undergraduate and competition-level math problems	90.7%
MMLU-Pro Expert knowledge across 14 academic disciplines	76.0%
SciCode Scientific research coding and numerical methods	33.1%

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Official Product Page Other

→

Product Announcement Announcements

→

Mistral API Documentation Documentation

→

Mistral La Plateforme Playground

→

OpenRouter Model Page OpenRouter

→