Google

Gemini 2.5 Pro

Gemini 2.5 Pro is a thinking model developed by Google DeepMind, designed to reason through complex problems rather than simply predict outputs. It is built to analyze information, draw logical conclusions, and incorporate contextual nuance across tasks in code, mathematics, and STEM. The model supports native multimodality, meaning it can process text, images, audio, video, and code repositories within a single context. The model features a 1,048,576-token context window, making it suited for tasks that require processing large documents, entire codebases, or extended conversations. It scored 63.8% on the SWE-Bench Verified coding evaluation and is available through the Gemini API and Google AI Studio. It is best suited for developers and researchers working on complex reasoning tasks, long-document analysis, and advanced code generation.

Jun 17, 2025 1,048,576 context 65,536 tokens output
Extended Context Window Structured Reasoning Multimodal Input Tool Use Code Generation Math and STEM Analysis

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

Google

Model ID

The routed model identifier exposed by upstream providers.

google/gemini-2.5-pro

Input Context Window

The number of tokens supported by the input context window.

1,048,576 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

65,536 tokens tokens

Open Source

Whether the model's code is available for public use.

No

Release Date

When the model was first released.

Jun 17, 2025 11 months ago

Knowledge Cut-off Date

When the model's knowledge was last updated.

2025-01-31

API Providers

The providers that offer this model. This is not an exhaustive list.

Google, Gemini API, Google AI Studio

Modalities

Types of data this model can process.

Text Image Audio Video Code File

What is Gemini 2.5 Pro

A fuller summary of positioning, capabilities, and source-specific details for Gemini 2.5 Pro.

Gemini 2.5 Pro is a thinking model developed by Google DeepMind, designed to reason through complex problems rather than simply predict outputs. It is built to analyze information, draw logical conclusions, and incorporate contextual nuance across tasks in code, mathematics, and STEM. The model supports native multimodality, meaning it can process text, images, audio, video, and code repositories within a single context.

The model features a 1,048,576-token context window, making it suited for tasks that require processing large documents, entire codebases, or extended conversations. It scored 63.8% on the SWE-Bench Verified coding evaluation and is available through the Gemini API and Google AI Studio. It is best suited for developers and researchers working on complex reasoning tasks, long-document analysis, and advanced code generation.

Capabilities

What Gemini 2.5 Pro supports

CTX

Extended Context Window

Processes up to 1,048,576 tokens in a single context, enabling analysis of large documents, codebases, or long conversation histories without truncation.

RN

Structured Reasoning

Uses a thinking approach to work through multi-step problems, drawing logical conclusions before producing a final response rather than generating output directly.

MM

Multimodal Input

Accepts text, images, audio, video, and code as input within the same request, allowing mixed-media tasks to be handled in a single call.

TL

Tool Use

Supports function calling and external tool integration, allowing the model to invoke defined tools and return structured results as part of a response.

</>

Code Generation

Generates and debugs code across languages, achieving a 63.8% score on the SWE-Bench Verified benchmark for real-world software engineering tasks.

AI

Math and STEM Analysis

Handles complex mathematical reasoning and science problems, with benchmark performance cited across math and science evaluation suites.

Pricing for Gemini 2.5 Pro

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Price Comparison

Additional usage-cost dimensions synced into the project for this model.

Image input $1.25
Audio input $1.25
Web search $14000.00
Reasoning $10.00
Cache read $0.13
Cache write $0.38
maxTemperature 2
maxResponseSize 65,536 tokens

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

Google Gemini API Google AI Studio

Provider Endpoints

Endpoint-level provider data currently available for this model.

Google

Max output: 65,536 1d uptime: 97.6% Supported params: 11 Implicit caching: No

Google

Max output: 65,536 1d uptime: 91.0% Supported params: 11 Implicit caching: No

Google AI Studio

Max output: 65,536 1d uptime: 79.6% Supported params: 11 Implicit caching: No

Google

Max output: 65,536 1d uptime: 99.0% Supported params: 11 Implicit caching: No

Configuration & Parameters

The configurable options currently documented for this model.

Thinking Budget

Select
Default: auto
Off Manual Auto

Thinking Budget Limit

Number

Must be less than Max Response Size

Range: 1 - 24576

Priority Mode

Select
Default: false
Standard Priority (1.8x cost, higher reliability)

Supported Request Parameters

Parameters currently listed by OpenRouter or the local catalog for this model.

Thinking Budget Thinking Budget Limit Priority Mode

Model Performance

Benchmark scores synced from the current model source and normalized into the local catalog.

Benchmark Score
AIME 2024
American math olympiad problems
88.7%
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
84.4%
HLE
Questions that challenge frontier models across many domains
21.1%
LiveCodeBench
Real-world coding tasks from recent competitions
80.1%
MATH-500
Undergraduate and competition-level math problems
96.7%
MMLU-Pro
Expert knowledge across 14 academic disciplines
86.2%
SciCode
Scientific research coding and numerical methods
42.8%

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Community discussion

What people think about Gemini 2.5 Pro

Gemini 2.5 Pro discussions are most active in r/singularity, r/Bard, r/LocalLLaMA. Top Reddit threads cluster around benchmark and model-comparison threads, safety and censorship questions, coding workflow discussions.

The strongest match in this snapshot has 1704 upvotes and 48 comments.

r/SillyTavernAI 3 upvotes 16 comments February 17, 2026
Gemini 2.5 pro vs 3.0 pro vs flash

I used gemini 2.5 pro extensively back in the day (when it was 250/day free and then 50/day too), and it was by far my favorite model given its rare negativity bias. It's a real asshole towards the user and that is something that GLM doesn't show at all. I love GLM, but I want the model to be mean to me too...

So, how does 3 pro and 3 flash compare to 2.5 pro?

I recently got access to both 2.5 and 3 models, and am wondering if I should stick with flash or go with pro. Pro obviously costs more/has less messages a day, and I am wondering if there is a distinct difference or if it doesn't really matter with RP.

I don't ever do RPG RPs which seem to be favorable, but rather group and single RPs focusing on drama, tension, strife, romance, and so on. Oh, and a quick praise for GLM 4.7 (my second favorite model), it does really reallt good multi-character bots!

Open Reddit thread
View more discussions →
FAQ

Common questions about Gemini 2.5 Pro

What is the context window size for Gemini 2.5 Pro?

Gemini 2.5 Pro has a context window of 1,048,576 tokens, which allows it to process large documents, long codebases, or extended conversations in a single request.

What is the knowledge cutoff date for Gemini 2.5 Pro?

Based on the model metadata, the training data cutoff is June 2025.

What input types does Gemini 2.5 Pro support?

Gemini 2.5 Pro supports multimodal inputs including text, images, audio, video, and code. On MindStudio, it also accepts select, number, and tools input types.

What tasks is Gemini 2.5 Pro best suited for?

The model is designed for complex reasoning tasks including advanced code generation, mathematical problem solving, STEM analysis, and processing large documents or codebases using its long context window.

Does Gemini 2.5 Pro support function calling and tool use?

Yes, Gemini 2.5 Pro supports tool use and function calling, allowing it to integrate with external tools and return structured outputs as part of its responses.

More models from Google

Continue browsing adjacent models from the same provider.

← All AI Models