Extended Context Window
Processes up to 1,048,576 tokens in a single context, enabling analysis of large documents, codebases, or long conversation histories without truncation.
Gemini 2.5 Pro is a thinking model developed by Google DeepMind, designed to reason through complex problems rather than simply predict outputs. It is built to analyze information, draw logical conclusions, and incorporate contextual nuance across tasks in code, mathematics, and STEM. The model supports native multimodality, meaning it can process text, images, audio, video, and code repositories within a single context. The model features a 1,048,576-token context window, making it suited for tasks that require processing large documents, entire codebases, or extended conversations. It scored 63.8% on the SWE-Bench Verified coding evaluation and is available through the Gemini API and Google AI Studio. It is best suited for developers and researchers working on complex reasoning tasks, long-document analysis, and advanced code generation.
High-signal model metadata in a structured two-column overview table.
The entity that provides this model.
The routed model identifier exposed by upstream providers.
The number of tokens supported by the input context window.
The number of tokens that can be generated by the model in a single request.
Whether the model's code is available for public use.
When the model was first released.
When the model's knowledge was last updated.
The providers that offer this model. This is not an exhaustive list.
Types of data this model can process.
A fuller summary of positioning, capabilities, and source-specific details for Gemini 2.5 Pro.
Gemini 2.5 Pro is a thinking model developed by Google DeepMind, designed to reason through complex problems rather than simply predict outputs. It is built to analyze information, draw logical conclusions, and incorporate contextual nuance across tasks in code, mathematics, and STEM. The model supports native multimodality, meaning it can process text, images, audio, video, and code repositories within a single context.
The model features a 1,048,576-token context window, making it suited for tasks that require processing large documents, entire codebases, or extended conversations. It scored 63.8% on the SWE-Bench Verified coding evaluation and is available through the Gemini API and Google AI Studio. It is best suited for developers and researchers working on complex reasoning tasks, long-document analysis, and advanced code generation.
Processes up to 1,048,576 tokens in a single context, enabling analysis of large documents, codebases, or long conversation histories without truncation.
Uses a thinking approach to work through multi-step problems, drawing logical conclusions before producing a final response rather than generating output directly.
Accepts text, images, audio, video, and code as input within the same request, allowing mixed-media tasks to be handled in a single call.
Supports function calling and external tool integration, allowing the model to invoke defined tools and return structured results as part of a response.
Generates and debugs code across languages, achieving a 63.8% score on the SWE-Bench Verified benchmark for real-world software engineering tasks.
Handles complex mathematical reasoning and science problems, with benchmark performance cited across math and science evaluation suites.
Primary API pricing shown in the same “quick compare” spirit as the reference page.
Additional usage-cost dimensions synced into the project for this model.
Places where this model is available, based on the synced detail-page metadata.
Endpoint-level provider data currently available for this model.
The configurable options currently documented for this model.
Must be less than Max Response Size
Parameters currently listed by OpenRouter or the local catalog for this model.
Benchmark scores synced from the current model source and normalized into the local catalog.
| Benchmark | Score |
|---|---|
|
AIME 2024
American math olympiad problems
|
|
|
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
|
|
|
HLE
Questions that challenge frontier models across many domains
|
|
|
LiveCodeBench
Real-world coding tasks from recent competitions
|
|
|
MATH-500
Undergraduate and competition-level math problems
|
|
|
MMLU-Pro
Expert knowledge across 14 academic disciplines
|
|
|
SciCode
Scientific research coding and numerical methods
|
Official model cards, release notes, docs, and other references synced from the source page.
Gemini 2.5 Pro discussions are most active in r/singularity, r/Bard, r/LocalLLaMA. Top Reddit threads cluster around benchmark and model-comparison threads, safety and censorship questions, coding workflow discussions.
The strongest match in this snapshot has 1704 upvotes and 48 comments.
I used gemini 2.5 pro extensively back in the day (when it was 250/day free and then 50/day too), and it was by far my favorite model given its rare negativity bias. It's a real asshole towards the user and that is something that GLM doesn't show at all. I love GLM, but I want the model to be mean to me too...
So, how does 3 pro and 3 flash compare to 2.5 pro?
I recently got access to both 2.5 and 3 models, and am wondering if I should stick with flash or go with pro. Pro obviously costs more/has less messages a day, and I am wondering if there is a distinct difference or if it doesn't really matter with RP.
I don't ever do RPG RPs which seem to be favorable, but rather group and single RPs focusing on drama, tension, strife, romance, and so on. Oh, and a quick praise for GLM 4.7 (my second favorite model), it does really reallt good multi-character bots!
About 4 hours ago i could still use Gemini 2.5 Pro in AI Studio, but now it has disappeared. I thought it would get retired a couple of months from now, not today?
how I can get gemini 2.5 pro for roleplay . any cheap way or some tricks. plz plz help
Why am I not paying like 200 bucks per month for it? It is the best model ever and destroys any of open ai's models. It feels illegal. Doesn't make sense. Free in ai studio + Best model ever. I love GOOGLE (especially Logan).
Gemini 2.5 Pro has a context window of 1,048,576 tokens, which allows it to process large documents, long codebases, or extended conversations in a single request.
Based on the model metadata, the training data cutoff is June 2025.
Gemini 2.5 Pro supports multimodal inputs including text, images, audio, video, and code. On MindStudio, it also accepts select, number, and tools input types.
The model is designed for complex reasoning tasks including advanced code generation, mathematical problem solving, STEM analysis, and processing large documents or codebases using its long context window.
Yes, Gemini 2.5 Pro supports tool use and function calling, allowing it to integrate with external tools and return structured outputs as part of its responses.
Continue browsing adjacent models from the same provider.