Google

Gemini 3.5 Flash

Gemini 3.5 Flash provides sustained frontier-level intelligence optimized for real-world tasks at a higher speed and lower cost. Designed for the agentic era, it excels at sub-agent deployment, multi-step workflows, and long-horizon tasks at scale. This model is particularly effective for rapid agentic loops involving complex coding cycles and iterations.

May 19, 2026 $9.00 /MTok context 65,536 tokens output

Text Image File Video Tools Structured Output

Overview ↓ About ↓ Capabilities ↓ Pricing ↓ Price Comparison ↓ Providers ↓ Compare ↓ Tools ↓ Daily ↓ Resources ↓

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

Google

Model ID

The routed model identifier exposed by upstream providers.

google/gemini-3.5-flash

Input Context Window

The number of tokens supported by the input context window.

$9.00 /MTok tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

65,536 tokens tokens

Open Source

Whether the model's code is available for public use.

Release Date

When the model was first released.

May 19, 2026 2 months ago

Knowledge Cut-off Date

When the model's knowledge was last updated.

2025-01-01

API Providers

The providers that offer this model. This is not an exhaustive list.

Google, Google AI Studio

Modalities

Types of data this model can process.

Text Image Audio Video File

What is Gemini 3.5 Flash

A fuller summary of positioning, capabilities, and source-specific details for Gemini 3.5 Flash.

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Capabilities

What Gemini 3.5 Flash supports

Reasoning Controls

OpenRouter lists GPT-5.5 with reasoning support and explicit reasoning-related request parameters.

JSON

Structured Outputs

Structured output settings are exposed through OpenRouter for schema-driven or format-controlled responses.

Tool Calling

Tool invocation and tool selection are supported in the routed OpenRouter interface for this model.

Multimodal I/O

This model accepts text input, image input, file input, audio input, video input and returns text output.

CTX

Large Context Window

OpenRouter currently lists a context window of $9.00 /MTok with up to 65,536 tokens maximum output tokens.

Pricing for Gemini 3.5 Flash

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Input tokens $1.50 Per million tokens

Output tokens $9.00 Per million tokens

Price Comparison

Additional usage-cost dimensions synced into the project for this model.

maxTemperature 2

maxResponseSize 65,536 tokens

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

Google Google AI Studio

Provider Endpoints

Endpoint-level provider data currently available for this model.

Google

Max output: 65,536 1d uptime: 99.4% Supported params: 12 Implicit caching: Yes

Google

Max output: 65,536 1d uptime: 99.4% Supported params: 12 Implicit caching: Yes

Google

Max output: 65,536 1d uptime: 99.4% Supported params: 12 Implicit caching: Yes

Google AI Studio

Max output: 65,536 1d uptime: 99.5% Supported params: 11 Implicit caching: Yes

Google AI Studio

Max output: 65,536 1d uptime: 99.5% Supported params: 11 Implicit caching: Yes

Google AI Studio

Max output: 65,536 1d uptime: 99.5% Supported params: 11 Implicit caching: Yes

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Official Website

→

OpenRouter Model Page OpenRouter

→