Google

Veo 2

Text-to-Video, Image-to-Video, Physics-Aware Rendering, Cinematic Camera Control, High-Resolution Output, SynthID Watermarking

Model Overview

Provider: Google
The entity that provides this model.

Input Context Window: 5,000 tokens
The number of tokens supported by the input context window.

Maximum Output Tokens: N/A
The number of tokens that can be generated by the model in a single request.

Open Source: not specified
Whether the model's code is available for public use.

Release Date: Unknown (generally available since April 2025 via the Gemini API)
When the model was first released.

Knowledge Cut-off Date: Unknown
When the model's knowledge was last updated.

API Providers: Google (Gemini API, Vertex AI)
The providers that offer this model. This is not an exhaustive list.

Modalities: text and image input, video output
Types of data this model can process.

What is Veo 2

Veo 2 is Google's production-ready video generation model, made generally available in April 2025 via the Gemini API under the model ID veo-2.0-generate-001. It accepts text prompts and reference images as input and generates high-definition video at resolutions up to 4K. The model includes physics-aware rendering that handles fluid dynamics, lighting, and object interactions, and it embeds a SynthID watermark in every generated video to identify AI-created content.

Veo 2 is available through both the Gemini API and Google's Vertex AI platform, making it accessible to developers via standard API calls without specialized infrastructure. It supports cinematic prompt controls such as aerial shots, panning, and time-lapses, and maintains consistent character appearance across scenes. The model is suited for developers, marketers, creative professionals, and educators who need to generate video content programmatically for use cases like product demos, ad campaigns, and educational visualizations.

Capabilities

What Veo 2 supports

Text-to-Video

Generates video clips from written text prompts, interpreting scene descriptions, camera directions, and stylistic cues to produce coherent video output.

Image-to-Video

Animates a reference image into a video sequence, using the provided image as the visual starting point for the generated clip.

Physics-Aware Rendering

Models realistic physical behavior including fluid dynamics, lighting interactions, and object motion to produce visually consistent scenes.

Cinematic Camera Control

Responds to prompts describing specific camera movements such as aerial shots, panning, tracking, and time-lapses.

High-Resolution Output

Supports video generation at resolutions up to 4K, suitable for professional and commercial production workflows.

SynthID Watermarking

Embeds an imperceptible SynthID watermark in every generated video to enable identification of AI-created content.

API & Vertex AI Access

Available through both the Gemini API and Google Vertex AI, allowing integration via standard REST or SDK calls.
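As a rough illustration of the SDK route, the sketch below submits a text prompt and polls until the job finishes. It assumes the google-genai Python SDK, where client.models.generate_videos returns a long-running operation that is refreshed with client.operations.get. The helper name generate_clip, the polling interval, and the config key names are assumptions for illustration, and the client is passed in so the function has no hard dependency on the SDK.

```python
import time

MODEL_ID = "veo-2.0-generate-001"  # model ID given in this page's summary


def generate_clip(client, prompt, aspect_ratio="16:9", duration_seconds=5,
                  poll_interval=20.0, sleep=time.sleep):
    """Submit a text-to-video request and block until the operation is done.

    `client` is assumed to be a google-genai `genai.Client()`. Only the
    attributes used below are required, so a stub also works for testing.
    """
    operation = client.models.generate_videos(
        model=MODEL_ID,
        prompt=prompt,
        # Assumption: the SDK accepts a plain dict in place of its
        # GenerateVideosConfig type.
        config={
            "aspect_ratio": aspect_ratio,
            "duration_seconds": duration_seconds,
        },
    )
    # Video generation is asynchronous, so poll the long-running operation.
    while not operation.done:
        sleep(poll_interval)
        operation = client.operations.get(operation)
    return operation.response.generated_videos
```

With a real client this would be called along the lines of generate_clip(genai.Client(), "Aerial shot of a coastline at sunrise"). Downloading and saving the returned videos is left out of the sketch.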

Pricing for Veo 2

No per-token API pricing is listed for this model.

Input tokens: N/A (per million tokens)

Output tokens: N/A (per million tokens)

Price Comparison

Additional usage-cost dimensions listed for this model. The only value present is a generation setting rather than a price.

maxTemperature: 1

API Access & Providers

Places where this model is available.

Google Vertex AI, Gemini API

Configuration & Parameters

The configurable options currently documented for this model.

Duration (select)
Default: 5 seconds
Options: 5s, 6s, 7s, 8s

Aspect Ratio (select)
Default: 16:9
Options: 16:9 (landscape), 9:16 (portrait)
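The documented options can be checked before a request is sent. The sketch below is a minimal validator built on that idea. The function name build_veo2_params and the output key names (duration_seconds, aspect_ratio) are hypothetical and would need to match whichever API surface is actually used.

```python
ALLOWED_DURATIONS = (5, 6, 7, 8)          # seconds, as listed above
ALLOWED_ASPECT_RATIOS = ("16:9", "9:16")  # landscape, portrait


def build_veo2_params(duration=5, aspect_ratio="16:9"):
    """Return a parameter dict, rejecting values outside the listed options."""
    if duration not in ALLOWED_DURATIONS:
        raise ValueError(
            f"duration must be one of {ALLOWED_DURATIONS}, got {duration!r}"
        )
    if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        raise ValueError(
            f"aspect_ratio must be one of {ALLOWED_ASPECT_RATIOS}, "
            f"got {aspect_ratio!r}"
        )
    return {"duration_seconds": duration, "aspect_ratio": aspect_ratio}
```

Calling build_veo2_params() with no arguments yields the documented defaults, while an unsupported value such as duration=10 raises ValueError before any request is made.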

Supported Request Parameters

Parameters currently listed by OpenRouter or the local catalog for this model.

Duration, Aspect Ratio

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Official Veo Documentation (Documentation)
Release Notes (Documentation)
Announcement Blog Post (Announcements)
Vertex AI API Reference (Documentation)
Veo on Vertex AI (Announcements)
Google AI Studio Playground (Playground)
Gemini API Pricing (Documentation)

Community discussion

What people think about Veo 2

Veo 2 discussions are most active in r/singularity, r/OpenAI, and r/Bard. The top Reddit threads are mostly benchmark and model-comparison posts. The strongest match in this snapshot has 2,155 upvotes and 215 comments.

Veo 2 vs. Sora
r/ChatGPT, 2,155 upvotes, 215 comments, December 18, 2024

Comparing video generation AI to slicing steak, including Veo 2
r/singularity, 1,253 upvotes, 297 comments, December 17, 2024

Everything here is 100% generated w/ Google Veo 2
r/singularity, 1,398 upvotes, 242 comments, December 17, 2024

Google about to announce Veo 2
r/singularity, 1,196 upvotes, 223 comments, December 16, 2024
"Saw a bunch of videos on the deepmind YouTube channel pop up"

AI Influencers are Coming[Google Veo 2]
r/singularity, 530 upvotes, 266 comments, January 3, 2025

FAQ

Common questions about Veo 2

What is the context window for Veo 2?

Veo 2 has a context window of 5,000 tokens, which applies to the text prompt input used to describe the video to be generated.

When was Veo 2 made generally available?

Veo 2 became generally available in April 2025, released via the Gemini API under the model ID veo-2.0-generate-001.

What input types does Veo 2 accept?

Veo 2 accepts text prompts and images as inputs, supporting both text-to-video and image-to-video generation workflows.

Where can I access Veo 2 via API?

Veo 2 is accessible through the Gemini API and Google's Vertex AI platform. Both provide standard API interfaces for integrating video generation into applications.

Does Veo 2 watermark its generated videos?

Yes. All videos generated by Veo 2 include an embedded SynthID watermark, which is Google's tool for identifying AI-generated content.

What is the maximum output resolution supported by Veo 2?

Veo 2 supports video output at resolutions up to 4K, making it suitable for professional and high-definition production use cases.
