Google

Imagen 4 Ultra

Imagen 4 Ultra is Google's flagship image generation model and the top tier of the Imagen 4 family, trained through early 2025. It accepts text prompts of up to 10,000 tokens and is designed to handle complex, multi-element descriptions including specific art styles, multi-scene compositions, and nuanced visual storytelling. The model supports image URL arrays as input, allowing users to reference existing images alongside text prompts. It is licensed for commercial use, making it available to businesses and creative professionals working on production-grade projects. Imagena 4 Ultra is best suited for use cases where image fidelity and detail are priorities, such as professional design work, advertising, and high-resolution visual content creation. It covers a wide range of output styles, from photorealistic portraits and landscapes to stylized illustrations and pixel art. According to community benchmarking discussions, Imagen 4 Ultra has achieved competitive Elo ratings in image arenas, including a reported tie with GPT-Image-1 in the Image Arena as of mid-2025. The model is accessible via the Google Gemini API as well as third-party inference platforms such as fal.ai.

Unknown 10,000 context N/A output

Text-to-Image Generation Image URL Input Style Selection Commercial Licensing High-Resolution Output API Access

Overview ↓ About ↓ Capabilities ↓ Pricing ↓ Price Comparison ↓ Parameters ↓ Tools ↓ Daily ↓ Resources ↓ Community ↓ FAQ ↓

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

Google

Input Context Window

The number of tokens supported by the input context window.

10,000 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

N/A tokens

Open Source

Whether the model's code is available for public use.

Release Date

When the model was first released.

Unknown

Knowledge Cut-off Date

When the model's knowledge was last updated.

Unknown

API Providers

The providers that offer this model. This is not an exhaustive list.

Google, Gemini API

Modalities

Types of data this model can process.

Image Text

What is Imagen 4 Ultra

A fuller summary of positioning, capabilities, and source-specific details for Imagen 4 Ultra.

Imagen 4 Ultra is Google's flagship image generation model and the top tier of the Imagen 4 family, trained through early 2025. It accepts text prompts of up to 10,000 tokens and is designed to handle complex, multi-element descriptions including specific art styles, multi-scene compositions, and nuanced visual storytelling. The model supports image URL arrays as input, allowing users to reference existing images alongside text prompts. It is licensed for commercial use, making it available to businesses and creative professionals working on production-grade projects.

Imagena 4 Ultra is best suited for use cases where image fidelity and detail are priorities, such as professional design work, advertising, and high-resolution visual content creation. It covers a wide range of output styles, from photorealistic portraits and landscapes to stylized illustrations and pixel art. According to community benchmarking discussions, Imagen 4 Ultra has achieved competitive Elo ratings in image arenas, including a reported tie with GPT-Image-1 in the Image Arena as of mid-2025. The model is accessible via the Google Gemini API as well as third-party inference platforms such as fal.ai.

Capabilities

What Imagen 4 Ultra supports

IMG

Text-to-Image Generation

Generates images from text prompts with up to 10,000 tokens, enabling detailed and complex scene descriptions.

IMG

Image URL Input

Accepts arrays of image URLs as input, allowing reference images to be passed alongside text prompts for guided generation.

Style Selection

Supports a select input type for specifying output styles, covering photorealistic, illustrated, and stylized visual modes.

Commercial Licensing

Licensed for commercial applications, making generated images usable in business and professional production contexts.

High-Resolution Output

Produces high-resolution images suited for professional and commercial use cases where detail and fidelity are required.

API

API Access

Available via the Google Gemini API and third-party platforms like fal.ai, with documented endpoints for programmatic integration.

Pricing for Imagen 4 Ultra

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Input tokens N/A Per million tokens

Output tokens N/A Per million tokens

Price Comparison

Additional usage-cost dimensions synced into the project for this model.

maxTemperature 1

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

Google Gemini API

Configuration & Parameters

The configurable options currently documented for this model.

Source Images

Image URL Array

If you want to edit an existing image, provide the URL(s) or variables

Aspect Ratio

Select

Default: 16:9

1:1 16:9 9:16 3:4 4:3 2:3 3:2

Supported Request Parameters

Parameters currently listed by OpenRouter or the local catalog for this model.

Source Images Aspect Ratio

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Model Page on fal.ai Other

→

Interactive Playground Playground

→

API Reference Documentation

→

Model Overview & Benchmarks Other

→

Google Gemini API Pricing Documentation

→

Imagen on Google DeepMind Announcements

→

Imagen API Documentation (Google AI for Developers) Documentation

→

AI tools related to Imagen 4 Ultra

These tools are strongly connected to Imagen 4 Ultra through direct product references, provider mentions, or explicit model mappings.

AI Assistant

Summarizing Texts - Chrome Extension

The Summarizing Texts Chrome extension is an AI-powered tool designed to condense long documents into concise summaries. By utilizing algorithms to extract and synthesize the most important information, it serves as an efficient solution for various tasks, ranging from business documentation to marketing content. Additionally, it includes features for paraphrasing and rewriting text.

Free 17 saves

AI Agent

Plus AI for Google Slides™

Plus AI for Google Slides™ is an AI-powered presentation tool that enables users to create and edit slide decks directly within Google Slides. Using AI technology similar to ChatGPT and Duet AI, it generates outlines, builds initial drafts, suggests edits, creates custom themes, and maintains design consistency across team presentations. Additionally, users can integrate data from various analytics tools and applications using Plus Snapshots.

Free 0 visits 10 saves

AI Background Remover

YouTube Create

YouTube Create is a user-friendly video editing application designed to simplify the creation of high-quality videos without requiring complex software. It includes tools such as filters, effects, royalty-free music, voiceover recording, and auto-captions to help engage your audience. The app enables users to combine video, photo, and audio files, trim clips, apply transitions, adjust playback speed, and more.

Free 0 visits 9 saves

AI Marketing

Branalyzer Brand's Instant Analyzer - Chrome Extension

Branalyzer Brand's Instant Analyzer is a Chrome extension that provides rapid insights into a website's traffic and key performance metrics. It delivers immediate access to essential brand data, including SEO statistics, competitor analysis, social media metrics, ad performance, email contact information, and Trustpilot reviews. Designed for efficiency, the extension presents critical information upfront while offering seamless integration with the full Branalyzer platform for more comprehensive analysis.

Free 8 saves

Related Daily Briefs

Recent daily stories tied to Imagen 4 Ultra through direct model mentions or provider-level coverage.

Frontier Models

Hugging Face launches Mixture-of-Experts; Google DeepMind launches Flash-Lite; Hugging Face update lands

Hugging Face and Google are pushing more practical AI product shifts.

2026-07-21 AI Models AI API

Frontier Models

Google DeepMind, Alibaba, and Hugging Face Signal a Broader Shift Around Run AI

Google and Qwen move deeper into real workflows.

2026-07-21 AI API Integration

Frontier Models

Cohere, Mistral, and Google DeepMind Signal a Broader Shift Around LevelField-1

Mistral and Google move deeper into real workflows.

2026-07-21 AI Models Partnership

Frontier Models

OpenAI and Google DeepMind Signal a Broader Shift Around Explores Neural Network

OpenAI and Google are raising the stakes for enterprise adoption.

2026-07-10 AI Models

Community discussion

What people think about Imagen 4 Ultra

Imagen 4 Ultra discussions are most active in r/Bard, r/GeminiAI, r/singularity. Top Reddit threads cluster around benchmark and model-comparison threads, safety and censorship questions.

The strongest match in this snapshot has 347 upvotes and 62 comments.

r/singularity 347 upvotes 62 comments December 31, 2025

Alibaba drops Qwen-Image-2512: New strongest open-source image model that rivals Gemini 3 Pro and Imagen 4

Alibaba has officially ended 2025 by releasing **Qwen-Image-2512**, currently the world’s strongest open-source text-to-image model. Benchmarks from the AI Arena confirm it is now performing within the same tier as Google’s flagship proprietary models.

**The Performance Data:** In over 10,000 blind evaluation rounds, **Qwen-Image-2512** effectively matching Imagen 4 Ultra and challenging **Gemini 3 Pro.**

This is the **first time** an open-source weights model has consistently rivaled the top three closed-source giants in visual fidelity.

**Key Upgrades:**

**Skin & Hair Realism:** The model features a specific architectural update to reduce the **"AI plastic look"** focusing on natural skin pores and realistic hair textures.

**Complex Material Rendering:** Significant improvements in difficult-to-render textures like water ripples, landscapes and animal fur.

**Layout & Text Quality:** Building on the Qwen-VL foundation, it handles multi-line text and professional-grade layout composition with high precision.

**Open Weights Availability:** True to their roadmap, Alibaba has open-sourced the model **weights** under the Apache 2.0 license, making them available on Hugging Face and ModelScope for immediate local deployment.

[Source: Qwen Blog](https://qwen.ai/blog?id=qwen-image-2512)
[Source: Hugging Face Repository](https://huggingface.co/unsloth/Qwen-Image-2512-GGUF)

Open Reddit thread

r/aiwars 53 upvotes 124 comments September 2, 2025

Is this art? Am I an AI artist? Why or Why not? (Used Imagen 4 ultra)

Open Reddit thread

r/GeminiAI 98 upvotes 41 comments March 21, 2026

Flux.2 Klein 9B vs Imagen 4 Ultra vs Nano Banana Pro vs Nano Banana 2

Open Reddit thread

r/Bard 88 upvotes 31 comments March 21, 2026

Flux.2 Klein 9B vs Imagen 4 Ultra vs Nano Banana Pro vs Nano Banana 2

Open Reddit thread

r/StableDiffusion 47 upvotes 25 comments May 18, 2026

Trying to distill the soon-to-be-sunset Imagen 4 to a LoRA for Illustrious 2.0 but the result is a bit wonky, would appreciate some pointers

Google's Imagen 4 and Imagen 4 Ultra are being sunset on June 30 but are essentially the only models out there that can reliably output a convincing 1990s "Disney renaissance" look, with the blurry-edge shading that defines the [CAPS](https://en.wikipedia.org/wiki/Computer_Animation_Production_System)\-style of that era. So I'm trying to distill it into something that can be used until I come across another model that can do this.

I've made my first Illustrious 2.0 LoRA (through TensorArt because my graphics card is busted and I already had an account with them since before they started censoring everything) with a purely Imagen 4-generated 100 image dataset of 16:9, 1408x768 graphics. I did Repeat 3 / Epoch 10 = 2910 steps. Auto-labelled with "wd-v1-4-vit-tagger-v2". And the resulting images absolutely do capture the style, but... the result is a little wonky, it's got random artifacts, often shitty lines, weird eyes, IDK, the way AI gen looked like 2 years ago? Back when "AI slop" didn't mean it looked too polished, but that it actually looked sloppy?

It'd be easy to just jump back in and add more images, do more steps, but I've already wasted nearly $10 so I'd be so thankful if somebody with more experience could hint what I might be doing wrong. Should I use Imagen 4 ultra images for training instead? They tend to be a little sharper and I can get at 2x the resolution, though they cost $0.06 per image. Or should I try and automate some de-noising or upscaling or sharpening of the training set I already have? Or is like... my LoRA essentially fine and what is vexing me is just the limitations of using an older local model like Illustrious 2.0?

Edit: also tried doing a Qwen Image Edit 2511 LoRA (through FAL's trainer) that would just change the character but the results were not great there either)

EDIT2: After a lot of back and forth I realized what's bothering me is probably just that Illustrious is a very out of date model that's pretty far behind the curve. I re-evaluaed my Qwen Image Edit 2511 LoRA and while it does also edit the background (despite me not touching the backgrounds at all in the pairs!) it's actually really good for getting the character design right, so I guess I'll just fix the backgrounds manually instead.

Open Reddit thread

View more discussions →

FAQ

Common questions about Imagen 4 Ultra

What is the context window for Imagen 4 Ultra?

Imagen 4 Ultra supports a context window of 10,000 tokens, which applies to the text prompt input describing the desired image.

Is Imagen 4 Ultra licensed for commercial use?

Yes, Imagen 4 Ultra is licensed for commercial applications, making it suitable for businesses and creative professionals producing commercial content.

When was Imagen 4 Ultra trained?

According to the model metadata, Imagen 4 Ultra has a training data cutoff of early 2025.

Where can I find pricing information for Imagen 4 Ultra?

Pricing for Imagen 4 Ultra via the Google Gemini API is listed on the Google Gemini API pricing page at ai.google.dev/gemini-api/docs/pricing#imagen.

What input types does Imagen 4 Ultra accept?

Imagen 4 Ultra accepts image URL arrays and select-type inputs, in addition to text prompts, allowing users to provide reference images and specify style options alongside their descriptions.

More models from Google

Continue browsing adjacent models from the same provider.

← All AI Models