Gemini 3.1 Flash Image vs Gemini 1.5 Flash Deprecated
Compare Gemini 3.1 Flash Image and Gemini 1.5 Flash Deprecated across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for reasoning-heavy tasks versus general-purpose AI workloads.
Overview Comparison
Structured side-by-side differences for the highest-signal model metadata.
Provider
The entity that currently provides this model.
Model ID
The routed model identifier exposed by upstream providers.
Input Context Window
The number of tokens supported by the input context window.
Maximum Output Tokens
The number of tokens that can be generated by the model in a single request.
Open Source
Whether the model's code is available for public use.
Release Date
When the model was first released.
Knowledge Cut-off Date
When the model's knowledge was last updated.
API Providers
The providers that currently expose the model through an API.
Modalities
Types of data each model can process or return.
Pricing Comparison
Compare current token pricing before you choose the cheaper or more scalable API option.
Capabilities Comparison
See where each model overlaps, where they differ, and which one supports more of the features you care about.
What Reddit discussions say about Gemini 3.1 Flash Image vs Gemini 1.5 Flash Deprecated
Gemini 3.1 Flash Image and Gemini 1.5 Flash Deprecated are both surfacing live Reddit discussions, giving this comparison a community layer beyond specs and benchmarks.
The most visible threads right now are clustered in r/GeminiAI, r/Bard, r/promptingmagic.
**TLDR - Check out the attached presentation!**
Google just dropped Nano Banana 2 and it is the best AI image model in the world right now. It generates images from 512px to native 4K, supports 14 aspect ratios including ultra-wide 21:9 and vertical 9:16, renders legible text in any language inside images, maintains character consistency across up to 5 characters, pulls live data from Google Search to create accurate infographics, and works everywhere including Gemini, Google AI Studio, Google Flow at zero credits, Google Ads, Vertex AI, Pomelli, NotebookLM, and through third-party apps like Adobe Firefly, Perplexity, Figma, Notion, and Gamma. This post covers 160 use cases, 500 prompts, structured prompting secrets, and every platform where you can access it. It is free for consumer users.
**WHAT IS NANO BANANA 2?**
Nano Banana 2 is technically Gemini 3.1 Flash Image Preview. It is the third model in the Nano Banana family, following the original Nano Banana from August 2025 and Nano Banana Pro from November 2025. It runs on the Gemini 3.1 Flash reasoning backbone, which means it thinks before it renders. It plans the composition, resolves physics and spatial relationships, reasons about object interactions, and then produces pixels.
On February 26, 2026, it launched and immediately took the number one spot on the Artificial Analysis Image Arena, a blind human evaluation leaderboard, at roughly half the API cost of every comparable model. It is not a minor upgrade. It is a full architectural leap that collapses the gap between Pro-quality output and Flash-tier speed and pricing.
THE 6 CORE CAPABILITIES THAT MAKE IT DIFFERENT
1. It plans the image before rendering pixels. Nano Banana 2 uses a reasoning engine that understands physics, object interactions, geography, coordinates, diagrams, structure, and spelling. It generates interim thought images in the background to refine composition before producing the final output.
2. Real-time web and image search grounding. It can pull live data from Google Search and Google Image Search to create infographics, data visualizations, weather charts, and accurate depictions of real-world subjects. This is exclusive to Nano Banana 2 and not available in Nano Banana Pro.
3. Precision text rendering and translation. It spells correctly inside images. It renders legible, stylized text for marketing mockups, greeting cards, infographics, and posters. It can also translate embedded text from one language to another without altering the surrounding visual composition.
4. Character consistency across up to 5 characters. It maintains resemblance for up to 4 characters and fidelity for up to 10 objects in a single workflow, totaling 14 reference images. This enables storyboarding, product catalogs, and brand asset workflows where characters must look the same across dozens of images.
5. Native 512px to 4K resolution with 14 aspect ratios. Supported ratios include 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, and 8:1.
6. Flash-tier speed at production-ready quality. Vibrant lighting, richer textures, sharper details. Standard resolution images generate in under two seconds. The API costs approximately $0.067 per 2K image versus $0.134 for Nano Banana Pro.
THE STRUCTURED PROMPTING FRAMEWORK
This is the single most important section in this guide. Nano Banana 2 responds dramatically better when you structure your prompt using this pattern.
The formula: Subject -- What is the main focus of the image Composition -- Camera angle, framing, distance, layout Action -- What is happening in the scene Location -- Where the scene takes place Style -- Visual style, film stock, rendering approach, color palette Editing instructions -- When editing an existing image, what to change and what to preserve
Pro tips that separate beginners from experts:
* Write full sentences, not comma-separated keyword tags. Nano Banana 2 is a language model that generates images. Talk to it like a creative director briefing a photographer.
* Name the camera. Saying shot on Hasselblad X2D 135mm at f/5.6 gives radically different results than just saying portrait.
* Direct the light. Specify soft key light from upper left or golden hour backlight through floor-to-ceiling windows.
* Provide the why. Telling it the image is for a luxury perfume launch campaign changes the output mood and quality.
* Use the text distance rule. When adding text to images, specify the exact words, the font style, and the placement relative to other elements.
* Specify resolution and aspect ratio explicitly. Say 4K output, 16:9 aspect ratio at the end of your prompt.
HOW TO CREATE IMAGES AT DIFFERENT ASPECT RATIOS
Nano Banana 2 supports the widest range of aspect ratios of any major image model.
|Aspect Ratio|Best For|
|:-|:-|
|1:1|Instagram feed posts, profile icons, social cards|
|16:9|YouTube thumbnails, presentations, web banners|
|9:16|TikTok, Instagram Reels, Stories, mobile wallpapers|
|21:9|Cinematic concepts, panoramic images, ultrawide banners|
|3:2|Standard photography, print media|
|4:3|Web UI design, classic digital art, presentations|
|4:5|Instagram portrait feed, professional portraits|
|2:3|Phone wallpapers, book covers, magazine pages|
|1:4|Tall infographics, vertical banners|
|4:1|Website headers, horizontal banners|
|1:8|Extreme vertical content, scrolling social infographics|
|8:1|Extreme horizontal banners, ticker-style content|
In the Gemini app: Simply state the aspect ratio in your prompt. Say create this as a 16:9 widescreen image or make it 9:16 vertical for Instagram Stories.
In Google AI Studio: Select the aspect ratio from the dropdown in the right panel. You get all 14 options plus resolution control from 512px to 4K.
In the API: Set the aspect\_ratio and image\_size parameters in the ImageConfig object. Aspect ratio accepts strings like 16:9 and resolution accepts 512px, 1K, 2K, or 4K.
WHERE TO ACCESS NANO BANANA 2 -- EVERY PLATFORM
The Gemini App (Free) Nano Banana 2 is the default model for all users across Fast, Thinking, and Pro modes. Click the banana icon or just ask Gemini to create an image.
Google AI Studio (Free with API Key) Navigate to [aistudio.google.com](http://aistudio.google.com), select gemini-3.1-flash-image-preview from the model dropdown. Here you get full control over aspect ratio, resolution, thinking mode, and search grounding. This is where power users go when the Gemini app is not enough.
Google Flow (Free, Zero Credits) Google Flow is Google's AI filmmaking tool. Nano Banana 2 is the default image generation engine. It costs zero credits for all users. You can select the aspect ratio, choose how many images to generate in a batch (up to 4 at a time with specified resolution), and enter your prompt. This is the best-kept secret for batch generation without burning credits.
Pomelli (Free) Pomelli is Google Labs' free marketing tool for small and medium businesses. The new Photoshoot feature lets you upload any product photo and it generates professional studio-quality product shots in multiple templates: Studio, Floating, Ingredient, In Use with AI-generated models, and Lifestyle scenes.
NotebookLM (Free) Upload your source documents and click Create Slides or Create Infographic. NotebookLM uses Nano Banana to convert your content into visually stunning slide decks or single-page infographics. You can export directly to Google Slides for editing.
Google Ads (Free within Ads) Nano Banana 2 now powers the AI-generated creative suggestions when building campaigns. Performance marketers get higher-quality asset suggestions natively inside the campaign builder.
Third-Party Apps Confirmed third-party integrations include:
* Adobe Firefly: Integrated into the creative suite for image generation and editing.
* Perplexity: Uses Nano Banana 2 for image generation within research and browsing workflows.
* Figma: Tested for iterative design workflows and UI mockups.
* Notion: Integrated for in-document image generation.
* Gamma: Integrated into Studio Mode for generating theme-matched presentation images.
* Whering: Transforms clothing photos into studio-quality product imagery.
* WPP / Unilever: Used for enterprise-scale campaign testing.
HOW TO MAINTAIN CHARACTER CONSISTENCY ACROSS 5 CHARACTERS
This is the workflow that actually works:
Step 1: Create strong character reference sheets. Start with a clear, well-lit headshot or full-body photo for each character. Step 2: Upload reference images. In AI Studio or the API, you can upload up to 14 reference images total (up to 4 character images and up to 10 object images). Step 3: Describe each character consistently. Use the same physical description across every prompt in the workflow. Step 4: Use the multi-image prompt structure. Upload all character reference images alongside your scene description. Step 5: For video workflows, generate character reference sheets showing multiple angles of each character (front, left profile, right profile, etc.) to maintain 100 percent facial accuracy.
TOP 20 USE CASES
1. Live Data Infographics: Use search grounding to create charts based on real-time data.
2. Global Campaign Localization: Update backgrounds, language, and cultural cues for billboards from a single base creative.
3. Physics-Aware Virtual Try-On: Fabric drapes realistically on body models for fashion mockups.
4. Architectural Time Travel: Restore modern streets to their Victorian 1890s counterparts.
5. Text-Heavy Social Media Posts: Quote cards and posters with strong styled typography.
6. Product Photography at Scale: Professional shots from minimal product photos using Pomelli.
7. LinkedIn Professional Headshots: Transform selfies into studio-quality corporate photos.
8. 4K Image Upscaling: Regenerate low-res images into 4K resolution for free.
9. Old Photo Restoration: Restore damaged or faded memories with colorization and feature repair.
10. Action Figures and Collectibles: Turn likenesses into custom branded figurines.
11. Room Design and Floor Plans: Move from 2D floor plans to photorealistic 3D presentation boards.
12. YouTube Thumbnails: High-converting widescreen graphics with expressive subjects and bold text.
13. E-Commerce Catalog Generation: Maintain product fidelity across seasonal themes using reference images.
14. Brand Identity Kits: Complete brand boards including logos, palettes, and typography.
15. Multi-Panel Storytelling: Maintain visual identity across comic strips and storyboards.
16. Data Visualization from Articles: Paste a link to generate a custom infographic from the content.
17. Blurred Photo to Ultra Sharp: Editorial-quality restoration while preserving original composition.
18. Style Transfer: Swap image styles to watercolor, 3D render, anime, or pencil sketches.
19. Whiteboard and Sketch Visualization: Turn concepts into hand-drawn marker sketches.
20. Celebrity Selfies and Fun Photos: Photorealistic selfies in movie sets or absurd landmarks.
SECRETS MOST PEOPLE MISS
1. The Thinking Mode toggle changes everything. Enable it in AI Studio for complex layouts; it plans before rendering.
2. Image Search Grounding is exclusive to Nano Banana 2. It searches for visual references (buildings, specific products) before generating.
3. Multi-turn editing is the recommended workflow. Refine your image in follow-up messages rather than one massive prompt.
4. The 512px tier exists for rapid prototyping. Use it to find the best composition at low cost before upscaling to 4K.
5. You can generate up to 20 images in a single batch prompt through the API.
6. Flow generates at zero credits. It is the best hack for unlimited batch generation without a subscription.
7. You can use it as a real-time photo editor. Upload a photo and give natural language instructions to remove objects or change colors.
THE PROMPT LIBRARY -- 50 EPIC PROMPTS
**Professional and Business**
1. LinkedIn Headshot: Transform this selfie into a professional studio headshot. Clean neutral background, soft directional light, sharp focus on eyes, charcoal blazer. 4:5, 4K.
2. Infographic from Live Data: Search top 5 programming languages 2026. Create a 9:16 vertical infographic, flat vector style, icons, percentages, average salary.
3. Product Hero Shot: Matte-black wireless headphone on polished obsidian. 85mm macro, soft key light, reflection. 16:9, 4K.
4. SaaS Landing Page Hero: Landing page for FlowState tool. Headline on left, dashboard screenshot on right, two CTA buttons. 16:9, 2K.
5. Business Card Suite: Embossed matte cards, letterhead, wax stamp envelope on slate. Editorial flat lay. 3:2, 4K.
6. Social Media Content Calendar: 9:16 infographic showing 7-day blueprint for fitness brand. Icons for Reels and Stories.
7. Email Marketing Banner: 4:1 horizontal banner, field of wildflowers, text Spring Collection Now Live.
8. Pitch Deck Slide: Single slide, navy background, headline 3x Revenue Growth in Q4, teal line chart on right.
9. Executive Summary Dashboard: 16:9 infographic showing global sales metrics, heat map on left, key KPI cards on right.
10. Startup Team Mockup: Group of diverse professionals in a glass-walled conference room, futuristic Shinjuku city visible outside.
**Photography and Portraits**
11. Editorial Fashion: Model in vibrant red dress standing in desert, high contrast, blue sky, 35mm film grain.
12. Candid Street: Busy market in Marrakech, warm tones, natural lighting, shallow depth of field.
13. Macro Human Eye: Reflecting a city skyline, hyper-realistic, 8k textures.
14. Black and White Artist: Elderly artist in sunlit studio, high detail on skin and paint textures.
15. Gourmet Food Photography: Burger with steam rising, rustic wood background, professional lighting.
16. Cinematic Hiker: Wide shot on mountain peak at dawn, orange and purple sky, majestic mood.
17. Underwater Fashion: Model in silk dress, ethereal lighting, bubbles, fluid motion.
18. Brutalist Architecture: Concrete building shot from low angle, sharp shadows, dramatic sky.
19. Vintage 1970s Polaroid: Family picnic, faded colors, light leaks, nostalgic feel.
20. Cyberpunk Portrait: Close up of subject with neon light reflections on glasses, rainy city background.
**Architecture and Design**
21. 2D Floor Plan: Modern 2-bedroom apartment, labeled rooms, clean linework.
22. 3D Interior Render: Mid-century modern living room, forest view through large windows.
23. Victorian Street: London street corner, horse-drawn carriages, foggy atmosphere, daytime.
24. Futuristic City Plan: Vertical gardens, floating transport pods, top-down view.
25. Cozy Cabin: Stone fireplace, warm light, snow falling outside window.
26. Glass Beach House: Sunset view, ocean reflections on windows, minimalist decor.
27. Office Lobby: Living moss wall, minimalist furniture, bright natural light.
28. Steampunk Library: Brass pipes, glowing green lamps, infinite shelves.
29. Industrial Loft: Exposed brick, large windows, cinematic moody lighting.
30. Zen Garden: Stone path, koi pond, peaceful atmosphere, high detail.
Creative and Wild
31. Custom Action Figure: Hyper-detailed 1/6 scale figure of person from photo in premium collector box.
32. Whiteboard Sketch to 3D: Hand-drawn rocket engine sketch turned into photorealistic 3D blueprint.
33. Origami Dragon: Made of fire, dark background, glowing embers.
34. Autumn Leaf Person: Character made of leaves walking through city park.
35. Cloud Astronaut: Sitting on a cloud fishing for stars in purple galaxy.
36. Chess Cat: Cat in tuxedo playing chess against robot in Victorian study.
37. Surrealist Strawberry: Melting clock over a giant realistic strawberry.
38. Cyberpunk Tea Ceremony: Traditional Japanese tea ritual in neon-lit futuristic room.
39. Glass Piano Reef: Transparent piano filled with tropical fish and coral.
40. Heart Island: Floating island in shape of heart with waterfalls into clouds.
**Restoration and Editing**
41. Wedding Photo Restore: Turn blurred wedding photo into ultra-sharp editorial shot.
42. 4K Upscale: Take low-res 1990s photo and regenerate at 4K resolution.
43. Color Swap: Change car in image to electric blue with matte finish.
44. Background Replace: Move portrait subject to luxury hotel balcony overlooking Eiffel Tower.
45. People Removal: Remove background crowds from beach photo and extend sand.
46. Professional Lighting: Add studio lighting setup to dark selfie, preserve identity.
47. Watercolor Dog: Turn dog photo into artistic watercolor painting style.
48. 1890s Street Edit: Replace cars in modern photo with carriages and Victorian signs.
49. 3D Animation Style: Change style of photo to Pixar-tier 3D animation.
50. Old Memory Repair: Colorize faded black and white photo, fix scratches and tears.
Bonus Fun:
1. Toast Bread Infographic: How to toast bread, make it wacky and over the top with Rube Goldberg machines and scientific data.
2. Banana Runway: High-fashion show where models are giant realistic bananas wearing Gucci, background motion blur.
3. Jellyfish Concert: Underwater heavy metal concert with instruments made of glowing jellyfish, shark lead singer.
4. Pumpkin Penthouse: Luxury penthouse inside a giant hollowed-out pumpkin, autumn aesthetic.
5. Kitchen Time Machine: Blueprint of time machine made of kitchen appliances and duct tape with nonsensical terms.
Pro Tips for Nano Banana 2
* Use the Text Distance Rule: Specify exact words and placement relative to objects for clean layouts.
* Reference Images: Use up to 14 reference images (4 for characters, 10 for objects) to maintain consistency.
* Thinking Model: Toggle on for infographics or complex diagrams to ensure logical planning before pixels render.
I will post links to the complete library of prompts and use cases in the comments.
Get the full 500 prompt image library free with just one click at [PromptMagic.dev](http://PromptMagic.dev)
# 80,000 NOK ($7,500) drained from my Google Cloud account in 5 minutes — full forensic breakdown of how the attack worked
I want to write this up while it's fresh, because the *mechanism* of the attack is more interesting than the "I leaked a key, oops" headline — and the platform design that allowed it is something every Google Cloud user should know about.
# What happened
* May 8, 2026, evening (CET): I get a billing alert email saying I owe NOK 82,305.36 (\~$7,500 USD) on my Google Cloud account.
* My typical monthly spend: \~100 NOK ($10).
* The spike happened in roughly 5 minutes.
* All charges were on the Gemini API in a single project I'd barely touched (an old "no-code maps" project from 2017).
* An API key from that project was leaked somewhere — I'm still hunting where. Most likely an old GitHub repo or a public webpage from 2018-ish that had Gemini API enabled on its project years later (I think this is what made it exploitable — the key sat dormant, but the moment Gemini got enabled on its project, the dormant key became a Gemini-capable wallet).
# What the attacker actually did (the part nobody talks about)
I pulled the SKU-level breakdown from Billing → Reports. The attacker didn't just hit one model. They ran an automated framework that fanned out across every Gemini variant simultaneously:
* Gemini 3 Pro (text + image generation)
* Gemini 3 Flash
* Gemini 3.1 Flash Image
* Gemini 3.1 Flash Lite Preview
* Gemini 2.5 Pro (text + TTS)
* Gemini 2.5 Flash (short + long context, multimodal)
* Gemini 2.5 Flash Lite
* Gemini 2.0 Flash TTS
* Gemini Embedding-2 + Embedding-001
15+ distinct models in 5 minutes. No human application uses 15 models in parallel. This is the signature of an automated abuse framework, almost certainly a credential-resale operation.
Token volumes:
* 1.09 BILLION input tokens on Gemini 2.5 Flash Lite alone
* 402M image input tokens on Gemini 3 Pro
* 226M text input tokens on Gemini 3 Pro
* 19.4M image output tokens on Gemini 3 Pro Image — kr 21,674 ($2,000) on this single SKU, the most expensive line item
The attacker prioritized image generation because that's where the real money is — image output tokens are 50–100x more expensive than text.
# How they bypassed rate limits (this is the architectural problem)
You'd think rate limits would protect you. They don't — at least not on Google Cloud:
* Gemini 3 Pro: 1,000 RPM
* Gemini 3 Flash: 2,000 RPM
* Gemini 2.5 Flash Lite: 4,000 RPM
* (etc., for every model — *each with its own independent quota*)
There is no per-key aggregate cap across models. If you fan out across 15 models concurrently, you cap at the *sum* — easily 30,000+ RPM combined.
OpenAI, Anthropic, and Mistral all have per-key aggregate caps. Google does not. This is not a policy oversight — it's the core mechanism that makes a single compromised key a 5-minute, 5-figure liability.
Also: Google Cloud does not offer a hard spending cap. No "stop all spend at $X" option. The closest is a budget alert that *emails you* (after the fact), or — and this is the documented "solution" — you can write your own Cloud Function that listens to budget Pub/Sub events and programmatically disables your billing account. Yes, Google's official answer to "how do I stop runaway spending" is "deploy code on the same platform that's billing you." This has been a known gripe for years.
# What logging gave me — almost nothing
I tried every audit log query:
* `protoPayload.serviceName="generativelanguage.googleapis.com"` → empty
* `resource.type="consumed_api"` for the project → empty
* Vertex AI logs → empty
Google does not log per-request data for Gemini API key calls. No caller IP, no user-agent, no request size. The only forensic record that exists is the SKU-level billing report — and that only goes down to "model + token type", not session/request/key.
So I can't tell you who did it, where they were, or what they generated. I just know it was 15 models in parallel and 19M image output tokens.
# What I did in the first 90 minutes
* Deleted all 13 API keys on the affected project (after seeing the alert at \~01:25)
* Disabled [`generativelanguage.googleapis.com`](http://generativelanguage.googleapis.com) and [`aiplatform.googleapis.com`](http://aiplatform.googleapis.com) on every one of my 25+ projects (script via `gcloud services disable`)
* Closed all 3 billing accounts
* Called my bank, blocked the Visa
* Got into Google's billing chat queue, escalated to specialist team within 5 messages
* Case 71021804 opened, 24-48h response window
* Pulled SKU-level forensic evidence
The chat agent confirmed end-of-month billing cycle, so the actual charge attempt won't fire until \~May 28-31. By then either the specialist team has waived it, or the card-block + chargeback dispute kicks in.
# What I'm pretty sure happens next
* \~85% chance: specialist team waives the charge under the compromised-credentials policy. Google has standardized this for exactly this scenario because they know the rate-limit architecture allows it.
* \~10% chance: partial waiver / settlement.
* \~5% chance: they refuse, my bank chargeback wins it under Norwegian Finansavtaleloven (450 NOK max liability for unauthorized card use).
I'm not actually going to pay 80k. The realistic worst case is several months of paperwork.
# Lessons / PSA for everyone running Google Cloud
1. Restrict every API key at creation time. Application restriction (HTTP referrer or IP allowlist) + API restriction (only the APIs you use). An unrestricted key on a project where Gemini happens to be enabled is a wallet.
2. Audit every project for keys you've forgotten about. I had keys from 2017, 2020, 2021 — most predating Gemini's existence. The moment Gemini got enabled on those old projects, the old keys could call it.
3. Disable APIs you don't actively use. Per-project. An enabled API + an unrestricted key = exposure.
4. Set up a budget-disables-billing Cloud Function. The auto-shutdown one. Yes it's stupid that Google makes you write code for this, but it's the only real circuit breaker.
5. Don't trust rate limits. They protect Google's infrastructure, not your wallet. Per-model RPM × N models = no real cap.
6. Don't store API keys in client-side code, ever. Even if you think a project is dead.
# Where the leak came from
Honestly, I don't know yet. The project was created in 2017 (back when Google appended a numeric suffix like `-364317` to project IDs). It had 13 keys accumulated over years. One of them is somewhere out in the wild. I'll be searching GitHub history, old Vercel deployments, Wayback Machine, and screenshots over the coming days. If I find it I'll edit this post.
If anyone has run into the same multi-model abuse pattern recently, I'd love to hear about it — particularly if you have any signals on which credential-resale operations are currently active.
Edit: Will update with specialist team's response when it arrives in 24-48h.
https://x.com/artificialanlys/status/2027052241019175148?s=46
**TLDR - Check out the attached presentation!**
Google just dropped Nano Banana 2 and it is the best AI image model in the world right now. It generates images from 512px to native 4K, supports 14 aspect ratios including ultra-wide 21:9 and vertical 9:16, renders legible text in any language inside images, maintains character consistency across up to 5 characters, pulls live data from Google Search to create accurate infographics, and works everywhere including Gemini, Google AI Studio, Google Flow at zero credits, Google Ads, Vertex AI, Pomelli, NotebookLM, and through third-party apps like Adobe Firefly, Perplexity, Figma, Notion, and Gamma. This post covers 160 use cases, 500 prompts, structured prompting secrets, and every platform where you can access it. It is free for consumer users.
**WHAT IS NANO BANANA 2?**
Nano Banana 2 is technically Gemini 3.1 Flash Image Preview. It is the third model in the Nano Banana family, following the original Nano Banana from August 2025 and Nano Banana Pro from November 2025. It runs on the Gemini 3.1 Flash reasoning backbone, which means it thinks before it renders. It plans the composition, resolves physics and spatial relationships, reasons about object interactions, and then produces pixels.
On February 26, 2026, it launched and immediately took the number one spot on the Artificial Analysis Image Arena, a blind human evaluation leaderboard, at roughly half the API cost of every comparable model. It is not a minor upgrade. It is a full architectural leap that collapses the gap between Pro-quality output and Flash-tier speed and pricing.
THE 6 CORE CAPABILITIES THAT MAKE IT DIFFERENT
1. It plans the image before rendering pixels. Nano Banana 2 uses a reasoning engine that understands physics, object interactions, geography, coordinates, diagrams, structure, and spelling. It generates interim thought images in the background to refine composition before producing the final output.
2. Real-time web and image search grounding. It can pull live data from Google Search and Google Image Search to create infographics, data visualizations, weather charts, and accurate depictions of real-world subjects. This is exclusive to Nano Banana 2 and not available in Nano Banana Pro.
3. Precision text rendering and translation. It spells correctly inside images. It renders legible, stylized text for marketing mockups, greeting cards, infographics, and posters. It can also translate embedded text from one language to another without altering the surrounding visual composition.
4. Character consistency across up to 5 characters. It maintains resemblance for up to 4 characters and fidelity for up to 10 objects in a single workflow, totaling 14 reference images. This enables storyboarding, product catalogs, and brand asset workflows where characters must look the same across dozens of images.
5. Native 512px to 4K resolution with 14 aspect ratios. Supported ratios include 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, and 8:1.
6. Flash-tier speed at production-ready quality. Vibrant lighting, richer textures, sharper details. Standard resolution images generate in under two seconds. The API costs approximately $0.067 per 2K image versus $0.134 for Nano Banana Pro.
THE STRUCTURED PROMPTING FRAMEWORK
This is the single most important section in this guide. Nano Banana 2 responds dramatically better when you structure your prompt using this pattern.
The formula: Subject -- What is the main focus of the image Composition -- Camera angle, framing, distance, layout Action -- What is happening in the scene Location -- Where the scene takes place Style -- Visual style, film stock, rendering approach, color palette Editing instructions -- When editing an existing image, what to change and what to preserve
Pro tips that separate beginners from experts:
* Write full sentences, not comma-separated keyword tags. Nano Banana 2 is a language model that generates images. Talk to it like a creative director briefing a photographer.
* Name the camera. Saying shot on Hasselblad X2D 135mm at f/5.6 gives radically different results than just saying portrait.
* Direct the light. Specify soft key light from upper left or golden hour backlight through floor-to-ceiling windows.
* Provide the why. Telling it the image is for a luxury perfume launch campaign changes the output mood and quality.
* Use the text distance rule. When adding text to images, specify the exact words, the font style, and the placement relative to other elements.
* Specify resolution and aspect ratio explicitly. Say 4K output, 16:9 aspect ratio at the end of your prompt.
HOW TO CREATE IMAGES AT DIFFERENT ASPECT RATIOS
Nano Banana 2 supports the widest range of aspect ratios of any major image model.
|Aspect Ratio|Best For|
|:-|:-|
|1:1|Instagram feed posts, profile icons, social cards|
|16:9|YouTube thumbnails, presentations, web banners|
|9:16|TikTok, Instagram Reels, Stories, mobile wallpapers|
|21:9|Cinematic concepts, panoramic images, ultrawide banners|
|3:2|Standard photography, print media|
|4:3|Web UI design, classic digital art, presentations|
|4:5|Instagram portrait feed, professional portraits|
|2:3|Phone wallpapers, book covers, magazine pages|
|1:4|Tall infographics, vertical banners|
|4:1|Website headers, horizontal banners|
|1:8|Extreme vertical content, scrolling social infographics|
|8:1|Extreme horizontal banners, ticker-style content|
In the Gemini app: Simply state the aspect ratio in your prompt. Say create this as a 16:9 widescreen image or make it 9:16 vertical for Instagram Stories.
In Google AI Studio: Select the aspect ratio from the dropdown in the right panel. You get all 14 options plus resolution control from 512px to 4K.
In the API: Set the aspect\_ratio and image\_size parameters in the ImageConfig object. Aspect ratio accepts strings like 16:9 and resolution accepts 512px, 1K, 2K, or 4K.
WHERE TO ACCESS NANO BANANA 2 -- EVERY PLATFORM
The Gemini App (Free) Nano Banana 2 is the default model for all users across Fast, Thinking, and Pro modes. Click the banana icon or just ask Gemini to create an image.
Google AI Studio (Free with API Key) Navigate to [aistudio.google.com](http://aistudio.google.com), select gemini-3.1-flash-image-preview from the model dropdown. Here you get full control over aspect ratio, resolution, thinking mode, and search grounding. This is where power users go when the Gemini app is not enough.
Google Flow (Free, Zero Credits) Google Flow is Google's AI filmmaking tool. Nano Banana 2 is the default image generation engine. It costs zero credits for all users. You can select the aspect ratio, choose how many images to generate in a batch (up to 4 at a time with specified resolution), and enter your prompt. This is the best-kept secret for batch generation without burning credits.
Pomelli (Free) Pomelli is Google Labs' free marketing tool for small and medium businesses. The new Photoshoot feature lets you upload any product photo and it generates professional studio-quality product shots in multiple templates: Studio, Floating, Ingredient, In Use with AI-generated models, and Lifestyle scenes.
NotebookLM (Free) Upload your source documents and click Create Slides or Create Infographic. NotebookLM uses Nano Banana to convert your content into visually stunning slide decks or single-page infographics. You can export directly to Google Slides for editing.
Google Ads (Free within Ads) Nano Banana 2 now powers the AI-generated creative suggestions when building campaigns. Performance marketers get higher-quality asset suggestions natively inside the campaign builder.
Third-Party Apps Confirmed third-party integrations include:
* Adobe Firefly: Integrated into the creative suite for image generation and editing.
* Perplexity: Uses Nano Banana 2 for image generation within research and browsing workflows.
* Figma: Tested for iterative design workflows and UI mockups.
* Notion: Integrated for in-document image generation.
* Gamma: Integrated into Studio Mode for generating theme-matched presentation images.
* Whering: Transforms clothing photos into studio-quality product imagery.
* WPP / Unilever: Used for enterprise-scale campaign testing.
HOW TO MAINTAIN CHARACTER CONSISTENCY ACROSS 5 CHARACTERS
This is the workflow that actually works:
Step 1: Create strong character reference sheets. Start with a clear, well-lit headshot or full-body photo for each character. Step 2: Upload reference images. In AI Studio or the API, you can upload up to 14 reference images total (up to 4 character images and up to 10 object images). Step 3: Describe each character consistently. Use the same physical description across every prompt in the workflow. Step 4: Use the multi-image prompt structure. Upload all character reference images alongside your scene description. Step 5: For video workflows, generate character reference sheets showing multiple angles of each character (front, left profile, right profile, etc.) to maintain 100 percent facial accuracy.
TOP 20 USE CASES
1. Live Data Infographics: Use search grounding to create charts based on real-time data.
2. Global Campaign Localization: Update backgrounds, language, and cultural cues for billboards from a single base creative.
3. Physics-Aware Virtual Try-On: Fabric drapes realistically on body models for fashion mockups.
4. Architectural Time Travel: Restore modern streets to their Victorian 1890s counterparts.
5. Text-Heavy Social Media Posts: Quote cards and posters with strong styled typography.
6. Product Photography at Scale: Professional shots from minimal product photos using Pomelli.
7. LinkedIn Professional Headshots: Transform selfies into studio-quality corporate photos.
8. 4K Image Upscaling: Regenerate low-res images into 4K resolution for free.
9. Old Photo Restoration: Restore damaged or faded memories with colorization and feature repair.
10. Action Figures and Collectibles: Turn likenesses into custom branded figurines.
11. Room Design and Floor Plans: Move from 2D floor plans to photorealistic 3D presentation boards.
12. YouTube Thumbnails: High-converting widescreen graphics with expressive subjects and bold text.
13. E-Commerce Catalog Generation: Maintain product fidelity across seasonal themes using reference images.
14. Brand Identity Kits: Complete brand boards including logos, palettes, and typography.
15. Multi-Panel Storytelling: Maintain visual identity across comic strips and storyboards.
16. Data Visualization from Articles: Paste a link to generate a custom infographic from the content.
17. Blurred Photo to Ultra Sharp: Editorial-quality restoration while preserving original composition.
18. Style Transfer: Swap image styles to watercolor, 3D render, anime, or pencil sketches.
19. Whiteboard and Sketch Visualization: Turn concepts into hand-drawn marker sketches.
20. Celebrity Selfies and Fun Photos: Photorealistic selfies in movie sets or absurd landmarks.
SECRETS MOST PEOPLE MISS
1. The Thinking Mode toggle changes everything. Enable it in AI Studio for complex layouts; it plans before rendering.
2. Image Search Grounding is exclusive to Nano Banana 2. It searches for visual references (buildings, specific products) before generating.
3. Multi-turn editing is the recommended workflow. Refine your image in follow-up messages rather than one massive prompt.
4. The 512px tier exists for rapid prototyping. Use it to find the best composition at low cost before upscaling to 4K.
5. You can generate up to 20 images in a single batch prompt through the API.
6. Flow generates at zero credits. It is the best hack for unlimited batch generation without a subscription.
7. You can use it as a real-time photo editor. Upload a photo and give natural language instructions to remove objects or change colors.
THE PROMPT LIBRARY -- 50 EPIC PROMPTS
**Professional and Business**
1. LinkedIn Headshot: Transform this selfie into a professional studio headshot. Clean neutral background, soft directional light, sharp focus on eyes, charcoal blazer. 4:5, 4K.
2. Infographic from Live Data: Search top 5 programming languages 2026. Create a 9:16 vertical infographic, flat vector style, icons, percentages, average salary.
3. Product Hero Shot: Matte-black wireless headphone on polished obsidian. 85mm macro, soft key light, reflection. 16:9, 4K.
4. SaaS Landing Page Hero: Landing page for FlowState tool. Headline on left, dashboard screenshot on right, two CTA buttons. 16:9, 2K.
5. Business Card Suite: Embossed matte cards, letterhead, wax stamp envelope on slate. Editorial flat lay. 3:2, 4K.
6. Social Media Content Calendar: 9:16 infographic showing 7-day blueprint for fitness brand. Icons for Reels and Stories.
7. Email Marketing Banner: 4:1 horizontal banner, field of wildflowers, text Spring Collection Now Live.
8. Pitch Deck Slide: Single slide, navy background, headline 3x Revenue Growth in Q4, teal line chart on right.
9. Executive Summary Dashboard: 16:9 infographic showing global sales metrics, heat map on left, key KPI cards on right.
10. Startup Team Mockup: Group of diverse professionals in a glass-walled conference room, futuristic Shinjuku city visible outside.
**Photography and Portraits**
11. Editorial Fashion: Model in vibrant red dress standing in desert, high contrast, blue sky, 35mm film grain.
12. Candid Street: Busy market in Marrakech, warm tones, natural lighting, shallow depth of field.
13. Macro Human Eye: Reflecting a city skyline, hyper-realistic, 8k textures.
14. Black and White Artist: Elderly artist in sunlit studio, high detail on skin and paint textures.
15. Gourmet Food Photography: Burger with steam rising, rustic wood background, professional lighting.
16. Cinematic Hiker: Wide shot on mountain peak at dawn, orange and purple sky, majestic mood.
17. Underwater Fashion: Model in silk dress, ethereal lighting, bubbles, fluid motion.
18. Brutalist Architecture: Concrete building shot from low angle, sharp shadows, dramatic sky.
19. Vintage 1970s Polaroid: Family picnic, faded colors, light leaks, nostalgic feel.
20. Cyberpunk Portrait: Close up of subject with neon light reflections on glasses, rainy city background.
**Architecture and Design**
21. 2D Floor Plan: Modern 2-bedroom apartment, labeled rooms, clean linework.
22. 3D Interior Render: Mid-century modern living room, forest view through large windows.
23. Victorian Street: London street corner, horse-drawn carriages, foggy atmosphere, daytime.
24. Futuristic City Plan: Vertical gardens, floating transport pods, top-down view.
25. Cozy Cabin: Stone fireplace, warm light, snow falling outside window.
26. Glass Beach House: Sunset view, ocean reflections on windows, minimalist decor.
27. Office Lobby: Living moss wall, minimalist furniture, bright natural light.
28. Steampunk Library: Brass pipes, glowing green lamps, infinite shelves.
29. Industrial Loft: Exposed brick, large windows, cinematic moody lighting.
30. Zen Garden: Stone path, koi pond, peaceful atmosphere, high detail.
Creative and Wild
31. Custom Action Figure: Hyper-detailed 1/6 scale figure of person from photo in premium collector box.
32. Whiteboard Sketch to 3D: Hand-drawn rocket engine sketch turned into photorealistic 3D blueprint.
33. Origami Dragon: Made of fire, dark background, glowing embers.
34. Autumn Leaf Person: Character made of leaves walking through city park.
35. Cloud Astronaut: Sitting on a cloud fishing for stars in purple galaxy.
36. Chess Cat: Cat in tuxedo playing chess against robot in Victorian study.
37. Surrealist Strawberry: Melting clock over a giant realistic strawberry.
38. Cyberpunk Tea Ceremony: Traditional Japanese tea ritual in neon-lit futuristic room.
39. Glass Piano Reef: Transparent piano filled with tropical fish and coral.
40. Heart Island: Floating island in shape of heart with waterfalls into clouds.
**Restoration and Editing**
41. Wedding Photo Restore: Turn blurred wedding photo into ultra-sharp editorial shot.
42. 4K Upscale: Take low-res 1990s photo and regenerate at 4K resolution.
43. Color Swap: Change car in image to electric blue with matte finish.
44. Background Replace: Move portrait subject to luxury hotel balcony overlooking Eiffel Tower.
45. People Removal: Remove background crowds from beach photo and extend sand.
46. Professional Lighting: Add studio lighting setup to dark selfie, preserve identity.
47. Watercolor Dog: Turn dog photo into artistic watercolor painting style.
48. 1890s Street Edit: Replace cars in modern photo with carriages and Victorian signs.
49. 3D Animation Style: Change style of photo to Pixar-tier 3D animation.
50. Old Memory Repair: Colorize faded black and white photo, fix scratches and tears.
Bonus Fun:
1. Toast Bread Infographic: How to toast bread, make it wacky and over the top with Rube Goldberg machines and scientific data.
2. Banana Runway: High-fashion show where models are giant realistic bananas wearing Gucci, background motion blur.
3. Jellyfish Concert: Underwater heavy metal concert with instruments made of glowing jellyfish, shark lead singer.
4. Pumpkin Penthouse: Luxury penthouse inside a giant hollowed-out pumpkin, autumn aesthetic.
5. Kitchen Time Machine: Blueprint of time machine made of kitchen appliances and duct tape with nonsensical terms.
Pro Tips for Nano Banana 2
* Use the Text Distance Rule: Specify exact words and placement relative to objects for clean layouts.
* Reference Images: Use up to 14 reference images (4 for characters, 10 for objects) to maintain consistency.
* Thinking Model: Toggle on for infographics or complex diagrams to ensure logical planning before pixels render.
I will post links to the complete library of prompts and use cases in the comments.
Get the full 500 prompt image library free with just one click at [PromptMagic.dev](http://PromptMagic.dev)
Just got access to the new Nano Banana 2(Gemini 3.1 Flash Image) model and it's wild.
Here's a working curl request to test it yourself (just swap in your own Vertex API key):
curl "https://aiplatform.googleapis.com/v1/publishers/google/models/gemini-3.1-flash-image-preview:streamGenerateContent?key={API_KEY}" \
-X POST \
-H "Content-Type: application/json" \
-d '{
"contents": [
{
"role": "user",
"parts": [
{
"text": "Explain how AI works in a few words"
}
]
}
]
}'
You can also try it on Google gemini.
OpenClaw is now available in the UGOS Pro App Center as a one-click install on supported UGREEN NASync models. If you've been waiting for a local AI agent that lives on your NAS instead of your laptop, it's ready.
# Overview
**OpenClaw** is a personal AI assistant that runs locally on your device. It interacts with users through common communication channels. The UGREEN NAS version has been deeply customized to provide a "**one-click**" installation experience, enabling a fully no-code deployment workflow.
This guide walks you through installing OpenClaw on your NAS, completing the path and API configuration, and verifying your first connection.
**Note**: This application consumes significant system resources. Ensure your system has at least **2 GB** of available RAM. The installation process may take more than **10 minutes**, which is normal.
# Supported Models
The following NAS models currently support the OpenClaw application (more models are being added):
● DXP Series: DXP2800, DXP4800, DXP4800 Plus, DXP4800 Pro, DXP6800, DXP6800 Plus, DXP6800 Pro, DXP8800, DXP8800 Plus, DXP8800 Pro, DXP480T Plus
● iDX Series: iDX6011, iDX6011 Pro *(support ready; devices currently shipping to crowdfunding backers and not yet generally available)*
# Prerequisites
Before installing OpenClaw, go to the "**App Center**" on your UGREEN NAS and make sure that the **Docker** application is installed and updated to the latest version.
**Note**: OpenClaw relies heavily on the Docker container environment. If the Docker version is outdated, it may cause deployment failures, incorrect permission configurations, or runtime issues.
# Step 1: Install the Application
Log in to your UGREEN NAS, go to the "**App Center**", find **OpenClaw** under the "**All**" list, and click "**Install**".
https://preview.redd.it/pwm42yn1qi0h1.png?width=1112&format=png&auto=webp&s=cc2e4ddb3212210bce8c386140ffa27775ecff73
# Step 2: Installation and Configuration
When installing the OpenClaw application, you must first complete the system path and model interface settings on the "**OpenClaw installation configuration**" page.
# Path Configuration
This section defines the file access scope for OpenClaw. For data security, grant permissions carefully.
https://preview.redd.it/mlbrobc4qi0h1.png?width=591&format=png&auto=webp&s=8f78a6597688186f0f5ed436e2f23016c64a9b6b
* **Workspace path**: This is the default working directory for OpenClaw. It is used by AI to write code, generating files, and storing temporary data. It is recommended to assign an empty folder.
* **File access path**: Authorizes OpenClaw to read or modify existing files on the NAS (such as documents and videos). Multiple directories can be configured by clicking the "**Add**" button.
**Notes:**
* Do not select folders containing private or sensitive data. Deleting, modifying, or moving authorized folders may cause application malfunctions.
* The file access path must not overlap with the "**Workspace path**" or any of its subdirectories.
# API and Model Configuration
This section is used to connect to external large language model (LLM) services.
* **API Base URL**: Enter the interface address provided by the LLM service provider (e.g., `https://generativelanguage.googleapis.com/v1/models/gemini-3.1-flash-lite-preview:generateContent`).
* **Model name**: Enter the specific model version to be invoked (e.g., `gemini-3.1-flash-lite-preview`).
* **API key**: Enter the dedicated access key obtained from the model provider.
* **Gateway token**: This token is required for authentication when accessing the OpenClaw Web interface via a browser.
https://preview.redd.it/zt6y5kk7qi0h1.png?width=597&format=png&auto=webp&s=6293e7c86aa56b6f8004f9517c6688a9645bf738
# Complete the Installation
After confirming that the above configurations are correct and fully understanding the associated risks, select "**I have read and understand and the installation risks**" in the lower-left corner of the page. Then click "**Install**" in the lower-right corner to begin deploying the application.
https://preview.redd.it/hxyc9s79qi0h1.png?width=598&format=png&auto=webp&s=660eceda1aedc64ae59e6cf95e7bbea6a82433cc
# Risks and Precautions
OpenClaw is a highly privileged automation agent. Before installation, please carefully review the following key risks:
1. By default, the application runs within a Docker container with root privileges, allowing it to read and write files and execute system commands. It is strongly recommended not to expose this application to the public internet.
2. The current version does not provide secure multi-user isolation. If multiple users share the same application instance, they will share all tool permissions and data.
3. This open-source project is currently in the testing stage. Any legal, financial, or data-related risks arising from AI-driven automation—such as accidental file deletion or data leakage—shall be borne solely by the user.
# Usage and Connection
After installation, you need to verify the connection through the Web interface. The frontend console must connect to the OpenClaw backend gateway.
# Step 1: Access the Web Management Interface
1. On the UGREEN NAS desktop, locate the OpenClaw app and click its shortcut icon.
2. The system will automatically open your browser and navigate to the OpenClaw Web management interface.
3. After entering the interface, you will be prompted to **enter the Gateway Token**. Input the token you previously set.
4. Once entered, click the "**Connect**" button below.
https://preview.redd.it/z47ehn5bqi0h1.png?width=1083&format=png&auto=webp&s=f710177d8a5c12ac9d2719c10a214826bd1cd70c
# Step 2: Verify Connection Status
After a successful connection, you can confirm that the gateway is active through the interface:
* Click "**Overview**" in the left sidebar.
* Locate the "**Snapshot**" card on the right side.
Its status should display as green "**OK**"
https://preview.redd.it/pjfnxx5fqi0h1.png?width=1914&format=png&auto=webp&s=0a4dfbddf03ce237481612f22f55195bec0e106e
# Step 3: Start Using the AI Assistant
Click "Chat" in the left sidebar, type your question or command in the input box at the bottom, and click Send (paper plane icon).
If the model you configured in Step 2 is reachable, you'll get a response. If you see a provider error instead, the most common cause is a misconfigured API Base URL, Model name, or API key. You can adjust these from the OpenClaw web UI.
# Common LLM API Configuration
When configuring external large language models for your application, you need to provide the correct API endpoint (URL) and model ID. Below are reference configurations for commonly used models.
# OpenAI (GPT Series)
When configuring the OpenAI API, choose either the base URL or the full request URL based on your application’s requirements:
* Base URL:
​
https://api.openai.com/v1
* Full Request URL (Chat Completions):
​
https://api.openai.com/v1/chat/completions
# Google Gemini
When configuring the Google Gemini API, the request URL must be constructed by combining the base template with a specific model ID.
1. API Endpoint Template Replace `{model}` with the actual model ID you want to use:`https://generativelanguage.googleapis.com/v1/models/{model}:generateContent`
2. Example of a Complete Endpoint For example, using `gemini-3.1-flash-lite-preview`:`https://generativelanguage.googleapis.com/v1/models/gemini-3.1-flash-lite-preview:generateContent`
3. Available Gemini 3 Series Model IDs
Select a model based on your token quota and task requirements:
* `gemini-3.1-flash-lite-preview`
* `gemini-3.1-flash-image-preview`
* `gemini-3.1-pro-preview`
* `gemini-3-flash-preview`
* `gemini-3-pro-image-preview`
# Security Risk Notice
* **Root Privileges and High-Risk Operations**: This application runs with **root privileges** inside a Docker container to support automation tasks. It can read/write/delete files, publish external messages, and execute system commands. **Only grant access to trusted directories and avoid mounting sensitive data paths**.
* **No Multi-Tenant Isolation**: By default, OpenClaw is designed for a single trusted user. It does not provide multi-user isolation. If shared, all users will have the same permissions.
* **Public Network Exposure Risks**: Do not expose the service to the public internet unless you are familiar with security hardening and access control. **UGREEN NAS does not provide guidance for public deployment**—consult professionals if needed.
* **Content Compliance**: When interacting externally (e.g., via messaging bots), ensure compliance with platform policies and local laws. Users are responsible for all AI-generated content.
* **Disclaimer**: This application is provided solely as a service integration tool. The provider offers deployment support only and makes no guarantees regarding the security, stability, or functional completeness of the OpenClaw software itself. Any customization, configuration choices, or operational decisions made by the user—and all resulting legal, financial, or data-related risks (including but not limited to data breaches, device damage, cost overruns, or legal disputes)—are the sole responsibility of the user. The provider assumes no liability.
AI tools related to Gemini 3.1 Flash Image vs Gemini 1.5 Flash Deprecated
These tools are closely connected to one or both models in this comparison and can help you evaluate real-world fit.
googlegemini.co
googlegemini.co is a free tool for interacting with text and images, powered by the Google Gemini Pro API. It allows you to use Gemini easily without managing your own server or API configurations. Google Gemini is a multimodal AI developed by DeepMind capable of processing text, audio, images, and more. It is optimized for various devices, performs well on AI benchmarks, and is built with a focus on safety and responsible AI practices.
GeminiGoogle.cc
GeminiGoogle.cc is a platform dedicated to showcasing Google's most advanced AI model, Gemini. Built for native multimodality, Gemini reasons across text, images, video, audio, and code. It is available in three versions—Ultra, Pro, and Nano—to support tasks ranging from complex reasoning to on-device efficiency. The site highlights Gemini's performance, including its MMLU benchmarks, and provides examples of its capabilities in image generation, problem-solving, and multimodal analysis.
Summarize and Translate Web Pages - Chrome Extension
The Summarize and Translate Web Pages Chrome extension enables you to summarize and translate web content with a single click. Powered by Google's Gemini AI, this tool provides high-quality summaries and translations for web pages, selected text, YouTube video captions, images, and PDF files.
FlyMSG - Chrome Extension
FlyMSG is a free AI-powered Chrome extension designed to enhance productivity through text expansion, autofill, and keyboard shortcuts. It features FlyPosts AI for social media content generation and FlyEngage AI for LinkedIn interaction. Built on Microsoft Azure OpenAI (GPT-4, GPT-3, and GPT-3.5) and Google AI PaLM 2, the extension automates repetitive typing tasks and provides instant access to pre-written templates.
Which model should you choose?
Use the summary below to decide which model better fits your workflow, budget, and feature requirements.
Gemini 3.1 Flash Image
Gemini 3.1 Flash Image is a stronger fit for reasoning-heavy tasks, multimodal applications, cost-efficient scale.
Gemini 1.5 Flash Deprecated
Gemini 1.5 Flash Deprecated is a stronger fit for general-purpose AI workloads.
Choose Gemini 3.1 Flash Image if you prioritize reasoning-heavy tasks, multimodal applications, cost-efficient scale. Choose Gemini 1.5 Flash Deprecated if your workflow depends more on general-purpose AI workloads.
Common questions about Gemini 3.1 Flash Image vs Gemini 1.5 Flash Deprecated
What is the main difference between Gemini 3.1 Flash Image and Gemini 1.5 Flash Deprecated?
Gemini 3.1 Flash Image leans toward reasoning-heavy tasks, multimodal applications, cost-efficient scale, while Gemini 1.5 Flash Deprecated is better suited to general-purpose AI workloads.
Which model is cheaper: Gemini 3.1 Flash Image or Gemini 1.5 Flash Deprecated?
Review both models' current pricing on this page to decide which option is more cost-effective.
Which model has the larger context window: Gemini 3.1 Flash Image or Gemini 1.5 Flash Deprecated?
Gemini 3.1 Flash Image is listed with a context window of 131,072, while Gemini 1.5 Flash Deprecated is listed with N/A.
How should I evaluate Gemini 3.1 Flash Image vs Gemini 1.5 Flash Deprecated for my use case?
Use the feature, pricing, and context comparisons on this page to evaluate the two models.