Google vs Google

Gemini 2.0 Flash Lite vs Gemini 2.0 Flash

Compare Gemini 2.0 Flash Lite and Gemini 2.0 Flash across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for long-context workloads versus long-context workloads.

Overview Comparison

Structured side-by-side differences for the highest-signal model metadata.

Gemini 2.0 Flash Lite
Gemini 2.0 Flash

Provider

The entity that currently provides this model.

Gemini 2.0 Flash Lite Google
Gemini 2.0 Flash Google

Model ID

The routed model identifier exposed by upstream providers.

Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite-001
Gemini 2.0 Flash google/gemini-2.0-flash-001

Input Context Window

The number of tokens supported by the input context window.

Gemini 2.0 Flash Lite 1,048,576 tokens
Gemini 2.0 Flash 1,048,576 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

Gemini 2.0 Flash Lite 8,192 tokens tokens
Gemini 2.0 Flash 8,192 tokens tokens

Open Source

Whether the model's code is available for public use.

Gemini 2.0 Flash Lite No
Gemini 2.0 Flash No

Release Date

When the model was first released.

Gemini 2.0 Flash Lite Feb 25, 2025
Gemini 2.0 Flash Feb 05, 2025

Knowledge Cut-off Date

When the model's knowledge was last updated.

Gemini 2.0 Flash Lite June 2024
Gemini 2.0 Flash June 2024

API Providers

The providers that currently expose the model through an API.

Gemini 2.0 Flash Lite
Google, Vertex AI
Gemini 2.0 Flash
Google, Vertex AI

Modalities

Types of data each model can process or return.

Gemini 2.0 Flash Lite
Text Image File Audio Video
Gemini 2.0 Flash
Text Image File Audio Video

Pricing Comparison

Compare current token pricing before you choose the cheaper or more scalable API option.

Gemini 2.0 Flash Lite Google
Input price $0.08 Per 1M tokens
Output price $0.30 Per 1M tokens
Gemini 2.0 Flash Google
Input price $0.15 Per 1M tokens
Output price $0.40 Per 1M tokens

Capabilities Comparison

See where each model overlaps, where they differ, and which one supports more of the features you care about.

Capability
Gemini 2.0 Flash Lite
Gemini 2.0 Flash
Cost-Effective Scaling Priced for high-volume usage, allowing developers to run large numbers of requests while keeping per-token costs low compared to larger model tiers.
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash
Fast Inference Optimized for low-latency responses, making it suitable for real-time applications and pipelines that require quick turnaround on text generation tasks.
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash
File
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash Supported
Function Calling Supports function calling, enabling the model to invoke developer-defined tools and integrate with external APIs or services within a workflow.
Gemini 2.0 Flash Lite
Gemini 2.0 Flash Supported
Image
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash Supported
Large Context Window Processes up to 1,048,576 tokens in a single request, enabling analysis of long documents, codebases, or extended conversation histories without truncation.
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash Supported
Multimodal Input Accepts text and image inputs within the same request, supporting tasks that combine visual and textual understanding such as image captioning or document analysis.
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash Supported
Real-Time Latency Designed to return responses at real-time speeds, making it suitable for interactive applications and live user-facing workflows.
Gemini 2.0 Flash Lite
Gemini 2.0 Flash Supported
Structured Output Supports JSON-mode responses, allowing developers to request structured data outputs suitable for downstream processing in applications and APIs.
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash Supported
Text
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash Supported
Text Generation Generates coherent, contextually relevant text for use cases including summarization, translation, classification, and content drafting.
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash Supported
Tools
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash Supported
Video
Gemini 2.0 Flash Lite Supported
Gemini 2.0 Flash Supported

Benchmark Comparison

Shared benchmark rows make it easier to compare performance where both models have published scores.

Benchmark Gemini 2.0 Flash Lite Gemini 2.0 Flash
AIME 2024
American math olympiad problems
Gemini 2.0 Flash Lite 27.7%
Gemini 2.0 Flash 33.0%
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
Gemini 2.0 Flash Lite 53.5%
Gemini 2.0 Flash 62.3%
HLE
Questions that challenge frontier models across many domains
Gemini 2.0 Flash Lite 3.6%
Gemini 2.0 Flash 5.3%
LiveCodeBench
Real-world coding tasks from recent competitions
Gemini 2.0 Flash Lite 18.5%
Gemini 2.0 Flash 33.4%
MATH-500
Undergraduate and competition-level math problems
Gemini 2.0 Flash Lite 87.3%
Gemini 2.0 Flash 93.0%
MMLU-Pro
Expert knowledge across 14 academic disciplines
Gemini 2.0 Flash Lite 72.4%
Gemini 2.0 Flash 77.9%
SciCode
Scientific research coding and numerical methods
Gemini 2.0 Flash Lite 25.0%
Gemini 2.0 Flash 33.3%
Community discussion

What Reddit discussions say about Gemini 2.0 Flash Lite vs Gemini 2.0 Flash

Gemini 2.0 Flash Lite and Gemini 2.0 Flash are both surfacing live Reddit discussions, giving this comparison a community layer beyond specs and benchmarks.

The most visible threads right now are clustered in r/Bard, r/googlecloud, r/RooCode. 3 threads are showing up in both models' discussion sets, which is useful for side-by-side evaluation.

Gemini 2.0 Flash r/Bard 157 upvotes 25 comments April 9, 2025
New Gemini Updates

Changelog

April 9, 2025

Model updates:

Released veo-2.0-generate-001, a generally available (GA) text- and image-to-video model, capable of generating detailed and artistically nuanced videos. To learn more, see the Veo docs.

Released gemini-2.0-flash-live-001, a public preview version of the Live API model with billing enabled.

Open Reddit thread

# 80,000 NOK ($7,500) drained from my Google Cloud account in 5 minutes — full forensic breakdown of how the attack worked

I want to write this up while it's fresh, because the *mechanism* of the attack is more interesting than the "I leaked a key, oops" headline — and the platform design that allowed it is something every Google Cloud user should know about.

# What happened

* May 8, 2026, evening (CET): I get a billing alert email saying I owe NOK 82,305.36 (\~$7,500 USD) on my Google Cloud account.
* My typical monthly spend: \~100 NOK ($10).
* The spike happened in roughly 5 minutes.
* All charges were on the Gemini API in a single project I'd barely touched (an old "no-code maps" project from 2017).
* An API key from that project was leaked somewhere — I'm still hunting where. Most likely an old GitHub repo or a public webpage from 2018-ish that had Gemini API enabled on its project years later (I think this is what made it exploitable — the key sat dormant, but the moment Gemini got enabled on its project, the dormant key became a Gemini-capable wallet).

# What the attacker actually did (the part nobody talks about)

I pulled the SKU-level breakdown from Billing → Reports. The attacker didn't just hit one model. They ran an automated framework that fanned out across every Gemini variant simultaneously:

* Gemini 3 Pro (text + image generation)
* Gemini 3 Flash
* Gemini 3.1 Flash Image
* Gemini 3.1 Flash Lite Preview
* Gemini 2.5 Pro (text + TTS)
* Gemini 2.5 Flash (short + long context, multimodal)
* Gemini 2.5 Flash Lite
* Gemini 2.0 Flash TTS
* Gemini Embedding-2 + Embedding-001

15+ distinct models in 5 minutes. No human application uses 15 models in parallel. This is the signature of an automated abuse framework, almost certainly a credential-resale operation.

Token volumes:

* 1.09 BILLION input tokens on Gemini 2.5 Flash Lite alone
* 402M image input tokens on Gemini 3 Pro
* 226M text input tokens on Gemini 3 Pro
* 19.4M image output tokens on Gemini 3 Pro Image — kr 21,674 ($2,000) on this single SKU, the most expensive line item

The attacker prioritized image generation because that's where the real money is — image output tokens are 50–100x more expensive than text.

# How they bypassed rate limits (this is the architectural problem)

You'd think rate limits would protect you. They don't — at least not on Google Cloud:

* Gemini 3 Pro: 1,000 RPM
* Gemini 3 Flash: 2,000 RPM
* Gemini 2.5 Flash Lite: 4,000 RPM
* (etc., for every model — *each with its own independent quota*)

There is no per-key aggregate cap across models. If you fan out across 15 models concurrently, you cap at the *sum* — easily 30,000+ RPM combined.

OpenAI, Anthropic, and Mistral all have per-key aggregate caps. Google does not. This is not a policy oversight — it's the core mechanism that makes a single compromised key a 5-minute, 5-figure liability.

Also: Google Cloud does not offer a hard spending cap. No "stop all spend at $X" option. The closest is a budget alert that *emails you* (after the fact), or — and this is the documented "solution" — you can write your own Cloud Function that listens to budget Pub/Sub events and programmatically disables your billing account. Yes, Google's official answer to "how do I stop runaway spending" is "deploy code on the same platform that's billing you." This has been a known gripe for years.

# What logging gave me — almost nothing

I tried every audit log query:

* `protoPayload.serviceName="generativelanguage.googleapis.com"` → empty
* `resource.type="consumed_api"` for the project → empty
* Vertex AI logs → empty

Google does not log per-request data for Gemini API key calls. No caller IP, no user-agent, no request size. The only forensic record that exists is the SKU-level billing report — and that only goes down to "model + token type", not session/request/key.

So I can't tell you who did it, where they were, or what they generated. I just know it was 15 models in parallel and 19M image output tokens.

# What I did in the first 90 minutes

* Deleted all 13 API keys on the affected project (after seeing the alert at \~01:25)
* Disabled [`generativelanguage.googleapis.com`](http://generativelanguage.googleapis.com) and [`aiplatform.googleapis.com`](http://aiplatform.googleapis.com) on every one of my 25+ projects (script via `gcloud services disable`)
* Closed all 3 billing accounts
* Called my bank, blocked the Visa
* Got into Google's billing chat queue, escalated to specialist team within 5 messages
* Case 71021804 opened, 24-48h response window
* Pulled SKU-level forensic evidence

The chat agent confirmed end-of-month billing cycle, so the actual charge attempt won't fire until \~May 28-31. By then either the specialist team has waived it, or the card-block + chargeback dispute kicks in.

# What I'm pretty sure happens next

* \~85% chance: specialist team waives the charge under the compromised-credentials policy. Google has standardized this for exactly this scenario because they know the rate-limit architecture allows it.
* \~10% chance: partial waiver / settlement.
* \~5% chance: they refuse, my bank chargeback wins it under Norwegian Finansavtaleloven (450 NOK max liability for unauthorized card use).

I'm not actually going to pay 80k. The realistic worst case is several months of paperwork.

# Lessons / PSA for everyone running Google Cloud

1. Restrict every API key at creation time. Application restriction (HTTP referrer or IP allowlist) + API restriction (only the APIs you use). An unrestricted key on a project where Gemini happens to be enabled is a wallet.
2. Audit every project for keys you've forgotten about. I had keys from 2017, 2020, 2021 — most predating Gemini's existence. The moment Gemini got enabled on those old projects, the old keys could call it.
3. Disable APIs you don't actively use. Per-project. An enabled API + an unrestricted key = exposure.
4. Set up a budget-disables-billing Cloud Function. The auto-shutdown one. Yes it's stupid that Google makes you write code for this, but it's the only real circuit breaker.
5. Don't trust rate limits. They protect Google's infrastructure, not your wallet. Per-model RPM × N models = no real cap.
6. Don't store API keys in client-side code, ever. Even if you think a project is dead.

# Where the leak came from

Honestly, I don't know yet. The project was created in 2017 (back when Google appended a numeric suffix like `-364317` to project IDs). It had 13 keys accumulated over years. One of them is somewhere out in the wild. I'll be searching GitHub history, old Vercel deployments, Wayback Machine, and screenshots over the coming days. If I find it I'll edit this post.

If anyone has run into the same multi-model abuse pattern recently, I'd love to hear about it — particularly if you have any signals on which credential-resale operations are currently active.

Edit: Will update with specialist team's response when it arrives in 24-48h.

Open Reddit thread
Gemini 2.0 Flash Lite r/RooCode 32 upvotes 6 comments February 5, 2025
Roo Code 3.3.12 Released - Support for new Gemini models!

### 📢 Gemini 2.0 Support
- **Added support for new Gemini 2.0 models**, which include:
- Structured outputs
- Function calling
- Large context windows
- Image support
- Prompt caching (coming soon)
- 8192 max token Output

- **Individual Models**
- **gemini-2.0-flash-001** – 1,048,576 context
- **gemini-2.0-flash-lite-preview-02-05** – 1,048,576 context
- **gemini-2.0-pro-exp-02-05** – 2,097,152 context

### 🐛 Bug Fixes
- Fix issue with changing a mode's API configuration on the prompts tab

----

If Roo Code has been useful to you, take a moment to [rate it on the VS Code Marketplace](https://marketplace.visualstudio.com/items?itemName=RooVeterinaryInc.roo-cline&ssr=false#review-details). Reviews help others discover it and keep it growing!

---
*Download the latest version from our [VSCode Marketplace page](https://marketplace.visualstudio.com/items?itemName=rooveterinaryinc.roo-cline) and pleaes WRITE US A REVIEW*

*Join our communities:*
* *[Discord server](https://discord.gg/roocode) for real-time support and updates*
* *[r/RooCode](https://reddit.com/r/RooCode) for discussions and announcements*

Open Reddit thread
Gemini 2.0 Flash Lite r/ChatGPTCoding 27 upvotes 15 comments February 5, 2025
Roo Code Support Gemini 2.0 - 3.3.12 Released

### 📢 Gemini 2.0 Support
- **Added support for new Gemini 2.0 models**, which include:
- Structured outputs
- Function calling
- Large context windows
- Image support
- Prompt caching (coming soon)
- 8192 max token Output

- **Individual Models**
- **gemini-2.0-flash-001** – 1,048,576 context
- **gemini-2.0-flash-lite-preview-02-05** – 1,048,576 context
- **gemini-2.0-pro-exp-02-05** – 2,097,152 context

### 🐛 Bug Fixes
- Fix issue with changing a mode's API configuration on the prompts tab

----

If Roo Code has been useful to you, take a moment to [rate it on the VS Code Marketplace](https://marketplace.visualstudio.com/items?itemName=RooVeterinaryInc.roo-cline&ssr=false#review-details). Reviews help others discover it and keep it growing!

---
*Download the latest version from our [VSCode Marketplace page](https://marketplace.visualstudio.com/items?itemName=rooveterinaryinc.roo-cline) and pleaes WRITE US A REVIEW*

*Join our communities:*
* *[Discord server](https://discord.gg/roocode) for real-time support and updates*
* *[r/RooCode](https://reddit.com/r/RooCode) for discussions and announcements*

Open Reddit thread
Gemini 2.0 Flash r/lmarena 12 upvotes 16 comments April 25, 2026
List of all models.

There are currently 481 models listed on the [arena.ai](http://arena.ai) website.

Here's the full list:

amazon.nova-pro-v1:0

anonymous-0410

anonymous-1111

anonymous-1218

anonymous-1221

anonymous-1800

anonymous-1815

anonymous-1825

anonymous-1835

apex-atlas

april26-chatbot1

april26-chatbot2

arastradero

atlas

autobear

badger

basalt-0303-1

basalt-0422-1

baseliner

beluga-0311-1

beluga-0413-1

blackhawk

blue-forge

botbot2

chatgpt-image-latest-high-fidelity (20251216)

chipmunk

chives

citrus

claude-3-5-sonnet-20241022

claude-3-7-sonnet-20250219

claude-3-7-sonnet-20250219-thinking-32k

claude-haiku-4-5-20251001

claude-opus-4-1-20250805

claude-opus-4-1-20250805-thinking-16k

claude-opus-4-1-search

claude-opus-4-20250514

claude-opus-4-20250514-thinking-16k

claude-opus-4-5-20251101

claude-opus-4-5-20251101-thinking-32k

claude-opus-4-5-search

claude-opus-4-6

claude-opus-4-6-search

claude-opus-4-6-thinking

claude-opus-4-7

claude-opus-4-7-search

claude-opus-4-7-thinking

claude-opus-4-search

claude-sonnet-4-20250514

claude-sonnet-4-20250514-thinking-32k

claude-sonnet-4-5-20250929

claude-sonnet-4-5-20250929-thinking-32k

claude-sonnet-4-5-search

claude-sonnet-4-6

claude-sonnet-4-6-search

clawl

clinkz

cloud-buddy

dall-e-3

dart-frog-0206

deep-octo

deepseek-v4-flash

deepseek-v4-flash-thinking

deepseek-v4-pro

deepseek-v4-pro-thinking

devstral-2

devstral-medium-2507

dialogue

dola-seed-2.0-preview-text

dola-seed-2.0-preview-vision

dola-seed-2.0-pro-text

dola-seed-2.0-pro-vision

dove

duomo-1-hero

EB45-turbo

EB45-vision

ember

emu

epilogue

ernie-5.0-0110

ernie-5.0-preview-1220

ernie-exp-251023

ernie-exp-251024

ernie-exp-251025

ernie-exp-251026

ernie-exp-251027

ernie-exp-vl-251016

ernie-image

eureka

february26-chatbot2

february26-chatbot3

february26-chatbot4

flashbrown-a

flashbrown-b

flow-state

flow-state-2

flow-state-3

flux-1-kontext-dev

flux-1-kontext-max

flux-1-kontext-pro

flux-2-dev

flux-2-flex

flux-2-klein-4b

flux-2-klein-9b

flux-2-max

flux-2-pro

flying-octopus

frenchfry

frieza

gallant

gallery

gcps-fast

gemini-2.0-flash-001

gemini-2.5-flash

gemini-2.5-flash-image-preview (nano-banana)

gemini-2.5-pro

gemini-2.5-pro-grounding

gemini-2.5-pro-grounding-exp

gemini-3-flash

gemini-3-flash (thinking-minimal)

gemini-3-flash-grounding

gemini-3-pro

gemini-3-pro-image-preview-2k (nano-banana-pro)

gemini-3.1-flash-image-preview (nano-banana-2) \[web-search\]

gemini-3.1-flash-lite-preview

gemini-3.1-pro

gemini-3.1-pro-grounding

gemini-3.1-pro-preview

gemma-3-27b-it

gemma-3n-e4b-it

glm-4.7

glm-4.7-flash

glm-5

glm-5.1

glm-5v-turbo

globe\_2

gpt-4.1-2025-04-14

gpt-4.1-mini-2025-04-14

gpt-5-chat

gpt-5-high

gpt-5-high-new-system-prompt

gpt-5-high-no-system-prompt

gpt-5-medium

gpt-5-mini-high

gpt-5-nano-high

gpt-5-search

gpt-5.1

gpt-5.1-codex

gpt-5.1-codex-max

gpt-5.1-codex-mini

gpt-5.1-high

gpt-5.1-medium

gpt-5.1-search

gpt-5.1-search-sp

gpt-5.2

gpt-5.2-chat-latest

gpt-5.2-codex

gpt-5.2-high

gpt-5.2-search

gpt-5.2-search-non-reasoning

gpt-5.3-chat-latest

gpt-5.3-codex

gpt-5.4

gpt-5.4-high

gpt-5.4-high-no-system-prompt

gpt-5.4-medium

gpt-5.4-mini-high

gpt-5.4-nano-high

gpt-5.4-no-system-prompt

gpt-5.4-search

gpt-5.5

gpt-5.5-high

gpt-5.5-search

gpt-image-1

gpt-image-1-high-fidelity

gpt-image-1-mini

gpt-image-1.5-high-fidelity

gpt-image-2 (medium)

gpt-oss-120b

gpt-oss-20b

grok-3-mini-beta

grok-3-mini-high

grok-4-0709

grok-4-1-fast-non-reasoning

grok-4-1-fast-reasoning

grok-4-1-fast-search

grok-4-fast-chat

grok-4-fast-reasoning

grok-4-fast-search

grok-4-search

grok-4.1

grok-4.1-thinking

grok-4.20-beta-0309-reasoning

grok-4.20-beta1

grok-4.20-multi-agent-beta-0309

grok-code-fast-1

grok-imagine-image

grok-imagine-image-pro

grok-imagine-video

hailuo-02-fast

hailuo-02-pro

hailuo-02-standard

hailuo-2.3

hailuo-2.3-fast

happy-friday-testing-1

happy-friday-testing-2

hearth

hidream-e1.1

hofburg\_2

hofburg\_2\_alt

hofburg\_3

hofburg\_4

hofburg\_5

hofburg\_5\_alt

hofburg-1

hunyuan-hy3-preview

hunyuan-image-2.1

hunyuan-image-3.0

hunyuan-image-3.0-fal

hunyuan-t1-20250711

hunyuan-video-1.5

hunyuan-vision-1.5-thinking

ibm-granite-h-small

ideogram-v3-quality

imagen-3.0-generate-002

imagen-4.0-fast-generate-001

imagen-4.0-generate-001

imagen-4.0-ultra-generate-001

intellect-3

jester

jumbo-dungeness

juniper

k2

kandinsky-5.0-i2v-pro

kandinsky-5.0-t2v-lite

kandinsky-5.0-t2v-pro

karyu

KAT-Coder-Pro-V1

ketchup-v2

kimi-k2-0711-preview

kimi-k2-0905-preview

kimi-k2-thinking-turbo

kimi-k2.5

kimi-k2.5-instant

kimi-k2.6

kiteki

kiwi-do

kiwire

kizen-alpha

kizen-beta

kling-2.5-turbo-1080p

kling-2.6-pro

kling-2.6-standard

kling-image-o1

kling-o1-pro

kling-o3-pro

kling-v2.1-master

kling-v2.1-standard

kling-v3

leepwal

left-bank

ling-1t

ling-1t-1031

ling-2.5-1t

ling-flash-2.0

llama-3.3-70b-instruct

longcat-flash-chat

ltx-2-19b

lucid-origin

mammoth-newt-0206

mammoth-newt-0226

march26-chatbot1

march26-chatbot1-public

march26-chatbot2

march26-chatbot3

markhor

Max

mercury

mercury-2

micro-mango

mimo-v2-flash

mimo-v2-flash (thinking)

mimo-v2-omni

mimo-v2-pro

mimo-v2.5

mimo-v2.5-pro

minicpm-sala

minimax-m1

minimax-m2

minimax-m2-preview

minimax-m2.1-preview

minimax-m2.5

mistral-large-3

mistral-medium-2505

mistral-medium-2508

mistral-small-2506

mistral-small-2603

mistral-small-3.1-24b-instruct-2503

mochi-v1

model-x

model-x-2

molmo-2-8b

monologue

monster

monterey

neon

nightride-on

nightride-on-v2

nova-2-lite

nvidia-nemotron-3-nano-30b-a3b-bf16

o3-2025-04-16

o3-mini

o3-search

o4-mini-2025-04-16

olmo-3-32b-think

olmo-3.1-32b-instruct

olmo-3.1-32b-think

orion

p-image

p-image-edit

paper-lantern

pebble-1

pebble-2

pepper

photon

pika-v2.2

pine

pisces-0226d

pisces-0309

pisces-0309-vision

pisces-0309b

pisces-0309c

pisces-0309d

pisces-0318-text

pisces-0318-vision

pisces-0320

pisces-llm-0130

pixel-parrot

pixverse-v5.6

ppl-sonar-reasoning-pro-high

prologue

pteronura

pulse

queen-bee

quiet\_sand

qwen-image-2.0

qwen-image-2.0-pro

qwen-image-2512

qwen-image-edit

qwen-image-edit-2511

qwen-image-prompt-extend

qwen-vl-max-2025-08-13

qwen3-235b-a22b

qwen3-235b-a22b-instruct-2507

qwen3-235b-a22b-no-thinking

qwen3-235b-a22b-thinking-2507

qwen3-30b-a3b

qwen3-30b-a3b-instruct-2507

qwen3-coder-480b-a35b-instruct

qwen3-max-2025-09-23

qwen3-max-2025-09-26

qwen3-max-2025-10-30

qwen3-max-preview

qwen3-max-thinking

qwen3-next-80b-a3b-instruct

qwen3-next-80b-a3b-thinking

qwen3-omni-flash

qwen3-vl-235b-a22b-instruct

qwen3-vl-235b-a22b-thinking

qwen3-vl-8b-instruct

qwen3-vl-8b-thinking

qwen3.5-122b-a10b

qwen3.5-122b-a10b-code

qwen3.5-27b

qwen3.5-27b-code

qwen3.5-35b-a3b

qwen3.5-35b-a3b-code

qwen3.5-397b-a17b

qwen3.5-flash

qwen3.6-plus

qwen3.6-plus-preview

qwq-32b

raptor-1.8-0120

raptor-1123

raptor-1124

ray-3

ray2

recraft-v3

recraft-v4

redwood

reve-v1.1

reve-v1.1-fast

ring-1t

ring-2.5-1t

ring-flash-2.0

rising-sun

robin

robin-high

rotten-apple

runway-gen-4.5

runway-gen4

runway-gen4-aleph

runway-gen4-turbo

scorch

seed-1.8

seedance-v1-lite

seedance-v1-pro

seedance-v1.5-pro

seededit-3.0

seedream-3

seedream-4-high-res-fal

seedream-4.5

seedream-5.0-lite

shakshouka

significant-otter

snowflake

soft-shell

solar-eclipse

sora

sora-2

sora-2-pro

spark

sphinx

spire

star-drift

steed-0217

step-3

step-3-mini-2511

step-3.5-flash

stephen-v2

stephen-vision-csfix

sungod

sunshine-ai

super-cara

super-gcp

tatertot

trinity-large

trinity-large-thinking

velo

veo-2

veo-3

veo-3-audio

veo-3-fast

veo-3-fast-audio

veo-3.1-audio

veo-3.1-audio-1080p

veo-3.1-audio-4k

veo-3.1-fast-audio

veo-3.1-fast-audio-1080p

veo-3.1-fast-audio-4k

vidu-q2-image

vierra

viper

vortex

vulcan

waffle

wan-v2.2-a14b

wan-vace

wan2.5-i2i-preview

wan2.5-i2v-preview

wan2.5-preview

wan2.5-t2i-preview

wan2.5-t2v-preview

wan2.6-i2v

wan2.6-image

wan2.6-t2i

wan2.6-t2v

wan2.7-i2v

wan2.7-image

wan2.7-image-pro

wan2.7-t2v

whisperfall

wild-bits

yivon-beta

yotta-nexus

z-image

zephyr

zero-prism

zeylu-alpha

zeylu-beta

zorik

Unfortunately, the list of models available for selection in direct and side-by-side mode is much smaller :(

Open Reddit thread
Gemini 2.0 Flash Lite r/Bard 7 upvotes 1 comments April 22, 2025
All models available from API

I wonder why they don't put all models also on the aistudio UI (perhaps as a setting).

Here is the full list as of today:

chat-bison-001
text-bison-001
embedding-gecko-001
gemini-1.0-pro-vision-latest
gemini-pro-vision
gemini-1.5-pro-latest
gemini-1.5-pro-001
gemini-1.5-pro-002
gemini-1.5-pro
gemini-1.5-flash-latest
gemini-1.5-flash-001
gemini-1.5-flash-001-tuning
gemini-1.5-flash
gemini-1.5-flash-002
gemini-1.5-flash-8b
gemini-1.5-flash-8b-001
gemini-1.5-flash-8b-latest
gemini-1.5-flash-8b-exp-0827
gemini-1.5-flash-8b-exp-0924
gemini-2.5-pro-exp-03-25
gemini-2.5-pro-preview-03-25
gemini-2.5-flash-preview-04-17
gemini-2.0-flash-exp
gemini-2.0-flash
gemini-2.0-flash-001
gemini-2.0-flash-lite-001
gemini-2.0-flash-lite
gemini-2.0-flash-lite-preview-02-05
gemini-2.0-flash-lite-preview
gemini-2.0-pro-exp
gemini-2.0-pro-exp-02-05
gemini-exp-1206
gemini-2.0-flash-thinking-exp-01-21
gemini-2.0-flash-thinking-exp
gemini-2.0-flash-thinking-exp-1219
learnlm-1.5-pro-experimental
learnlm-2.0-flash-experimental
gemma-3-1b-it
gemma-3-4b-it
gemma-3-12b-it
gemma-3-27b-it
embedding-001
text-embedding-004
gemini-embedding-exp-03-07
gemini-embedding-exp
aqa
imagen-3.0-generate-002
veo-2.0-generate-001
gemini-2.0-flash-live-001

Open Reddit thread
View more discussions →

AI tools related to Gemini 2.0 Flash Lite vs Gemini 2.0 Flash

These tools are closely connected to one or both models in this comparison and can help you evaluate real-world fit.

Large Language Models (LLMs)

googlegemini.co

googlegemini.co is a free tool for interacting with text and images, powered by the Google Gemini Pro API. It allows you to use Gemini easily without managing your own server or API configurations. Google Gemini is a multimodal AI developed by DeepMind capable of processing text, audio, images, and more. It is optimized for various devices, performs well on AI benchmarks, and is built with a focus on safety and responsible AI practices.

Free 0 visits 2 saves
AI Assistant

GeminiGoogle.cc

GeminiGoogle.cc is a platform dedicated to showcasing Google's most advanced AI model, Gemini. Built for native multimodality, Gemini reasons across text, images, video, audio, and code. It is available in three versions—Ultra, Pro, and Nano—to support tasks ranging from complex reasoning to on-device efficiency. The site highlights Gemini's performance, including its MMLU benchmarks, and provides examples of its capabilities in image generation, problem-solving, and multimodal analysis.

Free 0 visits 2 saves

The Summarize and Translate Web Pages Chrome extension enables you to summarize and translate web content with a single click. Powered by Google's Gemini AI, this tool provides high-quality summaries and translations for web pages, selected text, YouTube video captions, images, and PDF files.

Free
AI Chatbot

Alle-AI

Alle-AI is an all-in-one platform that lets you use multiple leading generative AI models side-by-side. It allows you to interact with, compare, and leverage the capabilities of models such as OpenAI's ChatGPT, Google's Gemini, Anthropic's Claude, DALL-E 2, Stable Diffusion, and Midjourney for chat, image, audio, and video generation.

Free 30 visits 5 saves

Which model should you choose?

Use the summary below to decide which model better fits your workflow, budget, and feature requirements.

Best fit for

Gemini 2.0 Flash Lite

Gemini 2.0 Flash Lite is a stronger fit for long-context workloads, tool-augmented workflows, multimodal applications.

Best fit for

Gemini 2.0 Flash

Gemini 2.0 Flash is a stronger fit for long-context workloads, tool-augmented workflows, multimodal applications.

Verdict

Choose Gemini 2.0 Flash Lite if you prioritize long-context workloads, tool-augmented workflows, multimodal applications. Choose Gemini 2.0 Flash if your workflow depends more on long-context workloads, tool-augmented workflows, multimodal applications.

FAQ

Common questions about Gemini 2.0 Flash Lite vs Gemini 2.0 Flash

What is the main difference between Gemini 2.0 Flash Lite and Gemini 2.0 Flash?

Gemini 2.0 Flash Lite leans toward long-context workloads, tool-augmented workflows, multimodal applications, while Gemini 2.0 Flash is better suited to long-context workloads, tool-augmented workflows, multimodal applications.

Which model is cheaper: Gemini 2.0 Flash Lite or Gemini 2.0 Flash?

Gemini 2.0 Flash Lite starts lower on input pricing at $0.0800 per 1M input tokens, compared with $0.1500 for Gemini 2.0 Flash.

Which model has the larger context window: Gemini 2.0 Flash Lite or Gemini 2.0 Flash?

Gemini 2.0 Flash Lite is listed with a context window of 1,048,576, while Gemini 2.0 Flash is listed with 1,048,576.

How should I evaluate Gemini 2.0 Flash Lite vs Gemini 2.0 Flash for my use case?

This comparison currently includes 7 shared benchmark rows, helping you compare practical performance across overlapping evaluations.