Amazon vs Amazon

Amazon Nova Lite vs Amazon Nova Micro

Compare Amazon Nova Lite and Amazon Nova Micro across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for tool-augmented workflows versus tool-augmented workflows.

Amazon Nova Lite

Dec 05, 2024 300,000 context 5,000 tokens output

Amazon Nova Micro

Dec 05, 2024 128,000 context 5,000 tokens output

Overview ↓ Pricing ↓ Capabilities ↓ Benchmarks ↓ Community ↓ Tools ↓ Verdict ↓ FAQ ↓ Related ↓

Overview Comparison

Structured side-by-side differences for the highest-signal model metadata.

Amazon Nova Lite

Amazon Nova Micro

Provider

The entity that currently provides this model.

Amazon Nova Lite Amazon

Amazon Nova Micro Amazon

Model ID

The routed model identifier exposed by upstream providers.

Amazon Nova Lite amazon/nova-lite-v1

Amazon Nova Micro amazon/nova-micro-v1

Input Context Window

The number of tokens supported by the input context window.

Amazon Nova Lite 300,000 tokens

Amazon Nova Micro 128,000 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

Amazon Nova Lite 5,000 tokens tokens

Amazon Nova Micro 5,000 tokens tokens

Open Source

Whether the model's code is available for public use.

Amazon Nova Lite No

Amazon Nova Micro No

Release Date

When the model was first released.

Amazon Nova Lite Dec 05, 2024

Amazon Nova Micro Dec 05, 2024

Knowledge Cut-off Date

When the model's knowledge was last updated.

Amazon Nova Lite 2024-10-31

Amazon Nova Micro December 2024

API Providers

The providers that currently expose the model through an API.

Amazon Nova Lite

OpenRouter

Amazon Nova Micro

OpenRouter

Modalities

Types of data each model can process or return.

Amazon Nova Lite

Text Image Video

Amazon Nova Micro

Text

Pricing Comparison

Compare current token pricing before you choose the cheaper or more scalable API option.

Amazon Nova Lite Amazon

Input price $0.06 Per 1M tokens

Output price $0.24 Per 1M tokens

Amazon Nova Micro Amazon

Input price $0.04 Per 1M tokens

Output price $0.14 Per 1M tokens

Capabilities Comparison

See where each model overlaps, where they differ, and which one supports more of the features you care about.

Capability

Amazon Nova Lite

Amazon Nova Micro

Agentic Task Execution Designed to support agentic workflows and UI actuation. Can be used in multi-step task pipelines that require reasoning and action sequencing.

Amazon Nova Lite Supported

Amazon Nova Micro —

Agentic Task Support Supports agentic workflows, allowing the model to be used in multi-step task pipelines and tool-use scenarios within Amazon Bedrock.

Amazon Nova Lite —

Amazon Nova Micro Supported

Cost-Efficient Inference Priced at the lower end of the Nova model family for multimodal tasks. Intended for high-volume applications where per-token cost is a key constraint.

Amazon Nova Lite Supported

Amazon Nova Micro Supported

Fine-Tuning Support Supports text and vision fine-tuning via Amazon Bedrock. Developers can customize the model to improve accuracy or reduce cost for specific tasks.

Amazon Nova Lite Supported

Amazon Nova Micro Supported

Image

Amazon Nova Lite Supported

Amazon Nova Micro —

Large Context Window Supports up to 300,000 tokens of context per request. This allows processing of long documents, extended conversations, or multiple media inputs in one call.

Amazon Nova Lite Supported

Amazon Nova Micro —

Long Context Window Supports up to 128,000 tokens per request, enabling processing of long documents, transcripts, or multi-turn conversations in a single call.

Amazon Nova Lite —

Amazon Nova Micro Supported

Low Latency Responses Designed to return text completions faster than other models in the Nova family, making it suitable for real-time or high-throughput applications.

Amazon Nova Lite —

Amazon Nova Micro Supported

Low-Latency Responses Optimized for fast inference across multimodal inputs. Designed to return responses quickly even when handling image and video alongside text.

Amazon Nova Lite Supported

Amazon Nova Micro —

Multimodal Input Processes image, video, and text inputs within a single request. Enables tasks like visual question answering and document analysis combining text and images.

Amazon Nova Lite Supported

Amazon Nova Micro —

Text

Amazon Nova Lite Supported

Amazon Nova Micro Supported

Text Generation Generates coherent text output for tasks such as summarization, classification, question answering, and instruction following using text-only input.

Amazon Nova Lite —

Amazon Nova Micro Supported

Tools

Amazon Nova Lite Supported

Amazon Nova Micro Supported

Video Understanding Accepts video as a direct input type for analysis and comprehension tasks. Enables use cases such as video summarization and content extraction.

Amazon Nova Lite Supported

Amazon Nova Micro —

Benchmark Comparison

Shared benchmark rows make it easier to compare performance where both models have published scores.

Benchmark	Amazon Nova Lite	Amazon Nova Micro
AIME 2024 American math olympiad problems	Amazon Nova Lite 10.7%	Amazon Nova Micro 8.0%
GPQA Diamond PhD-level science questions (biology, physics, chemistry)	Amazon Nova Lite 43.3%	Amazon Nova Micro 35.8%
HLE Questions that challenge frontier models across many domains	Amazon Nova Lite 4.6%	Amazon Nova Micro 4.7%
LiveCodeBench Real-world coding tasks from recent competitions	Amazon Nova Lite 16.7%	Amazon Nova Micro 14.0%
MATH-500 Undergraduate and competition-level math problems	Amazon Nova Lite 76.5%	Amazon Nova Micro 70.3%
MMLU-Pro Expert knowledge across 14 academic disciplines	Amazon Nova Lite 59.0%	Amazon Nova Micro 53.1%
SciCode Scientific research coding and numerical methods	Amazon Nova Lite 13.9%	Amazon Nova Micro 9.4%

Community discussion

What Reddit discussions say about Amazon Nova Lite vs Amazon Nova Micro

Amazon Nova Lite and Amazon Nova Micro are both surfacing live Reddit discussions, giving this comparison a community layer beyond specs and benchmarks.

The most visible threads right now are clustered in r/aws, r/LLMDevs, r/BlackboxAI_. The feed below mixes discussion threads surfaced for each model so you can quickly spot where community sentiment overlaps or diverges.

Amazon Nova Lite r/LLMDevs 10 upvotes 11 comments December 9, 2025

I don't think anyone is using Amazon Nova Lite 2.0, but I built router for it for Claude Code

Amazon just launched Nova 2 Lite models on Bedrock.

Now, you can use those models directly with Claude Code, and set automatic preferences on when to invoke the model for specific coding scenarios. Sample config below. This way you can mix/match different models based on coding use cases. Details in the demo folder here: [https://github.com/katanemo/archgw/tree/main/demos/use\_cases/claude\_code\_router](https://github.com/katanemo/archgw/tree/main/demos/use_cases/claude_code_router)

if you think this is useful, then don't forget to the star the project 🙏

# Anthropic Models
- model: anthropic/claude-sonnet-4-5
access_key: $ANTHROPIC_API_KEY
routing_preferences:
- name: code understanding
description: understand and explain existing code snippets, functions, or libraries

- model: amazon_bedrock/us.amazon.nova-2-lite-v1:0
default: true
access_key: $AWS_BEARER_TOKEN_BEDROCK
base_url: https://bedrock-runtime.us-west-2.amazonaws.com
routing_preferences:
- name: code generation
description: generating new code snippets, functions, or boilerplate based on user prompts or requirements

- model: anthropic/claude-haiku-4-5
access_key: $ANTHROPIC_API_KEY

Open Reddit thread

Amazon Nova Lite r/BlackboxAI_ 5 upvotes 5 comments December 4, 2025

Problems with Microsoft models

None of the MS models seem to be working for me. I get an error like:

`[API Error: 404 litellm.NotFoundError: NotFoundError: OpenrouterException - {"error":{"message":"No endpoints found that support tool use. To learn more about provider routing, visit:`
`https://openrouter.ai/docs/guides/routing/provider-selection","code":404}}. Received Model Group=blackboxai/microsoft/phi-4Available Model Group Fallbacks=None]`

Separately, the amazon/nova-lite-v1 model is s\*\*t... Offers vague recommendations and no specific fix for any code.

Open Reddit thread

Amazon Nova Lite r/ClaudeAI 4 upvotes 3 comments December 9, 2025

I didn't think anyone cared for Amazon Nova Lite 2.0 LLM, until I built a router and hooked it up with Claude Code

Amazon just launched Nova 2 Lite models on Bedrock.

Now, you can use those models directly with Claude Code, and set automatic preferences on when to invoke the model for specific coding scenarios. Sample config below. This way you can mix/match different models based on coding use cases. Details in the demo folder here: [https://github.com/katanemo/archgw/tree/main/demos/use\_cases/claude\_code\_router](https://github.com/katanemo/archgw/tree/main/demos/use_cases/claude_code_router)

# Anthropic Models
- model: anthropic/claude-sonnet-4-5
access_key: $ANTHROPIC_API_KEY
routing_preferences:
- name: code understanding
description: understand and explain existing code snippets, functions, or libraries

- model: amazon_bedrock/us.amazon.nova-2-lite-v1:0
default: true
access_key: $AWS_BEARER_TOKEN_BEDROCK
base_url: https://bedrock-runtime.us-west-2.amazonaws.com
routing_preferences:
- name: code generation
description: generating new code snippets, functions, or boilerplate based on user prompts or requirements

- model: anthropic/claude-haiku-4-5
access_key: $ANTHROPIC_API_KEY

if you think this is useful, then don't forget to the star the project 🙏

Open Reddit thread

Amazon Nova Lite r/aipromptprogramming 4 upvotes 1 comments December 6, 2025

I built the bridge from Claude Code to Amazon Nova Lite 2.0

Open Reddit thread

Amazon Nova Micro r/ChatGPT 3 upvotes 1 comments April 3, 2025

Automated Hallucination Reduction via Multi-Agent Cross-Verification

Today, the AI model that hallucinates the least is Google Gemini 2.0 Flash 001, with a factual consistency rate of 99.3%. This score is encouraging because it means that we're relatively close to solving the hallucination problem.

https://github.com/vectara/hallucination-leaderboard

What would happen if we built an AI agent that would first query Google Gemini 2.5 Pro about something, (because it is currently the most powerful model, completely dominating the Chatbot Arena Leaderboard by almost 40 points) and then ran the answer it generated by other models to catch any inaccuracies it may have generated?

https://lmarena.ai/?leaderboard

We presume that the different AI developers use different data sets to build their models, so while one may hallucinate about a certain query, it's possible that another would not. What would happen if we instructed our AI agent to run the content Gemini 2.5 generated through the next ten models by other developers, asking them each to analyze the answer for factual consistency?

Could this be a way to arrive at a factual consistency for answers that is perhaps 99.9% or higher? Could this be done relatively inexpensively and completely automatically?

Below are ten top models that our AI agent would run Gemini 2.5's answer through, ranked according to their factual consistency rate score. I asked 2.5 to comment on the usefulness of this approach, and point out some possible flaws and limitations, and end this post with its answer. While the technique may be relatively expensive, in some cases the usefulness of the accuracy may justify the extra expense.

OpenAI-o3-mini-high-reasoning - 99.2%
Zhipu AI GLM-4-9B-Chat - 98.7%
Amazon Nova-Micro-V1 - 98.4%
Qwen2-72B-Instruct - 95.3%
Mixtral-8x22B-Instruct-v0.1 - 95.3%
Anthropic Claude-3-5-haiku - 95.1%
01-AI Yi-1.5-9B-Chat - 95.1%
Cohere Command-R - 95.1%
IBM Granite-3.0-8B-Instruct - 93.5%
Microsoft phi-2 - 93.3%

Gemini 2.5 Pro:

"Okay, here's a concise summary of the analysis:
Your proposal involves using Gemini 2.5 Pro to generate an answer and then having 10 other models (ranked high for factual consistency) verify it. The goal is automated, inexpensive, near-perfect (99.9%+) factual accuracy.
Potential & Feasibility:
* Improved Accuracy: Yes, this ensemble approach could significantly increase factual accuracy compared to using a single model by catching diverse errors. Reaching 99.9%+ is ambitious but directionally plausible.
* Automation: The process is fully automatable using APIs.
Key Challenges & Limitations:
* Cost & Speed: Running 11 models per query will be substantially more expensive and slower than using one. "Relatively inexpensive" is unlikely.
* Consensus Logic: Deciding how to handle disagreements among the 10 verifiers is complex. A simple majority vote might miss valid corrections or reinforce errors.
* Verifier Imperfection: The verification models themselves aren't perfect and can make mistakes (hallucinate) or fail to spot errors.
* Shared Errors: Models might share common misconceptions from similar training data, leading to false consensus.
* Complexity: Designing effective verification prompts and consensus mechanisms is non-trivial.
In Short:
The idea leverages model diversity for better accuracy and is automatable. However, it faces significant hurdles in cost, speed, and the complexity of managing verification results. While promising, it's not a simple or cheap solution for achieving near-perfect factual consistency."

Open Reddit thread

Amazon Nova Micro r/aws 2 upvotes 4 comments November 13, 2025

Can't Access Claude Sonnet 4.5 on AWS Bedrock (Channel Program Account)

Hey everyone,

I just ran into an issue trying to call **Claude Sonnet 4.5** via the **AWS Bedrock Runtime API**, and I’m hoping someone here might have insights or has faced the same thing.

**Setup:**

* **Account type:** Channel program account (via AWS Partner / Distributor)
* **Region:** `us-east-1`
* **API key:** Valid — works fine for `amazon.nova-micro-v1:0`
* **Model I’m calling:** `anthropic.claude-sonnet-4-5-20250929-v1:0`

Here’s the cURL command I used:

curl -X POST "https://bedrock-runtime.us-east-1.amazonaws.com/model/anthropic.claude-sonnet-4-5-20250929-v1:0/converse" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer <valid-token>" \
-d '{
"messages": [
{
"role": "user",
"content": [{"text": "Hello"}]
}
]
}'

And here’s the **error response** I got back:

{
"message": "Invocation of model ID anthropic.claude-sonnet-4-5-20250929-v1:0 with on-demand throughput isn't supported. Retry your request with the ID or ARN of an inference profile that contains this model."
}

After reaching out to AWS Support, I also got this message:

>

Has anyone here successfully accessed Claude Sonnet 4.5 under a channel program account, or know how to obtain the required inference profile ARN?

I seem i can't use any claude variant of models but I can use aws nova variant tho

Any clarification or workaround would be super appreciated 🙏

Here’s a slightly refined and Reddit-ready version of your post — same message, just cleaner formatting and tone so it reads smoothly and attracts good replies:

# [Help] Can't Access Claude Sonnet 4.5 on AWS Bedrock (Channel Program Account)

Hey everyone,

I just ran into an issue trying to call Claude Sonnet 4.5 via the AWS Bedrock Runtime API, and I’m hoping someone here might have insights or has faced the same thing.

Setup

* Account type: Channel program account (via AWS Partner / Distributor)
* Region: us-east-1
* API key: Valid — works fine for amazon.nova-micro-v1:0
* Model I’m calling: anthropic.claude-sonnet-4-5-20250929-v1:0

cURL command:

curl -X POST "https://bedrock-runtime.us-east-1.amazonaws.com/model/anthropic.claude-sonnet-4-5-20250929-v1:0/converse" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer <valid-token>" \
-d '{
"messages": [
{
"role": "user",
"content": [{"text": "Hello"}]
}
]
}'

Error response:

{
"message": "Invocation of model ID anthropic.claude-sonnet-4-5-20250929-v1:0 with on-demand throughput isn't supported. Retry your request with the ID or ARN of an inference profile that contains this model."
}

After reaching out to AWS Support, I got this message back:

>

It seems like I can’t use any Claude variant (Sonnet, Haiku, etc.), but I can use AWS Nova models just fine.

Has anyone here successfully accessed Claude Sonnet 4.5 under a channel program account, or know how to obtain the required inference profile ARN?

Any clarification or workaround would be super appreciated 🙏

Open Reddit thread

View more discussions →

AI tools related to Amazon Nova Lite vs Amazon Nova Micro

These tools are closely connected to one or both models in this comparison and can help you evaluate real-world fit.

Large Language Models (LLMs)

PartyRock

PartyRock is a playground powered by Amazon Bedrock that allows you to build AI-generated apps. It offers a fast, engaging way to explore generative AI, providing access to foundation models through an intuitive, code-free interface designed for learning prompt engineering and AI fundamentals.

Free 137 visits 1 saves

AI Assistant

SmythOS

SmythOS is a no-code platform that enables you to build and deploy AI agents without manual programming. By describing your requirements, the Agent Weaver tool automatically constructs agents using leading AI models and APIs. The platform supports integrations with services like OpenAI, Hugging Face, and Amazon Bedrock. SmythOS is recognized as a Gartner AI Category Leader and is utilized by organizations including the US Airforce and Unilever.

Free 73 visits

AI Image Generator

StoryBee

StoryBee is an AI-powered story generator designed to spark creativity and imagination in children. The platform enables users to create personalized children's stories, bedtime tales, and educational narratives in seconds by providing a simple hint or theme. It is built for parents, teachers, and young readers.

Free 21 visits 18 saves

AI API

aivideogenerator.me

aivideogenerator.me features the Luma AI video generator, an advanced tool from Luma Labs designed to create realistic videos from text prompts. Whether you need Luma AI drone shots or specific video styles, this generator offers versatile options for personal and professional projects. Unlike restricted models like Sora, the Luma AI video generator is publicly accessible, making it a practical choice for high-quality video creation. By using the Luma AI video generator free online, users can leverage Luma Labs' innovative AI technology to produce professional-grade content, positioning it as a leading, accessible alternative in the AI video market.

Free 113 visits 13 saves

Which model should you choose?

Use the summary below to decide which model better fits your workflow, budget, and feature requirements.

Best fit for

Amazon Nova Lite

Amazon Nova Lite is a stronger fit for tool-augmented workflows, multimodal applications, cost-efficient scale.

Best fit for

Amazon Nova Micro

Amazon Nova Micro is a stronger fit for tool-augmented workflows, cost-efficient scale, benchmark-led evaluation.

Verdict

Choose Amazon Nova Lite if you prioritize tool-augmented workflows, multimodal applications, cost-efficient scale. Choose Amazon Nova Micro if your workflow depends more on tool-augmented workflows, cost-efficient scale, benchmark-led evaluation.

FAQ

Common questions about Amazon Nova Lite vs Amazon Nova Micro

What is the main difference between Amazon Nova Lite and Amazon Nova Micro?

Amazon Nova Lite leans toward tool-augmented workflows, multimodal applications, cost-efficient scale, while Amazon Nova Micro is better suited to tool-augmented workflows, cost-efficient scale, benchmark-led evaluation.

Which model is cheaper: Amazon Nova Lite or Amazon Nova Micro?

Amazon Nova Micro starts lower on input pricing at $0.0400 per 1M input tokens, compared with $0.0600 for Amazon Nova Lite.

Which model has the larger context window: Amazon Nova Lite or Amazon Nova Micro?

Amazon Nova Lite is listed with a context window of 300,000, while Amazon Nova Micro is listed with 128,000.

How should I evaluate Amazon Nova Lite vs Amazon Nova Micro for my use case?

This comparison currently includes 7 shared benchmark rows, helping you compare practical performance across overlapping evaluations.

Amazon Nova Lite vs Amazon Nova Micro

Overview Comparison

Provider

Model ID

Input Context Window

Maximum Output Tokens

Open Source

Release Date

Knowledge Cut-off Date

API Providers

Modalities

Pricing Comparison

Capabilities Comparison

Benchmark Comparison

What Reddit discussions say about Amazon Nova Lite vs Amazon Nova Micro

AI tools related to Amazon Nova Lite vs Amazon Nova Micro

PartyRock

SmythOS

StoryBee

aivideogenerator.me

Which model should you choose?

Amazon Nova Lite

Amazon Nova Micro

Common questions about Amazon Nova Lite vs Amazon Nova Micro

Related comparisons