Multimodal Input
Processes image, video, and text inputs within a single request. Enables tasks like visual question answering and document analysis combining text and images.
Amazon Nova Lite is a multimodal foundation model developed by Amazon and made available through Amazon Bedrock. It accepts image, video, and text inputs and is designed to process them at low latency and low cost. The model was released in December 2024 as part of the Amazon Nova family, which includes three understanding models — Nova Micro, Nova Lite, and Nova Pro — and two creative content generation models. Nova Lite occupies the middle tier of the Nova understanding lineup, sitting between the text-only Nova Micro and the more capable Nova Pro. It supports a 300,000-token context window, making it suitable for tasks that involve long documents or extended conversations. The model also supports fine-tuning on Amazon Bedrock, allowing developers to adapt it for specific use cases. It is well-suited for applications that require multimodal input processing at scale where cost efficiency and speed are priorities.
High-signal model metadata in a structured two-column overview table.
The entity that provides this model.
The routed model identifier exposed by upstream providers.
The number of tokens supported by the input context window.
The number of tokens that can be generated by the model in a single request.
Whether the model's code is available for public use.
When the model was first released.
When the model's knowledge was last updated.
The providers that offer this model. This is not an exhaustive list.
Types of data this model can process.
A fuller summary of positioning, capabilities, and source-specific details for Amazon Nova Lite.
Amazon Nova Lite is a multimodal foundation model developed by Amazon and made available through Amazon Bedrock. It accepts image, video, and text inputs and is designed to process them at low latency and low cost. The model was released in December 2024 as part of the Amazon Nova family, which includes three understanding models — Nova Micro, Nova Lite, and Nova Pro — and two creative content generation models.
Nova Lite occupies the middle tier of the Nova understanding lineup, sitting between the text-only Nova Micro and the more capable Nova Pro. It supports a 300,000-token context window, making it suitable for tasks that involve long documents or extended conversations. The model also supports fine-tuning on Amazon Bedrock, allowing developers to adapt it for specific use cases. It is well-suited for applications that require multimodal input processing at scale where cost efficiency and speed are priorities.
Processes image, video, and text inputs within a single request. Enables tasks like visual question answering and document analysis combining text and images.
Supports up to 300,000 tokens of context per request. This allows processing of long documents, extended conversations, or multiple media inputs in one call.
Optimized for fast inference across multimodal inputs. Designed to return responses quickly even when handling image and video alongside text.
Accepts video as a direct input type for analysis and comprehension tasks. Enables use cases such as video summarization and content extraction.
Supports text and vision fine-tuning via Amazon Bedrock. Developers can customize the model to improve accuracy or reduce cost for specific tasks.
Designed to support agentic workflows and UI actuation. Can be used in multi-step task pipelines that require reasoning and action sequencing.
Priced at the lower end of the Nova model family for multimodal tasks. Intended for high-volume applications where per-token cost is a key constraint.
Primary API pricing shown in the same “quick compare” spirit as the reference page.
Additional usage-cost dimensions synced into the project for this model.
Places where this model is available, based on the synced detail-page metadata.
Endpoint-level provider data currently available for this model.
Benchmark scores synced from the current model source and normalized into the local catalog.
| Benchmark | Score |
|---|---|
|
AIME 2024
American math olympiad problems
|
|
|
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
|
|
|
HLE
Questions that challenge frontier models across many domains
|
|
|
LiveCodeBench
Real-world coding tasks from recent competitions
|
|
|
MATH-500
Undergraduate and competition-level math problems
|
|
|
MMLU-Pro
Expert knowledge across 14 academic disciplines
|
|
|
SciCode
Scientific research coding and numerical methods
|
Official model cards, release notes, docs, and other references synced from the source page.
Amazon Nova Lite discussions are most active in r/LLMDevs, r/BlackboxAI_, r/ClaudeAI. Top Reddit threads cluster around coding workflow discussions. The strongest match in this snapshot has 10 upvotes and 11 comments.
Amazon just launched Nova 2 Lite models on Bedrock.
Now, you can use those models directly with Claude Code, and set automatic preferences on when to invoke the model for specific coding scenarios. Sample config below. This way you can mix/match different models based on coding use cases. Details in the demo folder here: [https://github.com/katanemo/archgw/tree/main/demos/use\_cases/claude\_code\_router](https://github.com/katanemo/archgw/tree/main/demos/use_cases/claude_code_router)
if you think this is useful, then don't forget to the star the project 🙏
# Anthropic Models
- model: anthropic/claude-sonnet-4-5
access_key: $ANTHROPIC_API_KEY
routing_preferences:
- name: code understanding
description: understand and explain existing code snippets, functions, or libraries
- model: amazon_bedrock/us.amazon.nova-2-lite-v1:0
default: true
access_key: $AWS_BEARER_TOKEN_BEDROCK
base_url: https://bedrock-runtime.us-west-2.amazonaws.com
routing_preferences:
- name: code generation
description: generating new code snippets, functions, or boilerplate based on user prompts or requirements
- model: anthropic/claude-haiku-4-5
access_key: $ANTHROPIC_API_KEY
None of the MS models seem to be working for me. I get an error like:
`[API Error: 404 litellm.NotFoundError: NotFoundError: OpenrouterException - {"error":{"message":"No endpoints found that support tool use. To learn more about provider routing, visit:`
`https://openrouter.ai/docs/guides/routing/provider-selection","code":404}}. Received Model Group=blackboxai/microsoft/phi-4Available Model Group Fallbacks=None]`
Separately, the amazon/nova-lite-v1 model is s\*\*t... Offers vague recommendations and no specific fix for any code.
Amazon just launched Nova 2 Lite models on Bedrock.
Now, you can use those models directly with Claude Code, and set automatic preferences on when to invoke the model for specific coding scenarios. Sample config below. This way you can mix/match different models based on coding use cases. Details in the demo folder here: [https://github.com/katanemo/archgw/tree/main/demos/use\_cases/claude\_code\_router](https://github.com/katanemo/archgw/tree/main/demos/use_cases/claude_code_router)
# Anthropic Models
- model: anthropic/claude-sonnet-4-5
access_key: $ANTHROPIC_API_KEY
routing_preferences:
- name: code understanding
description: understand and explain existing code snippets, functions, or libraries
- model: amazon_bedrock/us.amazon.nova-2-lite-v1:0
default: true
access_key: $AWS_BEARER_TOKEN_BEDROCK
base_url: https://bedrock-runtime.us-west-2.amazonaws.com
routing_preferences:
- name: code generation
description: generating new code snippets, functions, or boilerplate based on user prompts or requirements
- model: anthropic/claude-haiku-4-5
access_key: $ANTHROPIC_API_KEY
if you think this is useful, then don't forget to the star the project 🙏
Amazon just launched Nova 2 Lite models on Bedrock.
Now, you can use those models directly with Claude Code, and set automatic preferences on when to invoke the model for specific coding scenarios. Sample config below. This way you can mix/match different models based on coding use cases. Details in the demo folder here: [https://github.com/katanemo/archgw/tree/main/demos/use\_cases/claude\_code\_router](https://github.com/katanemo/archgw/tree/main/demos/use_cases/claude_code_router)
if you think this is useful, then don't forget to the star the project 🙏
# Anthropic Models
- model: anthropic/claude-sonnet-4-5
access_key: $ANTHROPIC_API_KEY
routing_preferences:
- name: code understanding
description: understand and explain existing code snippets, functions, or libraries
- model: amazon_bedrock/us.amazon.nova-2-lite-v1:0
default: true
access_key: $AWS_BEARER_TOKEN_BEDROCK
base_url: https://bedrock-runtime.us-west-2.amazonaws.com
routing_preferences:
- name: code generation
description: generating new code snippets, functions, or boilerplate based on user prompts or requirements
- model: anthropic/claude-haiku-4-5
access_key: $ANTHROPIC_API_KEY
I’ve been working on integrating Amazon Bedrock’s Nova Lite model into my application and have encountered an issue. According to the documentation, cross-region inference should allow my Lambda function running in us-west-2 to access amazon.nova-lite-v1:0, but my API calls are failing with the following error:
jsonCopy code{
"statusCode": 500,
"body": {
"error": "Bedrock API request failed: An error occurred (ValidationException) when calling the InvokeModel operation: Invocation of model ID amazon.nova-lite-v1:0 with on-demand throughput isn’t supported. Retry your request with the ID or ARN of an inference profile that contains this model."
}
}
From my understanding, Nova Lite is supposed to support both on-demand and provisioned throughput according to the user guide. However, this error makes it seem like provisioned throughput is a mandatory requirement for this model.
I've already:
* Verified that my Lambda function and Bedrock client are correctly configured for us-west-2.
* Ensured that the IAM role has the necessary permissions (bedrock:InvokeModel, etc.).
* Checked the documentation, but it doesn’t explicitly mention that provisioned throughput is required for basic inference with Nova Lite.
Can anyone confirm if provisioned throughput is a hard requirement for Nova Lite, or am I missing a key configuration step? Any insights would be greatly appreciated!
Amazon Nova Lite supports a context window of 300,000 tokens, allowing it to handle long documents, extended conversations, or multiple media inputs within a single request.
Amazon Nova Lite is a multimodal model that accepts image, video, and text inputs. It is distinct from Nova Micro, which is text-only.
Amazon Nova Lite was released in December 2024 as part of the Amazon Nova model family on Amazon Bedrock.
Yes. Amazon Nova Lite supports text and vision fine-tuning through Amazon Bedrock, allowing developers to customize the model for specific use cases.
Nova Lite is one of three understanding models in the Nova family. Nova Micro is text-only with the lowest latency, Nova Lite adds multimodal support at low cost, and Nova Pro offers the highest capability across accuracy, speed, and cost for complex tasks.
Continue browsing adjacent models from the same provider.