Wan

Wan 2.6

Wan 2.6 is a video generation model developed by Alibaba that produces 1080p video at 24 frames per second for clips up to 15 seconds in length. It accepts text, image, or video as input and generates complete video output — including synchronized audio, dialogue, sound effects, and lip movements — in a single generation pass, without requiring a separate audio pipeline. The model was trained with a cutoff of December 2025 and is available as an open-source release. Wan 2.6 is designed for creators, marketers, and developers who need publish-ready video content without extensive post-production work. Its distinguishing features include multi-shot narrative handling across a single clip, character consistency when using reference figures, physics simulation for realistic motion, and style transfer from reference videos. These capabilities make it suited for use cases such as social media content, product demonstrations, commercials, and short narrative sequences.

December 2025 1,000 context N/A output
Native Audio Generation Image-to-Video Multi-Shot Narratives Character Consistency Physics Simulation Video Style Transfer

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

Wan

Input Context Window

The number of tokens supported by the input context window.

1,000 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

N/A tokens

Open Source

Whether the model's code is available for public use.

No

Release Date

When the model was first released.

December 2025

Knowledge Cut-off Date

When the model's knowledge was last updated.

December 2025

API Providers

The providers that offer this model. This is not an exhaustive list.

Wan

Modalities

Types of data this model can process.

Video Text Image Audio

What is Wan 2.6

A fuller summary of positioning, capabilities, and source-specific details for Wan 2.6.

Wan 2.6 is a video generation model developed by Alibaba that produces 1080p video at 24 frames per second for clips up to 15 seconds in length. It accepts text, image, or video as input and generates complete video output — including synchronized audio, dialogue, sound effects, and lip movements — in a single generation pass, without requiring a separate audio pipeline. The model was trained with a cutoff of December 2025 and is available as an open-source release.

Wan 2.6 is designed for creators, marketers, and developers who need publish-ready video content without extensive post-production work. Its distinguishing features include multi-shot narrative handling across a single clip, character consistency when using reference figures, physics simulation for realistic motion, and style transfer from reference videos. These capabilities make it suited for use cases such as social media content, product demonstrations, commercials, and short narrative sequences.

Capabilities

What Wan 2.6 supports

AUD

Native Audio Generation

Generates synchronized audio — including dialogue, sound effects, and lip movements — alongside video in a single pass, eliminating the need for separate dubbing tools.

IMG

Image-to-Video

Accepts a source image as input and animates it into a 1080p video clip up to 15 seconds long.

AI

Multi-Shot Narratives

Handles camera transitions and scene segmentation across a single 15-second clip based on a text description of the full scene.

AI

Character Consistency

Places reference figures into generated scenes while maintaining consistent appearance, voice, and interaction throughout the clip.

AI

Physics Simulation

Renders gravity, fluid dynamics, and complex object interactions to produce realistic motion in action and product shots.

VID

Video Style Transfer

Locks onto motion from a reference video so the performance is preserved while the visual environment is replaced.

AI

Seed Control

Accepts a seed value as input to enable reproducible generation outputs across multiple runs.

AI

Text Rendering

Supports rendering legible text within generated video frames, useful for graphics, titles, and on-screen labels.

Pricing for Wan 2.6

Primary API pricing shown in the same “quick compare” spirit as the reference page.

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

Wan

Configuration & Parameters

The configurable options currently documented for this model.

Resolution

Select
Default: 720p
1080p 720p

Duration

Select
Default: 5
5 seconds 10 seconds 15 seconds

Shot Type

Toggle Group
Default: single

Negative Prompt

Text

Description of what to exclude from the video.

Seed

Seed

A specific value that is used to guide the 'randomness' of the generation.

Supported Request Parameters

Parameters currently listed by OpenRouter or the local catalog for this model.

Resolution Duration Shot Type Negative Prompt Seed

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Community discussion

What people think about Wan 2.6

Wan 2.6 discussions are most active in r/HiggsfieldAI, r/comfyui, r/IndianArtAI. Top Reddit threads cluster around safety and censorship questions. The strongest match in this snapshot has 228 upvotes and 78 comments.

r/HiggsfieldAI 9 upvotes 51 comments December 17, 2025
Is WAN 2.6 still uncensored on Higgsfield?

WAN 2.5 was uncensored on higgsfield, can anyone tell me if WAN 2.6 is still uncensored? Higgsfield no longer gives enough credits to run off even ONE test video.

The "official" WAN 2.5/2.6 page is heavily censored making it useless. Try to describe a G rated scene where people are sunbathing by a pool and it will probably be blocked.

Open Reddit thread
r/IndianArtAI 24 upvotes 5 comments December 24, 2025
tried wan 2.6

hey guys i tried using wan 2.6. multi-cut 15s 1080p it looks quite great lip-sync is what just seems off to me. give you opinions

Open Reddit thread

While the official launch event is scheduled for tomorrow (Dec 17), the model has just gone live on partner platforms like **Fal.ai and Replicate** and the results are stunning.

**The Key Specs:**

**Resolution:** 1080p at 24fps.

**Audio:** Features **built-in** lip-sync and native audio generation(See the cat drumming in the video; it’s generated with the video, not added later).

**Duration:** Up to 15 seconds and **Capabilities:** Text to Video, Image to Video and Video to Video.

**The "Open Source" Question:** Previous versions (Wan 2.1) were open-weights, **but right now,** Wan 2.6 is only available via commercial APIs.

The community is **debating** whether Alibaba will drop the weights at tomorrow's event or if the "Open Source Era" for **SOTA** video models is closing.

**Do you think Alibaba will open-source this tomorrow to undercut Sora/Runway, or are they pivoting to a closed API model?**

**Source: Wan Ai(Official site)**

🔗: https://www.wan-ai.co/wan-2-6

Open Reddit thread
View more discussions →
FAQ

Common questions about Wan 2.6

What is the context window for Wan 2.6?

Wan 2.6 has a context window of 1,000 tokens, which applies to the text prompt input used to guide video generation.

What is the maximum video length and resolution Wan 2.6 can produce?

Wan 2.6 generates video at up to 1080p resolution and 24 frames per second, with a maximum clip length of 15 seconds.

Does Wan 2.6 require a separate tool to add audio to generated videos?

No. Wan 2.6 generates native audio — including synchronized dialogue, sound effects, and lip movements — as part of the same generation pass that produces the video.

What input types does Wan 2.6 accept?

Wan 2.6 accepts text prompts, image URLs, selection inputs (such as style or mode options), toggle group settings, and a seed value for reproducibility.

What is the training data cutoff for Wan 2.6?

The model's training data has a cutoff of December 2025.

Is Wan 2.6 open source?

Yes. Wan 2.6 is released as an open-source model by Alibaba.

More models from Wan

Continue browsing adjacent models from the same provider.

← All AI Models