Wan 2.6

Wan 2.6 is a multimodal AI generation model developed by Alibaba Cloud and released in December 2025. It uses a Mixture-of-Experts architecture with 14 billion total parameters, activating roughly 20% of them during inference. The model supports text-to-video, image-to-video, reference-to-video, and image generation modes, and accepts prompts in both English and Chinese. Video outputs can reach up to 15 seconds at 1080p resolution and 24 frames per second.

What distinguishes Wan 2.6 from many generation models is its native audio output: synchronized dialogue, sound effects, and lip-sync are generated alongside video without requiring separate post-production tools. The model also supports multi-shot storytelling from a single prompt, maintaining character consistency across scenes with automatic camera transitions. It is well suited for content creators, marketers, and developers who need high-fidelity video and image output, particularly those aiming to produce publish-ready content with minimal manual editing.

Released: December 2025. Context: 2,000 tokens. Output: N/A.

Text-to-Video, Native Audio Sync, Image-to-Video, Image Generation, Multi-Shot Storytelling, Reference-to-Video

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

Wan

Input Context Window

The number of tokens supported by the input context window.

2,000 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

N/A

Open Source

Whether the model's code is available for public use.

Release Date

When the model was first released.

December 2025

Knowledge Cut-off Date

When the model's knowledge was last updated.

December 2025

API Providers

The providers that offer this model. This is not an exhaustive list.

Wan

Modalities

Types of data this model can process.

Image, Text, Video, Audio

Capabilities

What Wan 2.6 supports

Text-to-Video

Generates video clips from text prompts at up to 1080p resolution and 24 fps, with clips reaching up to 15 seconds in length.

Native Audio Sync

Produces synchronized audio — including dialogue, sound effects, and lip-sync — directly alongside generated video without external dubbing tools.

Image-to-Video

Animates a source image into a video clip while preserving the subject's appearance and style from the input reference.

Image Generation

Supports text-to-image, image-to-image transformation, and image editing at resolutions up to 2048×2048 pixels.

Multi-Shot Storytelling

A single prompt can produce multi-scene narratives with automatic camera transitions and consistent characters across shots.

Reference-to-Video

Accepts uploaded reference images or video to maintain subject appearance, style, and motion consistency across generated outputs.

Prompt Expansion

Optional AI-powered prompt expansion enriches short or simple text inputs to improve output quality and detail.

Seed Control

Accepts a seed value as input, allowing reproducible generation results for iterative creative workflows.
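
To illustrate why a fixed seed gives repeatable results, here is a minimal Python sketch. It uses the standard library's `random.Random` as a stand-in for a sampler. The function name `sample_noise` is hypothetical, and nothing here reflects how Wan 2.6 is implemented internally.

```python
import random
from typing import List


def sample_noise(seed: int, n: int = 4) -> List[float]:
    """Draw n Gaussian values from a generator initialised with `seed`.

    Illustrative stand-in only: a real video model samples far larger
    noise tensors, but the reproducibility principle is the same.
    """
    rng = random.Random(seed)  # private generator, isolated from global state
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


first_run = sample_noise(seed=42)
second_run = sample_noise(seed=42)
other_seed = sample_noise(seed=43)

print(first_run == second_run)  # same seed, same values
print(first_run == other_seed)  # different seed, different values
```

Reusing a seed while changing only the prompt is a common way to compare prompt variations against an otherwise fixed starting point.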

Pricing for Wan 2.6

Primary API pricing for quick comparison.

Input tokens: N/A (per million tokens)

Output tokens: N/A (per million tokens)

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

Wan

Configuration & Parameters

The configurable options currently documented for this model.

Width (Number). Default: 1024. Range: 768 to 1440.

Height (Number). Default: 1024. Range: 768 to 1440.

Negative Prompt (Text). Description of what to exclude from the video.

Seed. A specific value that is used to guide the 'randomness' of the generation.
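
The documented options could be checked on the client side before a request is sent. The Python sketch below is one way to do that, under stated assumptions: the class name `GenerationParams`, the `to_payload` method, and the snake_case keys are all hypothetical, and only the defaults and ranges come from this page.

```python
from dataclasses import dataclass
from typing import Optional

# Documented on this page: width and height default to 1024, range 768 to 1440.
DIM_MIN, DIM_MAX, DIM_DEFAULT = 768, 1440, 1024


@dataclass
class GenerationParams:
    """Hypothetical container for the four documented parameters."""

    width: int = DIM_DEFAULT
    height: int = DIM_DEFAULT
    negative_prompt: str = ""
    seed: Optional[int] = None  # None leaves the choice to the service

    def __post_init__(self) -> None:
        # Reject dimensions outside the documented range.
        for name in ("width", "height"):
            value = getattr(self, name)
            if not DIM_MIN <= value <= DIM_MAX:
                raise ValueError(
                    f"{name}={value} is outside the documented range "
                    f"{DIM_MIN}-{DIM_MAX}"
                )

    def to_payload(self) -> dict:
        """Build a dict with assumed key names, omitting unset options."""
        payload = {"width": self.width, "height": self.height}
        if self.negative_prompt:
            payload["negative_prompt"] = self.negative_prompt
        if self.seed is not None:
            payload["seed"] = self.seed
        return payload


params = GenerationParams(width=1280, height=768, negative_prompt="blurry", seed=42)
print(params.to_payload())
```

Validating locally avoids spending a request on values the service would reject; the actual key names and request format should be taken from the provider's API documentation.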

Supported Request Parameters

Parameters currently listed by OpenRouter or the local catalog for this model.

Width, Height, Negative Prompt, Seed

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Official Announcement Blog Post (Announcements)

Model Overview & Capabilities Guide (Documentation)

Wan 2.6 Demo & Feature Showcase (Playground)

Wan Official Website (Other)

AI tools related to Wan 2.6

These tools are strongly connected to Wan 2.6 through direct product references, provider mentions, or explicit model mappings.

AI Image Generator

Wannafake

Wannafake is a web-based tool for swapping faces in videos using a single reference photo. It operates on a pay-as-you-go model without subscriptions, allowing users to purchase seconds of processing time to use at their convenience. The platform includes integrated video clipping, enabling users to trim footage and pay only for the processed segments. Additionally, it supports parallel processing, allowing users to generate multiple videos simultaneously.

Free 0 visits 7 saves

AI Agent

Wanderboat

Wanderboat is an AI-driven travel platform that identifies and organizes top points of interest using videos, images, and expert insights. Users can ask questions via chat, documents, or maps to receive personalized recommendations for everything from local cuisine to photography spots. Additionally, the platform features an AI travel planner to assist with itinerary building, trip logistics, accommodation searches, and travel tips.

Free 12 visits 5 saves

AI API

iWAND

iWAND is an AI-powered fashion stylist designed for e-commerce, specifically for Shopify stores. By leveraging advanced AI models to curate style feeds and assistant-led discovery, it functions as a virtual stylist to personalize the shopping experience, increase customer satisfaction, and drive sales.

Free 0 visits 3 saves

Other

Wander

Wander is a platform that connects travelers and tourists to help them find companions for their adventures. Users can create profiles, post upcoming trips for others to join, or request to join existing travel plans. The platform aims to foster a global community where individuals can connect and share experiences, whether they are solo backpackers, traveling with friends, or exploring their local area.

Free 0 visits 3 saves

Community discussion

What people think about Wan 2.6

Wan 2.6 discussions are most active in r/HiggsfieldAI, r/comfyui, and r/IndianArtAI. The threads in this snapshot center on censorship and on whether the model's weights will be released openly. The strongest match, from r/singularity, has 225 upvotes and 78 comments.

r/HiggsfieldAI 7 upvotes 51 comments December 17, 2025

Is WAN 2.6 still uncensored on Higgsfield?

WAN 2.5 was uncensored on higgsfield, can anyone tell me if WAN 2.6 is still uncensored? Higgsfield no longer gives enough credits to run off even ONE test video.

The "official" WAN 2.5/2.6 page is heavily censored making it useless. Try to describe a G rated scene where people are sunbathing by a pool and it will probably be blocked.

r/comfyui 134 upvotes 183 comments December 16, 2025

WAN 2.6 has been released, but it's a commercial version. Does this mean the era of open-source WAN models is over?

Although WAN2.2's performance is already very close to industrial production capabilities, who wouldn't want to see an even better open-source model emerge? Will there be open-source successors to the WAN series?

r/IndianArtAI 23 upvotes 5 comments December 24, 2025

tried wan 2.6

hey guys i tried using wan 2.6. multi-cut 15s 1080p it looks quite great lip-sync is what just seems off to me. give you opinions

r/singularity 225 upvotes 78 comments December 16, 2025

Alibaba just dropped "Wan 2.6" (Sora Rival) on API platforms ahead of tomorrow's official event. Features 1080p, Native Audio Sync and 15s clips.

While the official launch event is scheduled for tomorrow (Dec 17), the model has just gone live on partner platforms like **Fal.ai and Replicate** and the results are stunning.

**The Key Specs:**

**Resolution:** 1080p at 24fps.

**Audio:** Features **built-in** lip-sync and native audio generation(See the cat drumming in the video; it’s generated with the video, not added later).

**Duration:** Up to 15 seconds and **Capabilities:** Text to Video, Image to Video and Video to Video.

**The "Open Source" Question:** Previous versions (Wan 2.1) were open-weights, **but right now,** Wan 2.6 is only available via commercial APIs.

The community is **debating** whether Alibaba will drop the weights at tomorrow's event or if the "Open Source Era" for **SOTA** video models is closing.

**Do you think Alibaba will open-source this tomorrow to undercut Sora/Runway, or are they pivoting to a closed API model?**

**Source: Wan Ai(Official site)**

🔗: https://www.wan-ai.co/wan-2-6

r/StableDiffusion 4 upvotes 41 comments January 27, 2026

I downloaded comfyui from the website and i'm confused, what the hell are Wan 2.6 and Kling 2.6 workflows, those models don't exist don't they? Is this the right comfyui?


FAQ

Common questions about Wan 2.6

What is the context window for Wan 2.6?

Wan 2.6 has a context window of 2,000 tokens, which applies to text prompt inputs.

What input types does Wan 2.6 accept?

The model accepts image URL arrays, numeric values (such as width and height dimensions), text prompts, and a seed value for reproducible outputs.

What is the training data cutoff for Wan 2.6?

According to the available metadata, the knowledge cut-off date for Wan 2.6 is December 2025, the same month as its release.

What video resolution and length does Wan 2.6 support?

Wan 2.6 can generate video at up to 1080p resolution and 24 frames per second, with clips up to 15 seconds long.
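
As a rough worked example of what those limits imply, the Python sketch below computes the frame count of a maximum-length clip. It assumes that 1080p means a 1920×1080 frame (the usual 16:9 Full HD size), which this page does not state.

```python
MAX_SECONDS = 15   # from the page: clips up to 15 seconds
FPS = 24           # from the page: 24 frames per second

# Assumption: "1080p" is taken to mean 1920x1080 pixels (16:9 Full HD).
WIDTH_1080P, HEIGHT_1080P = 1920, 1080

frames = MAX_SECONDS * FPS
pixels_per_frame = WIDTH_1080P * HEIGHT_1080P
total_pixels = frames * pixels_per_frame

print(f"frames in a maximum-length clip: {frames}")
print(f"pixels per frame: {pixels_per_frame:,}")
print(f"pixels across the whole clip: {total_pixels:,}")
```

Under these assumptions a maximum-length clip contains 360 frames.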

Does Wan 2.6 support languages other than English?

Yes, Wan 2.6 accepts prompts in both English and Chinese.

What architecture does Wan 2.6 use?

Wan 2.6 uses a Mixture-of-Experts (MoE) architecture with 14 billion total parameters, activating approximately 20% of them during each generation pass for improved inference speed.
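
The arithmetic behind that figure can be sketched in a few lines of Python, taking the page's 14 billion and 20% values at face value. The function name `active_parameters` is illustrative only, and the result is an approximation because the page gives the fraction as approximate.

```python
TOTAL_PARAMS = 14_000_000_000  # from the page: 14 billion total parameters
ACTIVE_FRACTION = 0.20         # from the page: approximately 20% active


def active_parameters(total: int, fraction: float) -> int:
    """Return the approximate number of parameters used per generation pass."""
    return round(total * fraction)


active = active_parameters(TOTAL_PARAMS, ACTIVE_FRACTION)
print(f"about {active / 1e9:.1f}B of {TOTAL_PARAMS / 1e9:.0f}B parameters active")
```

In other words, roughly 2.8 billion parameters would participate in each pass, which is where the inference-speed benefit of a Mixture-of-Experts design comes from.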
