Create Stunning Videos & Images with Wan AI
Wan 2.7 is here. The Image model generates lifelike faces, precise color palettes, and 3K-token text — #1 in blind tests. The Video model adds native audio, first/last frame control, 9-grid input, and instruction editing at 1080p. Wan 2.1 remains open-source with no content filters.
Wan 2.7-Image
Unified Image Generation & Editing
Wan 2.7-Image brings lifelike portrait generation, precise color control, and ultra-long text rendering. It ranks #1 in human preference blind tests in China.
Thousand Unique Faces
Break free from AI 'standard faces'. Customize bone structure, eye shape (almond, deep-set, phoenix), face shape (oval, round, square), and fine facial details for truly unique, lifelike portraits.
Color Palette Control
Extract color distributions from reference images in one click. Reproduce master painting palettes or align with brand color guidelines while maintaining composition integrity.
3K Token Text Rendering
Render up to 3,000 tokens of text across 12 languages with print-level clarity. Generate complex tables, mathematical formulas, and full A4 pages of academic content.
Interactive Click-to-Edit
Point and edit — simply select an area to add, align, or move elements with pixel-level precision. Native interactive editing module for intuitive image manipulation.
Multi-Subject Consistency
Support up to 9 reference images for consistent character and style across storyboards, e-commerce photo sets, movie posters, and multi-angle architectural views.
Batch Generation (12 Images)
Generate up to 12 images at once for creating consistent series, PPT illustrations, storyboard sequences, e-commerce model photos, and multi-view renders.
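The limits quoted above (up to 9 reference images, up to 12 images per batch) can be checked on the client before a request is sent. The sketch below is illustrative only: `ImageBatchRequest` and its fields are hypothetical names chosen for this example, not part of any published Wan API.

```python
from dataclasses import dataclass, field

# Limits quoted on this page; the class and field names are hypothetical.
MAX_REFERENCE_IMAGES = 9
MAX_BATCH_SIZE = 12


@dataclass
class ImageBatchRequest:
    prompt: str
    batch_size: int = 1
    reference_images: list[str] = field(default_factory=list)

    def validate(self) -> list[str]:
        """Return a list of problems; an empty list means the request fits the limits."""
        problems = []
        if not self.prompt.strip():
            problems.append("prompt is empty")
        if not 1 <= self.batch_size <= MAX_BATCH_SIZE:
            problems.append(f"batch_size must be between 1 and {MAX_BATCH_SIZE}")
        if len(self.reference_images) > MAX_REFERENCE_IMAGES:
            problems.append(f"at most {MAX_REFERENCE_IMAGES} reference images are allowed")
        return problems
```

Returning a list of problems rather than raising on the first one lets a form show every issue at once.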
Unified Generation + Understanding Architecture
Wan 2.7-Image uses a unified architecture for generation and understanding. Both tasks map into a shared latent space, which moves the model from pixel fitting toward deeper semantic understanding. A larger-scale variant, Wan 2.7-Image-pro, is also available.
Powerful Features
Everything you need to create professional AI videos & images
Text to Video
Transform your text descriptions into stunning, high-quality videos with advanced AI understanding. Support for Chinese, English, Japanese, and more.
Image to Video
Bring your static images to life with natural motion and cinematic effects. Wan 2.7 supports 9-grid multi-image to video for richer scene composition.
Motion Control
Precise camera movement and object trajectory control for professional results. Pan, zoom, rotate with cinematic precision.
High Resolution
Generate videos up to 1080p at 24fps and images with print-level clarity. Wan 2.7 scores 90%+ on VBench and ranks #1 in human preference blind tests.
Multi-Language
Native support for prompts in English, Chinese, Japanese, Korean, and German with excellent semantic understanding.
Open Source
Wan 2.1 is fully open-source under Apache 2.0. Run it locally on consumer GPUs (8GB+ VRAM) or use the cloud API for newer versions.
AI Image Generation
Wan 2.7-Image generates and edits images with a unified model. Supports face customization, color palette control, 3K token text rendering, and batch generation of up to 12 images.
Uncensored / NSFW
Wan 2.1 runs locally on your GPU with zero content filters. Full control over generation — no restrictions, no censorship. Community fine-tuned versions available on GitHub and Hugging Face.
State-of-the-Art Architecture
Built on cutting-edge Diffusion Transformer technology with MoE (Mixture of Experts) and native multimodal capabilities, achieving top-tier performance in VBench benchmarks.
Diffusion Transformer (DiT)
Advanced transformer-based diffusion model enabling superior temporal coherence and complex motion understanding for realistic video generation.
Causal 3D VAE (Wan-VAE)
Efficient spatiotemporal compression at a 4×8×8 ratio (time × height × width), supporting encoding of arbitrary-length 1080p video while preserving precise temporal information.
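To make the 4×8×8 ratio concrete, here is a minimal sketch of the latent grid size for a given clip. It assumes the ratio applies as time × height × width, and that the first frame is encoded on its own, as is common in causal 3D VAEs; `latent_shape` is a name made up for this example.

```python
def latent_shape(frames: int, height: int, width: int) -> tuple[int, int, int]:
    """Latent (frames, height, width) under 4x8x8 time/height/width compression.

    Assumption: the first frame is encoded on its own (causal design), so
    `frames` should have the form 4k + 1.
    """
    if frames < 1 or (frames - 1) % 4 != 0:
        raise ValueError("frames must be of the form 4k + 1")
    if height % 8 or width % 8:
        raise ValueError("height and width must be multiples of 8")
    return 1 + (frames - 1) // 4, height // 8, width // 8
```

Under these assumptions, an 81-frame 1920×1080 clip maps to a 21 × 135 × 240 latent grid.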
Mixture of Experts (MoE)
27B total parameters with 14B active per step, reducing computation by ~50% while improving complex scene generation and multi-character interactions.
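The ~50% figure follows from the parameter counts if per-step compute scales roughly with the number of active parameters and the baseline is a dense 27B model. Both are simplifying assumptions for this back-of-the-envelope sketch, not published measurements.

```python
def active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Share of parameters used per step (both arguments in billions)."""
    return active_params_b / total_params_b


fraction = active_fraction(27, 14)
saving = 1 - fraction  # relative to a dense model of the same total size
print(f"active: {fraction:.1%}, estimated saving: {saving:.1%}")
# prints "active: 51.9%, estimated saving: 48.1%"
```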
Native Multimodal
Unified architecture for text, image, video, and audio processing. Native lip-sync support with precise mouth movement matching to speech.
Model Specifications Comparison
| Metric | Wan2.1 | Wan2.2 | Wan2.6 | Wan2.7 |
|---|---|---|---|---|
| Max Resolution | 720P | 720P | 1080P | 1080P+ |
| Max Duration | 5s | 5s | 15s | 2-15s |
| Frame Rate | 24fps | 24fps | 24fps | 24fps |
| Parameters | 14B | 27B | 27B+ | 27B+ |
| VBench Score | 86.22% | 87.5% | 89%+ | 90%+ |
| Image Generation | - | - | Basic | Unified Gen+Edit |
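The duration and frame-rate rows together determine how many frames a clip contains. A minimal sketch, assuming a constant frame rate and exactly duration × fps frames per clip (`frame_count` is a hypothetical helper):

```python
def frame_count(duration_s: float, fps: int = 24) -> int:
    """Number of frames in a clip of the given length at a constant frame rate."""
    if duration_s <= 0 or fps <= 0:
        raise ValueError("duration and fps must be positive")
    return round(duration_s * fps)
```

At 24fps, a 5-second clip has 120 frames and a 15-second clip has 360.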
Evolution of Wan Series
Continuous innovation from Wan 2.1 to Wan 2.7. Both the Image model (unified generation & editing) and Video model (1080p, native audio, 15s) are now available.
Wan 2.1
- 14B parameter flagship model
- VBench score 86.22% (Global #1)
- Chinese/English text effects
- Consumer GPU support (8GB+ VRAM)
Wan 2.2
- MoE architecture (27B total)
- 60+ cinematic parameters
- Character replacement tech
- 50% computation savings
Wan 2.5
- Native multimodal architecture
- Audio-visual synchronization
- 10-second generation
- Photo singing & dancing
Wan 2.6
- 15-second video (China's longest)
- Multi-shot narrative system
- Role-playing & voice cloning
- Full lip-sync support
Wan 2.7
- Wan 2.7-Image: unified gen+edit model
- Video: 1080p 15s with native audio
- First/last frame, 9-grid, instruction edit
- Open-source release expected mid-to-late Q2 2026
Unlimited Creative Possibilities
From personal creativity to professional production, Wan AI empowers creators across all industries.
Short Video Creation
Create engaging short-form content for TikTok, YouTube Shorts, Instagram Reels. Generate creative videos from simple text prompts.
Advertising & Marketing
Produce professional product demos, brand commercials, and marketing materials at a fraction of traditional costs.
Film & Animation
Generate concept videos, storyboard previews, and animated sequences for film pre-production and indie projects.
Education & Training
Create educational content with physics simulations, process demonstrations, and interactive learning materials.
Digital Human & Avatar
Generate realistic digital humans for news broadcasting, virtual assistants, and interactive entertainment.
Gaming & Entertainment
Create game trailers, cutscenes, character animations, and promotional content for the gaming industry.
Video Gallery
See what others have created with Wan AI
Sexy woman dancing in lingerie
Beautiful woman running on beach in bikini
Mysterious woman holding a glowing magic orb
Majestic waterfall in tropical rainforest
Neon-lit cyberpunk city at night
Elegant woman in white dress walking in garden
Colorful ink flowing in water abstract art
Aurora borealis dancing over snowy mountains
Female warrior wielding a sword
Astronaut floating in outer space
Graceful woman dancing elegantly
Purple fluid abstract art
Ocean waves crashing on rocky shore
Mysterious witch performing magic ritual
Wan vs Competition
Comprehensive comparison: Wan 2.7 vs SeedDance 2.0, Sora 2, Kling 3.0, Veo 3.1, and Runway Gen-4.5. Based on April 2026 data. Wan 2.7-Image ranks #1 in human preference.
| Metrics | Wan 2.7 (Recommended) | SeedDance 2.0 | Sora 2 | Kling 3.0 | Veo 3.1 | Gen-4.5 |
|---|---|---|---|---|---|---|
| Max Duration | 2-15s | 15s | 25s | 10s | 10s | 10s |
| Resolution | 1080p | 1080p | 1080p | 4K/60fps | 1080p | 1080p |
| Open Source | Wan 2.1 is open source | | | | | |
| Real Person Input | ||||||
| Video Reference Clips | 5 | 1 | — | 1 | — | 1 |
| Free to Use | ||||||
| Instruction Editing | ||||||
| Lip Sync | ★★★★★ | ★★★★☆ | ★★★★☆ | ★★★★☆ | ★★★★☆ | ★★★☆☆ |
| Style Consistency | ★★★★★ | ★★★★☆ | ★★★★☆ | ★★★★☆ | ★★★★☆ | ★★★☆☆ |
| Cost | Free | $ | $$$ | $ | $$$$ | $$ |
Ready to Create Amazing Videos?
Join thousands of creators using Wan AI to bring their ideas to life. It is free to use, and Wan 2.1 is open-source.
$1 FREE Credit
25% Cashback
50 Free Generations
No credit card required
10M+ Videos
500K+ Users
99.9% Uptime
24/7 Support