Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
Wan2.5-Preview arrives with a groundbreaking multimodal foundation that unifies understanding and generation across text, imagery, audio, and video. Its native multimodal design, trained jointly across diverse data sources, enables tighter modal alignment, smoother instruction execution, and highly coherent audio-visual output. Through reinforcement learning from human feedback, it continually adapts to aesthetic preferences, resulting in more natural visuals and fluid motion dynamics. Wan2.5 supports cinematic 1080p video generation with synchronized audio, including multi-speaker content, layered sound effects, and dynamic compositions. Creators can control outputs using text prompts, reference images, or audio cues, unlocking a new range of storytelling and production workflows. For still imagery, the model achieves photorealism, artistic versatility, and strong typography, plus professional-level chart and design rendering. Its editing tools allow users to perform conversational adjustments, merge concepts, recolor products, modify materials, and refine details at pixel precision. This preview marks a major leap toward fully integrated multimodal creativity powered by AI.
Description
Z-Image is a family of open-source image generation foundation models created by Alibaba's Tongyi-MAI team, utilizing a Scalable Single-Stream Diffusion Transformer architecture to produce both photorealistic and imaginative images from textual descriptions with only 6 billion parameters, which enhances its efficiency compared to many larger models while maintaining competitive quality and responsiveness to instructions. This model family comprises several variants, including Z-Image-Turbo, a distilled version designed for rapid inference that achieves results with as few as eight function evaluations and sub-second generation times on compatible GPUs; Z-Image, the comprehensive foundation model tailored for high-fidelity creative outputs and fine-tuning processes; Z-Image-Omni-Base, a flexible base checkpoint aimed at fostering community-driven advancements; and Z-Image-Edit, specifically optimized for image-to-image editing tasks while demonstrating strong adherence to instructions. Each variant of Z-Image serves distinct purposes, catering to a wide range of user needs within the realm of image generation.
API Access
Has API
API Access
Has API
Screenshots View All
No images available
Integrations
AIReel
APIFree
Eromify
Inspix AI
Oxen.ai
Piooy
Prism
Wan AI
ZenCreator
Integrations
AIReel
APIFree
Eromify
Inspix AI
Oxen.ai
Piooy
Prism
Wan AI
ZenCreator
Pricing Details
Free
Free Trial
Free Version
Pricing Details
Free
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Alibaba
Founded
1999
Country
China
Website
wan.video
Vendor Details
Company Name
Z-Image
Founded
1999
Country
China
Website
github.com/Tongyi-MAI/Z-Image