The wait is finally over! Alibaba has officially launched VACE AI, their all-in-one video creation and editing tool. First introduced on March 11, 2025, Alibaba VACE is now fully available with AI models and code to the public. This Alibaba VACE release gives everyone access to powerful video manipulation capabilities. Whether you’re a developer, content creator, filmmaker, or just someone curious about AI-generated videos, this tool is packed with features that will blow your mind.
In our previous article, we’ve already covered Alibaba VACE, including its technical details and key features. If you want a deep dive into those details, check out our full breakdown here. For now, we’ll focus on the newly released models and how to get started with them.
Table of Contents
What is Alibaba VACE?
Alibaba VACE is a next-generation AI video editing model that brings together multiple video-related tasks in a single platform. It supports everything from simple edits to complex video transformations. VACE AI Video Editing capabilities let you move objects around, swap elements, reference styles from other media, expand your frame, or animate static images – all within one unified system. So, whether you want to modify a scene, replace objects, or create entirely new video sequences, VACE AI makes it all possible with just a few inputs.
VACE AI Models and Code Now Available
On March 31, 2025, Alibaba made the VACE AI model publicly accessible.
- Alibaba has released VACE-Wan2.1-1.3B-Preview and VACE-LTX-Video-0.9 models. They are now available on HuggingFace and ModelScope!
- Alibaba has released code for model inference, preprocessing, and the gradio demos.
- The larger models (VACE-Wan2.1-1.3B and VACE-Wan2.1-14B) are still in the pipeline and will be released later.
Released Alibaba VACE Models
The released Alibaba VACE models use either Wan or LTX models as their foundation. Both models are available on Hugging Face and ModelScope for developers and researchers. Let’s check out what’s currently available:
1. VACE-Wan2.1-1.3B-Preview
This VACE model works with videos around 81 × 480 × 832 in size and comes with an Apache-2.0 license.
- diffusion_pytorch_model.safetensors (7.15 GB)
2. VACE-LTX-Video-0.9
This VACE AI model handles videos around 97 × 512 × 768 and is released under the RAIL-M license. It consists of:
- ltx-video-2b-v0.9.safetensors (11.4 GB)
While these VACE Models can accept any resolution inputs, you’ll get the best results when sticking within their recommended video size ranges.
How to Get Started With VACE AI Models
Before getting started, make sure your system meets these basic requirements:
- Python 3.10.13
- CUDA version 12.4
- PyTorch >= 2.5.1
Step 1: Clone the Repository
git clone https://github.com/ali-vilab/VACE.git && cd VACE
Step 2: Install Dependencies
pip install torch==2.5.1 torchvision==0.20.1 –index-url https://download.pytorch.org/whl/cu124
pip install -r requirements.txt
Step 3: Install the Model
For Wan2.1-based VACE:
pip install wan@git+https://github.com/Wan-Video/Wan2.1
For LTX-Video-0.9-based VACE:
pip install ltx-video@git+https://github.com/Lightricks/LTX-Video@ltx-video-0.9.1 sentencepiece --no-deps
Store them in the <repo-root>/models/ directory for easy access.
Step 4: Setup for VACE Preprocessing Tools
If you need the preprocessing tools:
pip install -r requirements/annotator.txt
Download VACE-Annotators to the models directory in your repository root.
How to Use VACE AI Video Tool
VACE lets you input text prompts and optionally add video, masks, and images for generation or editing. Here are the main ways to use it:
1. End-to-End Running with VACE
For a simple end-to-end pipeline without diving into technical details:
# Run V2V depth
python vace/vace_pipeline.py --base wan --task depth --video assets/videos/test.mp4 --prompt 'your prompt here'
# Run MV2V inpainting by providing bbox
python vace/vace_pipeline.py --base wan --task inpainting --mode bbox --bbox 50,50,550,700 --video assets/videos/test.mp4 --prompt 'your prompt here'
This runs video preprocessing and model inference in sequence. Your output and intermediate files will save to the ./results/ folder by default.
2. VACE Preprocessing Step
For more control over inputs:
# Process video depth
python vace/vace_preproccess.py --task depth --video assets/videos/test.mp4
# Process video inpainting by providing bbox
python vace/vace_preproccess.py --task inpainting --mode bbox --bbox 50,50,550,700 --video assets/videos/test.mp4
The outputs save to ./processed/ by default.
3. VACE Model Inference Step
After preprocessing, run model inference:
# For Wan2.1 single GPU inference
python vace/vace_wan_inference.py --ckpt_dir <path-to-model> --src_video <path-to-src-video> --src_mask <path-to-src-mask> --src_ref_images <paths-to-src-ref-images> --prompt "your prompt here"
# For Wan2.1 Multi GPU Acceleration inference
pip install “xfuser>=0.4.1”
torchrun --nproc_per_node=8 vace/vace_wan_inference.py --dit_fsdp --t5_fsdp --ulysses_size 1 --ring_size 8 --ckpt_dir <path-to-model> --src_video <path-to-src-video> --src_mask <path-to-src-mask> --src_ref_images <paths-to-src-ref-images> --prompt "your prompt here"
# For LTX inference, run
python vace/vace_ltx_inference.py --ckpt_path <path-to-model> --text_encoder_path <path-to-model> --src_video <path-to-src-video> --src_mask <path-to-src-mask> --src_ref_images <paths-to-src-ref-images> --prompt "your prompt here"
Using VACE AI with Gradio
If you prefer a no-code interface, you can use Gradio demos to experiment with VACE AI.
1. To launch the preprocessing tool:
python vace/gradios/preprocess_demo.py
2. To run VACE AI model inference:
# Wan2.1 gradio inference
python vace/gradios/vace_wan_demo.py
# LTX gradio inference
python vace/gradios/vace_ltx_demo.py
This will launch an interactive web interface where you can upload videos, enter prompts, and see real-time AI-powered edits.
What’s Next for Alibaba VACE?
The current Alibaba VACE models are impressive but still have some limitations. Videos can’t extend much beyond 5-10 seconds with good quality, and the larger VACE Models are still coming soon. The next big improvements for Alibaba VACE will likely focus on length and performance.
VACE represents a major step forward in AI video tools by combining so many functions in one system. Whether you’re a professional video editor looking to streamline workflows or a creative experimenter wanting to try new ideas, Alibaba VACE offers powerful capabilities.
Getting Started Today with Alibaba VACE
With the release of these models and code, Alibaba has given everyone the chance to experiment with the VACE AI Video tool. Whether you choose the VACE-Wan2.1-1.3B-Preview or VACE-LTX-Video-0.9 model depends on your specific needs.
Both offer access to the core Alibaba VACE functionality. The ability to move objects within videos, swap elements, reference styles from other media, expand frames, and animate static images all within Alibaba VACE opens doors to endless creative possibilities.
Now’s the perfect time to start experimenting with this groundbreaking Alibaba VACE video model!
| Latest From Us
- Forget Towers: Verizon and AST SpaceMobile Are Launching Cellular Service From Space

- This $1,600 Graphics Card Can Now Run $30,000 AI Models, Thanks to Huawei

- The Global AI Safety Train Leaves the Station: Is the U.S. Already Too Late?

- The AI Breakthrough That Solves Sparse Data: Meet the Interpolating Neural Network

- The AI Advantage: Why Defenders Must Adopt Claude to Secure Digital Infrastructure







