ComfyUI: Multi-action video generation with WAN 2.1

Apex Artist · Jan 2026

Demo summary

The creator demonstrates prompting the WAN model to have a character speak while performing complex actions like walking to a table and opening a laptop.

Step-by-step

Enter a multi-action prompt describing the character speaking while performing physical tasks
Generate a signature voice file using the Chatterbox workflow
Export the generated audio as an MP3 file
Input the prompt and audio into the WAN 2.1 model
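
As a rough illustration of the first step, the sketch below assembles a multi-action prompt from a spoken line and an ordered list of physical actions. It is plain string handling, not a ComfyUI or WAN API. The helper name `build_multi_action_prompt`, the sentence template, and the sample dialogue are all hypothetical; the creator's actual prompt wording is not given here.

```python
def build_multi_action_prompt(subject, dialogue, actions):
    """Combine ordered actions and one spoken line into a single prompt string.

    Hypothetical helper for illustration only; the sentence template is an
    assumption, not a format required by the WAN model.
    """
    if not actions:
        raise ValueError("at least one action is required")
    # Chain the actions in the order they should happen on screen.
    sequence = ", then ".join(actions)
    return f'{subject} {sequence} while saying "{dialogue}"'


prompt = build_multi_action_prompt(
    subject="A woman",
    dialogue="Let me show you what I have been working on.",  # made-up line
    actions=["walks to a table", "opens a laptop"],
)
print(prompt)
```

The resulting string would then be pasted into the prompt field of the workflow alongside the exported audio file. Keeping the actions in a list makes it easy to add or reorder steps when testing how many the model can follow.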

Tips

Combine dialogue with physical actions like walking or interacting with objects to test how well the model handles complex, multi-step prompts

All demos from “Let's try WAN 2.6”

0:30 · 0:35 · Generate lip-synced video with WAN 2.1 on Freepik · The user demonstrates loading an image and an MP3 audio file into the WAN model on the Freepik platform to generate a video of a woman speaking with synchronized audio. · Freepik · AI Lip Sync Generator
1:05 · 0:21 · Multi-action video generation with WAN 2.1 (current) · The creator demonstrates prompting the WAN model to have a character speak while performing complex actions like walking to a table and opening a laptop. · ComfyUI · Text to Video

Text to Video

6:04 · 3:25 · Text-to-Video generation with LTX 2.3 · The video demonstrates generating a 7-second video from a detailed text prompt describing a woman at a zoo, showing how the model handles specific dialogue and character actions. · AIKnowledge2Go
3:13 · 0:22 · Generating AI video and audio with LTX-2.3 · The user shows how to input a prompt including dialogue and sound effects into the LTX-2.3 workflow in ComfyUI to generate a video with synchronized audio. · MDMZ
7:39 · 0:27 · Input prompt and run generation · The demo shows entering a text description into the prompt box and clicking 'Queue Prompt' to start the Wan 2.2 Animate generation process. · MDMZ
1:05 · 0:21 · Multi-action video generation with WAN 2.1 (current) · The creator demonstrates prompting the WAN model to have a character speak while performing complex actions like walking to a table and opening a laptop. · Apex Artist