ComfyUI: Multi-action video generation with WAN 2.1

Demo summary
The creator demonstrates prompting the WAN model to have a character speak while performing complex actions like walking to a table and opening a laptop.
Step-by-step
- Enter a multi-action prompt describing the character speaking while performing physical tasks
- Generate a signature voice file using the Chatterbox workflow
- Export the generated audio as an MP3 file
- Load the prompt and the MP3 audio into the WAN 2.1 model and generate the video
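The prompt-writing and MP3-export steps above can be sketched in Python. This is only an illustration under stated assumptions: the demo does not show its exact prompt text, so the subject and spoken line below are made up, and both helper functions (`build_multi_action_prompt`, `mp3_export_command`) are hypothetical names rather than part of ComfyUI, Chatterbox, or WAN. The export helper also assumes the voice file was saved as a WAV and that ffmpeg is installed. It only builds the command and does not run it.

```python
from pathlib import Path


def build_multi_action_prompt(subject, actions, dialogue):
    """Join a subject, an ordered list of actions, and a spoken line into one prompt.

    Hypothetical helper for illustration; the wording template is an assumption.
    """
    if not actions:
        raise ValueError("at least one action is required")
    # Chain the actions in order so the sequence is explicit in the text.
    action_text = ", then ".join(actions)
    return f'{subject} {action_text} while saying: "{dialogue}"'


def mp3_export_command(source_audio, bitrate="192k"):
    """Return an ffmpeg argument list that re-encodes source_audio to MP3.

    Assumes the source is a WAV file; the command is built but not executed.
    """
    source = Path(source_audio)
    target = source.with_suffix(".mp3")
    # -y overwrites an existing output file; libmp3lame is ffmpeg's MP3 encoder.
    return [
        "ffmpeg", "-y",
        "-i", str(source),
        "-codec:a", "libmp3lame",
        "-b:a", bitrate,
        str(target),
    ]


# Example values are invented for illustration, not taken from the demo.
prompt = build_multi_action_prompt(
    "A woman",
    ["walks to a table", "opens a laptop"],
    "Let's get started.",
)
print(prompt)
print(" ".join(mp3_export_command("signature_voice.wav")))
```

Listing the actions in order keeps the sequence explicit, which makes it easier to check whether the generated clip performed each step. The resulting prompt string and MP3 file are what the final step loads into the WAN workflow.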
Tips
- Combine dialogue with physical actions, such as walking or interacting with objects, to test how well the model handles complex, multi-step scenes
All demos from “Let's try WAN 2.6”
- 0:30 (length 0:35) Generate lip-synced video with WAN 2.1 on Freepik: The user demonstrates loading an image and an MP3 audio file into the WAN model on the Freepik platform to generate a video of a woman speaking with synchronized audio. Freepik · AI Lip Sync Generator
- 1:05 (length 0:21) Multi-action video generation with WAN 2.1 (current demo). ComfyUI · Text to Video