Multitalk: Parallel multi-speaker animation

Demo summary
The user demonstrates a more advanced multi-speaker setup where two audio tracks are played in parallel to animate a conversation between two people in a single reference image.
Step-by-step
- Upload a reference image containing two people
- Select the 'in parallel' option for audio playback
- Split your original audio into two separate tracks using an external audio editor
- Mute the second speaker in the first track and the first speaker in the second track, ensuring both files maintain the same total duration
- Upload both processed audio clips to the platform
- Calculate and set the video length in frames (Duration in seconds multiplied by 25)
- Enable 'tcash' skip stepping and set a 2x speed up starting at 10% of generation
- Click Generate
Options
- Select 'two audio sources played in a row' for sequential speaking/singing instead of parallel
- Toggle 'keep the background' for image-to-video generation
Watch out for
- Parallel audio tracks must have the exact same duration
- The person on the left in the image is automatically assumed to be the first speaker
- Requires external post-processing of audio clips to isolate voices while maintaining timing
Tips
- Ensure the image composition aligns with the speaker order (left to right)
- Minimal prompting is needed when using a strong reference image
- The AI can realistically capture stutters, pauses, and background character movement
Highlights
“everything looks very fluid and natural and realistic”
All demos from “Make AI videos with talking + pose + reference control. MultiTalk & VACE tutorial”
5:271:27Overview of the Wan2GP interface for Multi-TalkThe creator walks through the Wan2GP Gradio interface, explaining how to select the Multi-Talk model and the specific 'Vase Multi-Talk Fusion X' version for better performance on low VRAM.Multitalk· AI Animation Generator
8:224:37Generate talking head video from image and audioThe user demonstrates uploading a reference image and an audio clip to Multi-Talk, configuring background removal and text prompts to generate a video of a woman speaking in a park.Multitalk· AI Avatar Video Generator
13:571:32Simulate angry expressions with Multi-TalkThe demo shows how to use an angry reference image and matching audio to generate a highly expressive video that captures the pitch and intensity of the speaker's anger.Multitalk· AI Lip Sync Generator
15:291:10Animate sad emotions and cryingThe creator demonstrates Multi-Talk's ability to handle complex emotions by animating a sad character who pauses and breathes in sync with a crying audio track.Multitalk· AI Lip Sync Generator
17:441:28Lip-syncing anime charactersA demonstration of applying Japanese audio to an anime still image, showing how the tool handles non-human characters and different languages.Multitalk· AI Lip Sync Generator
19:393:03Animate multiple speakers in a podcast sceneThe video shows how to configure Multi-Talk for two speakers by uploading an image of two people and two sequential audio clips, assigning voices based on their position in the frame.Multitalk· AI Avatar Video Generator
22:173:21Parallel multi-speaker animationCurrentThe user demonstrates a more advanced multi-speaker setup where two audio tracks are played in parallel to animate a conversation between two people in a single reference image.Multitalk· AI Avatar Video Generator
26:322:34Transfer human motion with VACE and Multi-TalkThe demo shows how to use a control video of a person dancing to drive the body movements of a reference image while simultaneously applying a Spanish lip-sync track.Multitalk· Video to Video- Watch “Make AI videos with talking + pose + reference control. MultiTalk & VACE tutorial” →
AI Avatar Video Generator
8:224:37Generate talking head video from image and audioThe user demonstrates uploading a reference image and an audio clip to Multi-Talk, configuring background removal and text prompts to generate a video of a woman speaking in a park.AI Search
19:393:03Animate multiple speakers in a podcast sceneThe video shows how to configure Multi-Talk for two speakers by uploading an image of two people and two sequential audio clips, assigning voices based on their position in the frame.AI Search
22:173:21Parallel multi-speaker animationCurrentThe user demonstrates a more advanced multi-speaker setup where two audio tracks are played in parallel to animate a conversation between two people in a single reference image.AI Search
0:500:47Multi-person conversational video generationMultiTalk is shown animating a group image where two separate people interact and respond to each other using different audio tracks.NadimExplainsAI
Multitalk