
Multitalk: Parallel multi-speaker animation

Demo summary

The user demonstrates a more advanced multi-speaker setup where two audio tracks are played in parallel to animate a conversation between two people in a single reference image.

Step-by-step

  1. Upload a reference image containing two people
  2. Select the 'in parallel' option for audio playback
  3. Split your original audio into two separate tracks using an external audio editor
  4. Mute the second speaker in the first track and the first speaker in the second track, ensuring both files maintain the same total duration
  5. Upload both processed audio clips to the platform
  6. Calculate the video length in frames (audio duration in seconds multiplied by 25) and set it
  7. Enable TeaCache step skipping and set a 2x speed-up starting at 10% of generation
  8. Click Generate
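
The frame count in step 6 is plain arithmetic. A minimal sketch, assuming the 25 frames per second implied by that step; the function name `frames_for_duration` is hypothetical, and rounding up (so the video is never shorter than the audio) is an assumption rather than something the demo states:

```python
import math

FPS = 25  # frame rate implied by step 6 (seconds multiplied by 25)


def frames_for_duration(seconds: float, fps: int = FPS) -> int:
    """Return the number of video frames needed to cover `seconds` of audio."""
    if seconds < 0:
        raise ValueError("duration must be non-negative")
    # Round the product first to absorb floating-point noise, then round up
    # so a partial frame still counts (assumption, not from the demo).
    return math.ceil(round(seconds * fps, 6))
```

For example, a 12-second pair of tracks would need `frames_for_duration(12)`, that is 300 frames.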

Options

  • Select 'two audio sources played in a row' for sequential speaking/singing instead of parallel
  • Toggle 'keep the background' for image-to-video generation

Watch out for

  • Parallel audio tracks must have the exact same duration
  • The person on the left in the image is automatically assumed to be the first speaker
  • Requires external post-processing of audio clips to isolate voices while maintaining timing
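
The demo prepares the two tracks in an external audio editor. As a rough scripted alternative, the sketch below uses Python's standard `wave` module to silence one speaker's time ranges without changing the file length, then checks that two files have the same duration. The function names and the idea of passing time ranges by hand are assumptions for illustration; it also assumes uncompressed PCM WAV with 16-bit or wider samples, where zero bytes mean silence.

```python
import wave


def silence_ranges(src_path, dst_path, ranges_s):
    """Copy a PCM WAV file, zeroing the (start, end) ranges given in seconds.

    The output keeps the same number of frames as the input, so timing
    is preserved. Assumes 16-bit or wider samples (zero bytes = silence).
    """
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        audio = bytearray(src.readframes(params.nframes))
    bytes_per_frame = params.sampwidth * params.nchannels
    for start_s, end_s in ranges_s:
        # Convert seconds to frame indices, clamped to the file bounds.
        first = max(0, int(start_s * params.framerate))
        last = min(params.nframes, int(end_s * params.framerate))
        if last > first:
            audio[first * bytes_per_frame:last * bytes_per_frame] = bytes(
                (last - first) * bytes_per_frame
            )
    with wave.open(dst_path, "wb") as dst:
        dst.setparams(params)
        dst.writeframes(bytes(audio))


def same_duration(path_a, path_b):
    """Return True if both WAV files last exactly the same time."""
    with wave.open(path_a, "rb") as a, wave.open(path_b, "rb") as b:
        # Cross-multiply frames and sample rates to avoid float comparison.
        return a.getnframes() * b.getframerate() == b.getnframes() * a.getframerate()
```

In use, the same source file would be passed through `silence_ranges` twice, once with the second speaker's ranges and once with the first speaker's, and `same_duration` would confirm the pair before upload.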

Tips

  • Ensure the image composition aligns with the speaker order (left to right)
  • Minimal prompting is needed when using a strong reference image
  • The AI can realistically capture stutters, pauses, and background character movement

Highlights

“Everything looks very fluid and natural and realistic.”

All demos from “Make AI videos with talking + pose + reference control. MultiTalk & VACE tutorial”

  1. 5:27 (1:27) Overview of the Wan2GP interface for Multi-Talk. The creator walks through the Wan2GP Gradio interface, explaining how to select the Multi-Talk model and the specific 'VACE Multi-Talk Fusion X' version for better performance on low VRAM. [Multitalk, AI Animation Generator]
  2. 8:22 (4:37) Generate talking head video from image and audio. The user demonstrates uploading a reference image and an audio clip to Multi-Talk, configuring background removal and text prompts to generate a video of a woman speaking in a park. [Multitalk, AI Avatar Video Generator]
  3. 13:57 (1:32) Simulate angry expressions with Multi-Talk. The demo shows how to use an angry reference image and matching audio to generate a highly expressive video that captures the pitch and intensity of the speaker's anger. [Multitalk, AI Lip Sync Generator]
  4. 15:29 (1:10) Animate sad emotions and crying. The creator demonstrates Multi-Talk's ability to handle complex emotions by animating a sad character who pauses and breathes in sync with a crying audio track. [Multitalk, AI Lip Sync Generator]
  5. 17:44 (1:28) Lip-syncing anime characters. A demonstration of applying Japanese audio to an anime still image, showing how the tool handles non-human characters and different languages. [Multitalk, AI Lip Sync Generator]
  6. 19:39 (3:03) Animate multiple speakers in a podcast scene. The video shows how to configure Multi-Talk for two speakers by uploading an image of two people and two sequential audio clips, assigning voices based on their position in the frame. [Multitalk, AI Avatar Video Generator]
  7. 22:17 (3:21) Parallel multi-speaker animation (current). The user demonstrates a more advanced multi-speaker setup where two audio tracks are played in parallel to animate a conversation between two people in a single reference image. [Multitalk, AI Avatar Video Generator]
  8. 26:32 (2:34) Transfer human motion with VACE and Multi-Talk. The demo shows how to use a control video of a person dancing to drive the body movements of a reference image while simultaneously applying a Spanish lip-sync track. [Multitalk, Video to Video]

AI Avatar Video Generator

  1. 8:22 (4:37) Generate talking head video from image and audio. The user demonstrates uploading a reference image and an audio clip to Multi-Talk, configuring background removal and text prompts to generate a video of a woman speaking in a park. [AI Search]
  2. 19:39 (3:03) Animate multiple speakers in a podcast scene. The video shows how to configure Multi-Talk for two speakers by uploading an image of two people and two sequential audio clips, assigning voices based on their position in the frame. [AI Search]
  3. 22:17 (3:21) Parallel multi-speaker animation (current). The user demonstrates a more advanced multi-speaker setup where two audio tracks are played in parallel to animate a conversation between two people in a single reference image. [AI Search]
  4. 0:50 (0:47) Multi-person conversational video generation. MultiTalk is shown animating a group image where two separate people interact and respond to each other using different audio tracks. [NadimExplainsAI]