demobook

Generate multi-person talking videos on Hugging Face

Demo summary

The user demonstrates how to use the Multi-Talk interface on Hugging Face by entering a detailed text prompt, uploading a conditioning image, and adding two separate audio files in WAV format to generate a video of two people singing.

Step-by-step

  1. Enter a detailed text prompt describing the scene and the characters
  2. Upload a conditioning image
  3. Upload two separate audio files in WAV format
  4. Open Advanced Settings to view or adjust sample steps
  5. Click to generate the video

Options

  • Use ChatGPT to write the detailed prompt for you
  • Increase the sample steps in Advanced Settings

Watch out for

  • Audio files must be in WAV format
  • The output video quality may not be very high

Tips

  • Stick with the default 12 sample steps
  • Use WAV format for better voice quality
  • Provide a very detailed prompt describing character appearance and positioning

Highlights

overall I'm really impressed by this by this multi-dog model

AI Avatar Video Generator

  1. 6:211:01Generate multi-person talking videos on Hugging FaceCurrentThe user demonstrates how to use the Multi-Talk interface on Hugging Face by entering a detailed text prompt, uploading a conditioning image, and adding two separate audio files in WAV format to generate a video of two people singing.SkillCurb