demobook

ComfyUI: Lip-syncing audio to images with LTX 2.3

ComfyUITry it →Watch full video →AIKnowledge2Go ·

Demo summary

The tutorial shows how to upload an MP3 file and a character image to generate a lip-synced video, including a demonstration of the voice isolation subgraph in ComfyUI.

Step-by-step

  1. Select Workflow 4 in ComfyUI
  2. Upload an MP3 audio file
  3. Upload a character image
  4. Select the correct LTX model
  5. Adjust the resolution settings (e.g., 960)
  6. Enter an animation prompt describing the character and background
  7. Click to run the generation

Options

  • Deactivate the voice isolation subgraph if using music instead of a character voice
  • Bypass the audio splitting by routing audio directly through the subgraph
  • Remove 'lip sync audio' from the text prompt as it is not required for the effect to work

Watch out for

  • The workflow is specifically trained for character voices using a mail brand row format
  • You must ensure the selected resolution is compatible with your specific GPU capacity

Tips

  • Use the voice isolation subgraph to clean up audio for better lip-sync results
  • Keep the animation prompt descriptive of the environment and camera movement (e.g., 'subtle camera push in')
  • Note that LTX2 performs well even when the character's lips are a similar color to their beard

Highlights

I really love it. I mean... I think LTX2 did a great job

All demos from “LTX 2.3 - The New KING Of UNCENSORED AI VIDEO

  1. 0:594:52Configure LTX 2.3 in ComfyUIThe creator walks through the ComfyUI node setup for LTX 2.3, explaining the GGUF model loader, VAE settings, and how to adjust resolution and frame counts for optimal rendering.ComfyUIAI Animation Generator
  2. 6:043:25Text-to-Video generation with LTX 2.3The video demonstrates generating a 7-second video from a detailed text prompt describing a woman at a zoo, showing how the model handles specific dialogue and character actions.ComfyUIText to Video
  3. 10:194:27Image-to-Video with LTX 2.3The creator demonstrates bringing a static image of a vampire warlord to life by using an image input node and a descriptive prompt to guide the animation and environmental effects.ComfyUIImage to Video
  4. 14:523:11Lip-syncing audio to images with LTX 2.3CurrentThe tutorial shows how to upload an MP3 file and a character image to generate a lip-synced video, including a demonstration of the voice isolation subgraph in ComfyUI.ComfyUIAI Lip Sync Generator
  5. Watch “LTX 2.3 - The New KING Of UNCENSORED AI VIDEO” →

AI Lip Sync Generator

  1. 14:523:11Lip-syncing audio to images with LTX 2.3CurrentThe tutorial shows how to upload an MP3 file and a character image to generate a lip-synced video, including a demonstration of the voice isolation subgraph in ComfyUI.AIKnowledge2Go
  2. 36:000:39Singing characters with LTX-2 and custom audioThe creator demonstrates how to use LTX-2 to make a character sing by providing a custom audio file and a specific singing prompt.pixaroma
  3. 10:070:59Lip-syncing with custom audioThe creator demonstrates importing an audio file into the timeline and using a specific prompt structure to synchronize the character's mouth movements with the audio track.What Dreams Cost
  4. 5:411:07Configure frame window and motion settings for Infinite TalkThe user demonstrates how to adjust the frame window size and motion frame overlap to balance video smoothness, lip-sync consistency, and VRAM usage.pixaroma
  5. 8:411:11Create a multi-person talking video in ComfyUIThe creator demonstrates the multi-talk version of the workflow, showing how to assign separate audio files to two different characters in a single image using the multimodel setting.pixaroma
  6. 3:270:37Two-person image to video conversation in ComfyUIThe creator demonstrates how to use the 'add' setting in the ComfyUI node widget to load separate audio files for a back-and-forth conversation between two characters.ComfyUI Studio
  7. 4:041:22Parallel audio lip-sync for two charactersThe video shows how to use the 'PAR' setting in the WAN workflow to merge audio tracks for simultaneous or perfectly timed character interactions, including masking two faces in the node.ComfyUI Studio
  8. 5:420:22Video-to-video lip-sync with WAN workflowThe demonstration shows how to take an existing video and sync the lip movements to a new audio file using the ComfyUI workflow.ComfyUI Studio
  9. 4:460:35Installing and using the Sync custom node in ComfyUISebastian walks through installing the Sync custom node via Git URL in ComfyUI and setting up a workflow with video/audio input nodes and the Sync generate node.Sebastian Kamph
  10. 5:291:09Combining image-to-video models with Sync in ComfyUIThe video demonstrates a complex workflow using Krea.ai to transform a shot into a cinematic style before passing the generated video into Sync for final lip-syncing.Sebastian Kamph
  11. 3:201:14Animate static images with Infinite Talk V2A demonstration of the Infinite Talk V2 workflow where a static thumbnail image is paired with an audio track to create a lip-synced, animated talking character.Yaroflasher
  12. 4:341:29Video-to-video lip sync with Infinite TalkThe creator shows how to use a vid-to-vid workflow to analyze an audio file and automatically synchronize a character's mouth and expressions in an existing video clip.Yaroflasher