ComfyUI: Lip-syncing audio to images with LTX 2.3

AIKnowledge2Go·Mar 2026

Demo summary

The tutorial shows how to upload an MP3 file and a character image to generate a lip-synced video, including a demonstration of the voice isolation subgraph in ComfyUI.

Step-by-step

Select Workflow 4 in ComfyUI
Upload an MP3 audio file
Upload a character image
Select the correct LTX model
Adjust the resolution settings (e.g., 960)
Enter an animation prompt describing the character and background
Click to run the generation

Options

Deactivate the voice isolation subgraph if using music instead of a character voice
Bypass the audio splitting by routing audio directly through the subgraph
Remove 'lip sync audio' from the text prompt as it is not required for the effect to work

Watch out for

The workflow is specifically trained for character voices using a mail brand row format
You must ensure the selected resolution is compatible with your specific GPU capacity

Tips

Use the voice isolation subgraph to clean up audio for better lip-sync results
Keep the animation prompt descriptive of the environment and camera movement (e.g., 'subtle camera push in')
Note that LTX2 performs well even when the character's lips are a similar color to their beard

Highlights

“I really love it. I mean... I think LTX2 did a great job”

All demos from “LTX 2.3 - The New KING Of UNCENSORED AI VIDEO”

0:594:52Configure LTX 2.3 in ComfyUIThe creator walks through the ComfyUI node setup for LTX 2.3, explaining the GGUF model loader, VAE settings, and how to adjust resolution and frame counts for optimal rendering.ComfyUI· AI Animation Generator
6:043:25Text-to-Video generation with LTX 2.3The video demonstrates generating a 7-second video from a detailed text prompt describing a woman at a zoo, showing how the model handles specific dialogue and character actions.ComfyUI· Text to Video
10:194:27Image-to-Video with LTX 2.3The creator demonstrates bringing a static image of a vampire warlord to life by using an image input node and a descriptive prompt to guide the animation and environmental effects.ComfyUI· Image to Video
14:523:11Lip-syncing audio to images with LTX 2.3CurrentThe tutorial shows how to upload an MP3 file and a character image to generate a lip-synced video, including a demonstration of the voice isolation subgraph in ComfyUI.ComfyUI· AI Lip Sync Generator
Watch “LTX 2.3 - The New KING Of UNCENSORED AI VIDEO” →

AI Lip Sync Generator

10:070:59Lip-syncing with custom audioThe creator demonstrates importing an audio file into the timeline and using a specific prompt structure to synchronize the character's mouth movements with the audio track.What Dreams Cost
6:310:34Animate talking avatar with LTX 2.3The creator uploads the generated image and audio into the ComfyUI LTX 2.3 workflow and runs the generation to produce a lip-synced video.Prince does AI
14:523:11Lip-syncing audio to images with LTX 2.3CurrentThe tutorial shows how to upload an MP3 file and a character image to generate a lip-synced video, including a demonstration of the voice isolation subgraph in ComfyUI.AIKnowledge2Go
36:000:39Singing characters with LTX-2 and custom audioThe creator demonstrates how to use LTX-2 to make a character sing by providing a custom audio file and a specific singing prompt.pixaroma
8:381:53Multi-character lip-sync workflow in InfiniteTalkThe tutorial demonstrates the multi-audio workflow, showing how to assign different audio files to specific characters (parallel vs. add types) for a multi-person scene.Atelier Darren
4:460:35Installing and using the Sync custom node in ComfyUISebastian walks through installing the Sync custom node via Git URL in ComfyUI and setting up a workflow with video/audio input nodes and the Sync generate node.Sebastian Kamph
5:291:09Combining image-to-video models with Sync in ComfyUIThe video demonstrates a complex workflow using Krea.ai to transform a shot into a cinematic style before passing the generated video into Sync for final lip-syncing.Sebastian Kamph
5:411:07Configure frame window and motion settings for Infinite TalkThe user demonstrates how to adjust the frame window size and motion frame overlap to balance video smoothness, lip-sync consistency, and VRAM usage.pixaroma
8:411:11Create a multi-person talking video in ComfyUIThe creator demonstrates the multi-talk version of the workflow, showing how to assign separate audio files to two different characters in a single image using the multimodel setting.pixaroma
3:270:37Two-person image to video conversation in ComfyUIThe creator demonstrates how to use the 'add' setting in the ComfyUI node widget to load separate audio files for a back-and-forth conversation between two characters.ComfyUI Studio
4:041:22Parallel audio lip-sync for two charactersThe video shows how to use the 'PAR' setting in the WAN workflow to merge audio tracks for simultaneous or perfectly timed character interactions, including masking two faces in the node.ComfyUI Studio
4:341:29Video-to-video lip sync with Infinite TalkThe creator shows how to use a vid-to-vid workflow to analyze an audio file and automatically synchronize a character's mouth and expressions in an existing video clip.Yaroflasher