ComfyUI: Sampling and video output generation

ComfyUI Try it →Watch full video →Viraj Builds · Apr 2026

Demo summary

The demonstration shows the final sampling process using the Lightning LoRA settings (CFG 1, 7-10 steps) and combining the decoded frames with audio for the final video file.

Step-by-step

Set the CFG scale to 1 for the Lightning LoRA
Set the sampling steps to a range between 7 and 10
Connect the text, image, and audio embeddings to the sampler
Decode the output samples using the Wan VAE
Combine the decoded images with the input audio
Apply the global frame rate to the final output

Options

Set CFG to approximately 5 if not using the LoRA
Set sampling steps to 30-50 if not using the LoRA

Watch out for

Ensure CFG is set to 1 specifically when using the distilled Lightning LoRA with the Wan video model

All demos from “AI Talking Head Videos With Perfect Lip Sync (ComfyUI + InfiniteTalk)”

2:560:54Image preparation for InfiniteTalkThe video shows the process of loading a portrait image and resizing it to the specific 384x640 resolution required by the Wan video model using standard ComfyUI nodes.ComfyUI· AI Crop Image
3:503:32Setting up Wan 2.1 and InfiniteTalk modelsA walkthrough of the model group in ComfyUI, showing the configuration of Wan Video Block Swap for VRAM management, the Light X2V LoRA for faster generation, and the InfiniteTalk GGUF model for audio conditioning.ComfyUI· AI Animation Generator
9:081:03Sampling and video output generationCurrentThe demonstration shows the final sampling process using the Lightning LoRA settings (CFG 1, 7-10 steps) and combining the decoded frames with audio for the final video file.ComfyUI· AI Animation Generator
Watch “AI Talking Head Videos With Perfect Lip Sync (ComfyUI + InfiniteTalk)” →

AI Animation Generator

2:070:23Load Wan 2.2 Animate workflow and install nodesThe user demonstrates how to drag and drop the Wan 2.2 Animate workflow into ComfyUI and use the Manager to install missing custom nodes.MDMZ
2:551:05Configure video input and output settingsThe demo shows how to upload a source video to ComfyUI, set the frame count, and adjust the output dimensions to match the original aspect ratio.MDMZ
16:361:56Setting up HunyuanVideo 1.5 in ComfyUIThe video demonstrates how to update ComfyUI and import the HunyuanVideo 1.5 JSON workflow files to create a node-based generation environment.AI Search
20:151:53Text-to-Video generation in ComfyUIA step-by-step demo of configuring the Hunyuan nodes in ComfyUI, entering a prompt for a 'giant cat', and rendering the final 720p video.AI Search
29:151:33Running HunyuanVideo with GGUF (Low VRAM)The video shows how to use the GGUF loader node to run a compressed version of HunyuanVideo 1.5, enabling video generation on GPUs with as little as 6GB of VRAM.AI Search
0:594:52Configure LTX 2.3 in ComfyUIThe creator walks through the ComfyUI node setup for LTX 2.3, explaining the GGUF model loader, VAE settings, and how to adjust resolution and frame counts for optimal rendering.AIKnowledge2Go
1:230:41Setting up LTX-2.3 in ComfyUIThe creator demonstrates how to browse templates in ComfyUI, search for LTX 2.3, and download the required missing models for the text-to-video workflow.MDMZ
13:071:21Applying LoRA to Wan 2.2 video generationThe video shows how to integrate a LoRA (Low-Rank Adaptation) into a Wan 2.2 workflow to achieve specific cinematic movements like a face zoom.pixaroma
31:133:44Text-to-video with LTX-2The video walks through setting up the LTX-2 model in ComfyUI to generate high-resolution video clips from text prompts and images.pixaroma
39:075:27Cloud-based ComfyUI on RunPod/RunHubThe video shows how to run complex video workflows in the cloud using RunHub AI, demonstrating the interface and execution of InfiniteTalk and Wan 2.2 without local hardware.pixaroma
2:351:46Configure Infinite Talk models in ComfyUIThe creator demonstrates how to organize the necessary models within the ComfyUI workflow, including the Lightning LoRA, quantized Infinite Talk UNET models, and the Wan 2.1 VAE and Clip Vision nodes.Aiconomist
11:060:26Combining audio, images, and promptsA demonstration of layering a specific action prompt (patting stomach) over a specific audio timestamp to create a fully directed AI scene.What Dreams Cost