Load source footage and models in ComfyUI

ComfyUI·Jun 2026

Demo summary

The user demonstrates importing source video footage and loading the necessary model nodes including WAN Video, VAE, and Clip Vision within the ComfyUI interface.

Step-by-step

Load the source footage into the workflow
Decode the footage into individual frames
Load the WAN Video model
Load the VAE model
Load the Clip Vision model
Load the detection tools for masking and motion analysis

Tips

Check the metadata such as resolution, frame count, and frame rate during the decoding process

All demos from “Learn How To: Face Swap”

0:290:24Load source footage and models in ComfyUICurrentThe user demonstrates importing source video footage and loading the necessary model nodes including WAN Video, VAE, and Clip Vision within the ComfyUI interface.ComfyUI· AI Animation Generator
0:530:48Generate and refine head masks with Florence 2 and SAM 2The workflow shows using Florence 2 for object detection to target the head and SAM 2 for precise segmentation, including adjusting the 'grow mask expand' value to improve blending.ComfyUI· AI Inpainting
1:410:46Prepare driving data and auto-prompts with Qwen2-VLThe demo shows running pose detection on source footage and using Qwen2-VL (referred to as Gwen VL) to generate a semantic text description from a reference image for the face swap.ComfyUI· AI Face Swap Generator
3:280:27Simplified face swap using ComfyUI App ModeA demonstration of the simplified 'App Mode' interface where users can upload footage and a reference image to perform a face swap without interacting with the node graph.ComfyUI· AI Face Swap Generator
Watch “Learn How To: Face Swap” →

AI Animation Generator

1:420:59Seed hunting with a multi-stage LTX 2.3 workflowThe creator demonstrates his custom ComfyUI workflow that generates four low-resolution LTX 2.3 samples simultaneously to find a 'golden seed' before upscaling to 1080p.Fox•Fur•Essence Films
16:361:56Setting up HunyuanVideo 1.5 in ComfyUIThe video demonstrates how to update ComfyUI and import the HunyuanVideo 1.5 JSON workflow files to create a node-based generation environment.AI Search
20:151:53Text-to-Video generation in ComfyUIA step-by-step demo of configuring the Hunyuan nodes in ComfyUI, entering a prompt for a 'giant cat', and rendering the final 720p video.AI Search
29:151:33Running HunyuanVideo with GGUF (Low VRAM)The video shows how to use the GGUF loader node to run a compressed version of HunyuanVideo 1.5, enabling video generation on GPUs with as little as 6GB of VRAM.AI Search
0:510:36Configure Infinite Talk and Wan 2.1 models in ComfyUIThe user demonstrates loading the Infinite Talk model alongside the Wan 2.1 I2V 14B model within ComfyUI, including enabling block swap and torch compile for VRAM optimization.Olares
1:551:13Configure sampling and window settings for long video generationThe user walks through the Wan Video Wrapper sampling node, explaining how to set frame window size, motion frame overlap, and start steps for consistent video generation.Olares
0:310:46Generate cinematic video with LTX MSR workflowThe creator demonstrates using the LTX MSR workflow in ComfyUI to generate a video from multiple reference images and a prompt, highlighting the 3D camera movement and character consistency.Apex Artist
0:290:24Load source footage and models in ComfyUICurrentThe user demonstrates importing source video footage and loading the necessary model nodes including WAN Video, VAE, and Clip Vision within the ComfyUI interface.ComfyUI
0:431:12Configure LTX-2.3 MSR LoRA and Prompt Relay in ComfyUIThe creator demonstrates setting up a ComfyUI workflow using the LTX-2.3 model with the MSR LoRA and Prompt Relay nodes to manage video generation on 8GB of VRAM.bigboss97
4:261:28Overview of the TensNodes Consistent Character Workflow in ComfyUIThe creator walks through a four-stage ComfyUI workflow utilizing TensNodes to process reference images and text for stable LTX video generation.SOTAI
3:411:25Configure LTX 2.3 foundational parametersA walkthrough of setting dimensions, frame counts, and loading core models including the LTX 2.3 distill model and audio VAE within a ComfyUI workflow.SOTAI
5:060:37Process visual and audio inputs in ComfyUIThe demo shows how to use the LTX V image-to-video condition node and empty latent audio node to prepare data for the generation engine.SOTAI