demobook

Load source footage and models in ComfyUI

Demo summary

The user demonstrates importing source video footage and loading the necessary model nodes including WAN Video, VAE, and Clip Vision within the ComfyUI interface.

Step-by-step

  1. Load the source footage into the workflow
  2. Decode the footage into individual frames
  3. Load the WAN Video model
  4. Load the VAE model
  5. Load the Clip Vision model
  6. Load the detection tools for masking and motion analysis

Tips

  • Check the metadata such as resolution, frame count, and frame rate during the decoding process

All demos from “Learn How To: Face Swap

  1. 0:290:24Load source footage and models in ComfyUICurrentThe user demonstrates importing source video footage and loading the necessary model nodes including WAN Video, VAE, and Clip Vision within the ComfyUI interface.ComfyUIAI Animation Generator
  2. 0:530:48Generate and refine head masks with Florence 2 and SAM 2The workflow shows using Florence 2 for object detection to target the head and SAM 2 for precise segmentation, including adjusting the 'grow mask expand' value to improve blending.ComfyUIAI Inpainting
  3. 1:410:46Prepare driving data and auto-prompts with Qwen2-VLThe demo shows running pose detection on source footage and using Qwen2-VL (referred to as Gwen VL) to generate a semantic text description from a reference image for the face swap.ComfyUIAI Face Swap Generator
  4. 3:280:27Simplified face swap using ComfyUI App ModeA demonstration of the simplified 'App Mode' interface where users can upload footage and a reference image to perform a face swap without interacting with the node graph.ComfyUIAI Face Swap Generator
  5. Watch “Learn How To: Face Swap” →

AI Animation Generator

  1. 2:070:23Load Wan 2.2 Animate workflow and install nodesThe user demonstrates how to drag and drop the Wan 2.2 Animate workflow into ComfyUI and use the Manager to install missing custom nodes.MDMZ
  2. 2:551:05Configure video input and output settingsThe demo shows how to upload a source video to ComfyUI, set the frame count, and adjust the output dimensions to match the original aspect ratio.MDMZ
  3. 16:361:56Setting up HunyuanVideo 1.5 in ComfyUIThe video demonstrates how to update ComfyUI and import the HunyuanVideo 1.5 JSON workflow files to create a node-based generation environment.AI Search
  4. 20:151:53Text-to-Video generation in ComfyUIA step-by-step demo of configuring the Hunyuan nodes in ComfyUI, entering a prompt for a 'giant cat', and rendering the final 720p video.AI Search
  5. 29:151:33Running HunyuanVideo with GGUF (Low VRAM)The video shows how to use the GGUF loader node to run a compressed version of HunyuanVideo 1.5, enabling video generation on GPUs with as little as 6GB of VRAM.AI Search
  6. 0:594:52Configure LTX 2.3 in ComfyUIThe creator walks through the ComfyUI node setup for LTX 2.3, explaining the GGUF model loader, VAE settings, and how to adjust resolution and frame counts for optimal rendering.AIKnowledge2Go
  7. 1:230:41Setting up LTX-2.3 in ComfyUIThe creator demonstrates how to browse templates in ComfyUI, search for LTX 2.3, and download the required missing models for the text-to-video workflow.MDMZ
  8. 13:071:21Applying LoRA to Wan 2.2 video generationThe video shows how to integrate a LoRA (Low-Rank Adaptation) into a Wan 2.2 workflow to achieve specific cinematic movements like a face zoom.pixaroma
  9. 31:133:44Text-to-video with LTX-2The video walks through setting up the LTX-2 model in ComfyUI to generate high-resolution video clips from text prompts and images.pixaroma
  10. 39:075:27Cloud-based ComfyUI on RunPod/RunHubThe video shows how to run complex video workflows in the cloud using RunHub AI, demonstrating the interface and execution of InfiniteTalk and Wan 2.2 without local hardware.pixaroma
  11. 2:351:46Configure Infinite Talk models in ComfyUIThe creator demonstrates how to organize the necessary models within the ComfyUI workflow, including the Lightning LoRA, quantized Infinite Talk UNET models, and the Wan 2.1 VAE and Clip Vision nodes.Aiconomist
  12. 11:060:26Combining audio, images, and promptsA demonstration of layering a specific action prompt (patting stomach) over a specific audio timestamp to create a fully directed AI scene.What Dreams Cost