demobook

ComfyUI: Running MultiTalk on RunPod

Demo summary

The creator walks through executing the MultiTalk high-quality workflow on a RunPod instance, monitoring VRAM usage with nvitop while generating the video.

Step-by-step

  1. Drag and drop the 720p long context high quality workflow into ComfyUI
  2. Upload your demo image and audio file
  3. Set the width and height to match your image (e.g., 720x1280)
  4. Calculate and enter the number of frames based on audio length (25 frames per second)
  5. Enter the text prompt describing the character's action
  6. Set 'blocks to swap' to 0 for high VRAM machines or 40 for low VRAM machines
  7. Click the Run button at the bottom of the interface
  8. Monitor the generation progress via the terminal or nvitop
  9. Right-click the result and select Save Preview to download the video

Options

  • Enable 'tiled VAE' or 'all tiling' if you encounter out-of-memory errors
  • Bypass the block swapping node on high VRAM machines to speed up model loading
  • Download the entire output folder as an archive from the workspace file browser

Watch out for

  • The number of frames is not automatically set; you must manually calculate it based on audio duration
  • Initial model loading on RunPod can take a significant amount of time
  • High-quality processing requires 10 steps and significant VRAM (up to 35GB for 720p)

Tips

  • Use 25 frames for every one second of audio
  • Monitor nvitop to verify the GPU is being fully utilized (e.g., checking watt usage and VRAM)
  • Check the terminal status to see the exact duration of the uploaded audio for more accurate frame calculation

Highlights

the machine is ready and set... the rest of the usage is exactly same as in the Windows tutorial part.

All demos from “MultiTalk Full Tutorial With 1-Click Installer - Make Talking and Singing Videos From Static Images

  1. 7:406:12Generate 480p talking video in ComfyUIThe creator demonstrates loading a static image and audio file into a ComfyUI workflow using the MultiTalk node to generate a 10-second talking animation.ComfyUIAI Animation Generator
  2. 13:524:06High-quality 720p long context generationA demonstration of the 720p long context workflow in ComfyUI, showing how to adjust resolution, prompt, and block swap parameters for higher fidelity output.ComfyUIAI Animation Generator
  3. 18:560:49Side-by-side video quality comparisonThe creator uses an 'Ultimate Video Upscaler' tool to perform a side-by-side comparison between the 480p and 720p generated outputs.ComfyUIAI Video Upscaler
  4. 47:445:08Running MultiTalk on RunPodCurrentThe creator walks through executing the MultiTalk high-quality workflow on a RunPod instance, monitoring VRAM usage with nvitop while generating the video.ComfyUIAI Animation Generator
  5. Watch “MultiTalk Full Tutorial With 1-Click Installer - Make Talking and Singing Videos From Static Images” →

AI Animation Generator

  1. 2:070:23Load Wan 2.2 Animate workflow and install nodesThe user demonstrates how to drag and drop the Wan 2.2 Animate workflow into ComfyUI and use the Manager to install missing custom nodes.MDMZ
  2. 2:551:05Configure video input and output settingsThe demo shows how to upload a source video to ComfyUI, set the frame count, and adjust the output dimensions to match the original aspect ratio.MDMZ
  3. 16:361:56Setting up HunyuanVideo 1.5 in ComfyUIThe video demonstrates how to update ComfyUI and import the HunyuanVideo 1.5 JSON workflow files to create a node-based generation environment.AI Search
  4. 20:151:53Text-to-Video generation in ComfyUIA step-by-step demo of configuring the Hunyuan nodes in ComfyUI, entering a prompt for a 'giant cat', and rendering the final 720p video.AI Search
  5. 29:151:33Running HunyuanVideo with GGUF (Low VRAM)The video shows how to use the GGUF loader node to run a compressed version of HunyuanVideo 1.5, enabling video generation on GPUs with as little as 6GB of VRAM.AI Search
  6. 0:594:52Configure LTX 2.3 in ComfyUIThe creator walks through the ComfyUI node setup for LTX 2.3, explaining the GGUF model loader, VAE settings, and how to adjust resolution and frame counts for optimal rendering.AIKnowledge2Go
  7. 1:230:41Setting up LTX-2.3 in ComfyUIThe creator demonstrates how to browse templates in ComfyUI, search for LTX 2.3, and download the required missing models for the text-to-video workflow.MDMZ
  8. 13:071:21Applying LoRA to Wan 2.2 video generationThe video shows how to integrate a LoRA (Low-Rank Adaptation) into a Wan 2.2 workflow to achieve specific cinematic movements like a face zoom.pixaroma
  9. 31:133:44Text-to-video with LTX-2The video walks through setting up the LTX-2 model in ComfyUI to generate high-resolution video clips from text prompts and images.pixaroma
  10. 39:075:27Cloud-based ComfyUI on RunPod/RunHubThe video shows how to run complex video workflows in the cloud using RunHub AI, demonstrating the interface and execution of InfiniteTalk and Wan 2.2 without local hardware.pixaroma
  11. 2:351:46Configure Infinite Talk models in ComfyUIThe creator demonstrates how to organize the necessary models within the ComfyUI workflow, including the Lightning LoRA, quantized Infinite Talk UNET models, and the Wan 2.1 VAE and Clip Vision nodes.Aiconomist
  12. 11:060:26Combining audio, images, and promptsA demonstration of layering a specific action prompt (patting stomach) over a specific audio timestamp to create a fully directed AI scene.What Dreams Cost