Pollo AI: Text-to-video with sound using Pollo 2

Demo summary
The creator demonstrates the Pollo 2 video model, showing how it generates 10-second clips with integrated sound effects and character speech from text prompts.
Step-by-step
- Navigate to the AI video section
- Click on Text to Video
- Select the Pollo 2 model
- Choose your desired aspect ratio and resolution (up to 1080p)
- Select the number of video outputs (up to four)
- Enter a text prompt describing the scene, sound effects, or a topic of conversation
- Click to generate the 10-second video clip
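The choices made in the steps above can be sketched as a small pre-flight check. This is only an illustration, not a Pollo AI API: `ClipRequest`, its fields, and the prompt wording are hypothetical names introduced here. The only limits taken from the walkthrough are the four-output cap and the 1080p ceiling.

```python
from dataclasses import dataclass

MAX_OUTPUTS = 4        # "up to four" outputs, per the steps above
MAX_RESOLUTION = 1080  # "up to 1080p", per the steps above


@dataclass
class ClipRequest:
    """Hypothetical record of the UI choices; not part of any Pollo AI API."""
    scene: str
    sound_effects: tuple = ()
    conversation_topic: str = ""
    resolution: int = 1080  # vertical pixels, e.g. 480 or 1080
    outputs: int = 1

    def validate(self) -> list:
        """Raise on out-of-range settings and return a list of warnings."""
        if not self.scene.strip():
            raise ValueError("scene description is empty")
        if not 1 <= self.outputs <= MAX_OUTPUTS:
            raise ValueError(f"outputs must be between 1 and {MAX_OUTPUTS}")
        if self.resolution > MAX_RESOLUTION:
            raise ValueError(f"resolution must not exceed {MAX_RESOLUTION}p")
        warnings = []
        if self.resolution <= 480:
            warnings.append("resolution is 480p; raise it for a sharper clip")
        return warnings

    def prompt(self) -> str:
        """Join scene, sound effects, and conversation topic into one prompt."""
        parts = [self.scene.strip()]
        if self.sound_effects:
            parts.append("Sound effects: " + ", ".join(self.sound_effects) + ".")
        if self.conversation_topic:
            parts.append(f"The characters talk about {self.conversation_topic}.")
        return " ".join(parts)


if __name__ == "__main__":
    request = ClipRequest(
        scene="A dog surfs a wave at sunset.",
        sound_effects=("crashing waves", "barking"),
        conversation_topic="the weather",
        resolution=480,
    )
    for warning in request.validate():
        print("warning:", warning)
    print(request.prompt())
```

The resulting prompt text is what you would paste into the prompt box. The 480p warning mirrors the tip about checking the resolution before generating.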
Options
- Start from an image instead of a text prompt
- Prompt specific sound effects
- Prompt specific dialogue or a general topic of conversation
- Generate videos at 480p resolution
Watch out for
- The model may struggle to accurately recreate specific celebrity likenesses
- Dialogue may be generated for only one character even when the prompt includes several
Tips
- Check the resolution setting before generating to avoid accidentally rendering at 480p
- Provide a topic of conversation to trigger character speech even if you don't have specific lines
Highlights
“I mean, that's pretty nice in 480p.”
All demos from “Is Pollo.ai video better than Veo 3.1? I tested it!”
- [1:27, 2:22] Image generation with Nano Banana Pro on Pollo.ai: The creator demonstrates the text-to-image interface on Pollo.ai, comparing the standard Nano Banana model with the new Pro model for realism, skin texture, and complex infographic reasoning. (Pollo AI · AI Realistic Image Generator)
- [3:51, 0:42] Multi-panel comic generation with Pollo Image 1.6: A demonstration of the Pollo Image 1.6 model's ability to follow complex prompts for a multi-panel superhero comic strip, outperforming other models in panel count and theme consistency. (Pollo AI · AI Image Generator)
- [4:33, 0:35] Editing images in Pollo.ai Canvas Mode: The video walks through the Canvas Mode interface, showing tools for inpainting, upscaling, uncropping, and background removal using text prompts. (Pollo AI · AI Photo Editor)
- [5:14, 1:46] Text-to-video with sound using Pollo 2 (current): The creator demonstrates the Pollo 2 video model, showing how it generates 10-second clips with integrated sound effects and character speech from text prompts. (Pollo AI · Text to Video)
- [8:37, 1:43] Video model comparison: Pollo 2 vs Veo 3.1: A side-by-side comparison of Pollo 2 against Veo 3.1 and Veo 3.1 Fast, demonstrating Pollo's superior handling of background sound effects and dialogue accuracy. (Pollo AI · AI Video Generator)
- [11:44, 0:19] AI Video Editing: Adding environmental effects: The creator shows the AI video editing feature by taking an original video and applying a 'blizzard' prompt to transform the environment while keeping the subject intact. (Pollo AI · Video to Video)
Text to Video
- [14:52, 2:25] Convert text prompts to video using Kling and Luma models: The user demonstrates Pollo AI's text-to-video feature by prompting for a 'tiger running on the moon' and comparing the outputs of the Kling 1.6 and Luma AI models. (Blog With Ben)
- [6:08, 0:19] Generate an anime story video with Pollo AI: The user demonstrates generating a story video by entering a text prompt about a hero and selecting an 'action' theme within the Pollo AI agent interface. (AI Cash Tom)
- [1:47, 1:09] Generate text-to-video with Veo 3.1 in Pollo AI: The user selects the Veo 3.1 model and enters a prompt for a woman in a red dress to generate a 6-second widescreen video. (Paul J Lipsky)
- [2:56, 0:35] Generate text-to-video with Sora 2 in Pollo AI: The user demonstrates using the Sora 2 model within Pollo AI to create a video of raindrops hitting a window from a text prompt. (Paul J Lipsky)
- [0:36, 0:25] Generate video with Sora 2 in Pollo AI: The user demonstrates generating a high-fidelity video of a city fly-through using the Sora 2 model within the Pollo AI dashboard, highlighting the inclusion of native synchronized audio. (AI Master)
- [5:14, 1:46] Text-to-video with sound using Pollo 2 (current): The creator demonstrates the Pollo 2 video model, showing how it generates 10-second clips with integrated sound effects and character speech from text prompts. (Bob Doyle Media)