Text-based video editing in Descript

Demo summary
The creator demonstrates how to upload a video and edit it by highlighting and deleting text from the automatically generated transcript, which removes the corresponding video footage.
Step-by-step
- Create a new video project
- Drag and drop your video file into the project window
- Wait for the automatic transcription to complete
- Highlight the specific text in the transcript that corresponds to the footage you want to remove
- Press delete to cut the footage from the video
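The idea behind these steps can be sketched in a few lines of Python. This is a conceptual illustration only, not Descript's implementation: it assumes each transcript word carries a start and end time, and the names `Word` and `keep_ranges` are hypothetical. Deleting words from the transcript then amounts to computing which time ranges of the video remain.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """One transcript word with its (assumed) timing in seconds."""
    text: str
    start: float
    end: float


def keep_ranges(words, deleted, duration):
    """Return the (start, end) ranges left after deleting the words at the given indices."""
    cuts = sorted((words[i].start, words[i].end) for i in deleted)
    kept = []
    cursor = 0.0
    for start, end in cuts:
        # Keep whatever lies between the previous cut and this one.
        if start > cursor:
            kept.append((cursor, start))
        cursor = max(cursor, end)
    # Keep the tail of the video after the last cut.
    if cursor < duration:
        kept.append((cursor, duration))
    return kept


# Made-up sample transcript for illustration.
words = [
    Word("Hello", 0.0, 0.5),
    Word("um", 0.5, 1.0),
    Word("welcome", 1.0, 1.5),
    Word("back", 1.5, 2.0),
]

# Deleting the word "um" (index 1) leaves two ranges of footage.
print(keep_ranges(words, deleted={1}, duration=2.0))
```

Here, highlighting and deleting "um" corresponds to dropping the 0.5 to 1.0 second span, so the remaining footage is the material before and after it.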
Options
- Record video directly within Descript instead of uploading a file
Highlights
“you will be able to edit it just like a Word document super simply”
All demos from “Descript For Beginners 2025 | Everything You NEED To KNOW!”
- 2:26 (1:10) Text-based video editing in Descript (current): The creator demonstrates how to upload a video and edit it by highlighting and deleting text from the automatically generated transcript, which removes the corresponding video footage. (Descript · AI Video Editor)
- 4:33 (0:36) Using the Blade tool on the timeline: The video shows how to use the Blade tool (shortcut B) to manually split clips on the timeline and delete specific segments. (Descript · AI Video Editor)
- 7:15 (0:46) Remove filler words with Underlord AI: The demonstration shows Descript's Underlord AI identifying and batch-deleting filler words like 'um' and 'uh' from a recording. (Descript · AI Video Editor)
- 10:57 (0:32) Add automated captions to video: The demonstration shows how to apply and customize dynamic captions to a video project using Descript's built-in captioning styles. (Descript · AI Video Editor)
AI Video Editor
- 0:43 (0:16) Text-based video editing in Descript: The user demonstrates how deleting a sentence from the automatically generated transcript in Descript simultaneously removes the corresponding footage from the video timeline. (Learn Online Video)
- 1:13 (0:15) Remove filler words with Underlord: The video shows the Underlord AI assistant in Descript automatically detecting and deleting filler words like 'ums' and 'likes' from a talking head video. (Learn Online Video)
- 2:31 (0:43) Adding B-roll and sound effects from Descript library: The creator searches and inserts stock waterfall footage and cinematic sound effects directly from Descript's built-in media archive. (Learn Online Video)
- 3:27 (0:28) Generate AI images in Descript: The user types a text prompt for a 'chicken salad' into Descript's Generate Image tool to create a custom visual asset for the video. (Learn Online Video)
- 2:26 (1:10) Text-based video editing in Descript (current): The creator demonstrates how to upload a video and edit it by highlighting and deleting text from the automatically generated transcript, which removes the corresponding video footage. (Vince Opra)
- 4:33 (0:36) Using the Blade tool on the timeline: The video shows how to use the Blade tool (shortcut B) to manually split clips on the timeline and delete specific segments. (Vince Opra)
- 7:15 (0:46) Remove filler words with Underlord AI: The demonstration shows Descript's Underlord AI identifying and batch-deleting filler words like 'um' and 'uh' from a recording. (Vince Opra)
- 10:57 (0:32) Add automated captions to video: The demonstration shows how to apply and customize dynamic captions to a video project using Descript's built-in captioning styles. (Vince Opra)
- 0:31 (0:40) Correct eye contact with Descript AI: The creator demonstrates the 'Eye Contact' feature in Descript, which digitally adjusts his eyes to look at the camera lens even while he is reading a script off-screen. (Done By Lunch)
- 1:11 (0:25) Shorten word gaps in Descript: The user shows how to use the 'Shorten word gaps' tool to automatically identify and remove silences longer than one second across a 14-minute video file. (Done By Lunch)
- 1:36 (0:40) Remove retakes automatically in Descript: The 'Remove retake' feature is shown identifying repeated phrases in a transcript and automatically selecting the best take to keep while ignoring the others. (Done By Lunch)
- 2:16 (0:34) Automated multicam editing in Descript: The creator demonstrates 'Automatic Multicam', which processes two separate video files and automatically cuts between speakers to create a finished interview layout. (Done By Lunch)