Analyze speech for text XMP metadata
Adobe Premiere Pro and Soundbooth analyze spoken words and generate text metadata. You can edit and search text metadata like any other metadata properties. You can then navigate to the times at which specific words are spoken, to better align edits, advertising, and subtitles.
For more information, see the video tutorial Using Speech Search to Speed Editing.
Note: Useful results from speech analysis require good audio quality. Background noise significantly reduces accuracy. To remove such noise, use the tools and processes in Soundbooth.
Analyze speech to create text metadata
If you import files with a speech metadata into After Effects, each word appears as a layer marker on layers based on these footage items.
Navigate to a specific word in speech metadata
Edit speech metadata
In the Speech Analysis section, do any of the following:
Copy text from speech metadata to the clipboard for use in a text editor
Right-click the transcript, and choose Copy All.
Improve speech analysis with reference scripts
Accuracy of the speech to text conversion depends on the clarity of the spoken words and the quality of the recorded dialog. Dialog recorded in a noisy environment or with poor microphone placement cannot produce highly accurate results even with a reference script. You can nevertheless use a reference script to improve speech analysis. A reference script is a text document containing dialog similar to the dialog recorded in your assets.
There are two types of reference scripts:
Speech Analysis supports reference scripts only in the UTF-8 encoded text format, including Adobe Story scripts, which have the .astx filename extension.
Note: To make reference scripts available in Soundbooth, first complete the steps below in Adobe Premiere Pro.
Improve speech analysis with Adobe Story, OnLocation, and Adobe Premiere Pro
You can use Adobe Story, OnLocation, and Adobe Premiere Pro to create the most accurate speech analysis. Import a script written in Adobe Story into OnLocation. OnLocation produces a list of shot placeholders for each scene. Either record these shots using OnLocation during production, or link the placeholder shots to their respective video files when you import the video files into OnLocation. In either case, OnLocation embeds the text for each shot from the original script into the metadata of the shot.
When you import the clips into Adobe Premiere Pro, it automatically uses the Adobe Story script as a reference script. When Adobe Premiere Pro finds enough matches with the embedded script, Adobe Premiere Pro replaces the analyzed speech text with the embedded script text. Adobe Premiere Pro carries over correct spelling, proper names, and punctuation from the reference script, benefits that standard speech analysis cannot provide.
The closeness of the match between the embedded script text and the recorded dialog determines the accuracy of matched-script text. If 100% accuracy is important, edit and revise the script text first. Ensure that the script matches the recorded dialog before using it as a reference script.