How to Use Speech to Text in Filmora?

I’m going to show you how you can turn your speech into text using Filmora. Filmora has a speech to text feature that lets you create engaging captions for your video.

Viewers can watch with the volume turned down and still read text that reflects the spoken words.

You can get that effect using Filmora.

Here’s how you can do it step by step.

Filmora Speech-to-Text Tutorial

Set up your clip

Insert a video or an audio file into the project timeline. I have an audio file I recorded a few minutes ago, and the line is: Filmora speech to text automatically converts spoken words into text.

Right now there is no text on this video, but if I use speech to text there will be text appearing here.

How to Use Speech to Text in Filmora screenshot 1

Choose your caption mode

Click the audio or video clip, then click Speech to Text. You will see two options: Speech to Text and Dynamic Captions.

Speech to Text is the classic way to create subtitles, while Dynamic Captions gives you a bigger font, a more dynamic look, and some animation.

How to Use Speech to Text in Filmora screenshot 2

If you want to upload a video and mimic the style from some viral TikTok videos, try Dynamic Captions.

If you want a simple caption like subtitles from a movie, select Speech to Text.

I want to upload this to the internet, so Dynamic Captions is better for me.

Read More: How To Use Ai Text To Speech In Filmora Video Editor

Language, scope, and cost

Select the language. I’m using English, and there’s an option to translate, but there’s no need for translation here.

Choose the file scope: clip selection or entire timeline sequence.

How to Use Speech to Text in Filmora screenshot 3

Clip selection means only the item you selected earlier. Entire project applies it to everything on your timeline.

This affects how much you pay using AI credits, and longer audio costs more, so I’m selecting clip selection.

How to Use Speech to Text in Filmora screenshot 4

Generate captions

When you’re ready, click Generate and wait for the result.

It usually takes anywhere from a few seconds to a few minutes, especially if your video or audio is very long.

When it’s done, you will see text available on the video, and you can play it back to review.

How to Use Speech to Text in Filmora screenshot 5

Style your dynamic captions

I want a different styling. Click the caption file, then in the Text menu you will see many templates to choose from.

I want something that looks more attractive and shows several words at the same time.

How to Use Speech to Text in Filmora screenshot 6

Pick the template you like and click Apply to All. Play it back to see the updated look of your captions. You should see a much better looking version of the caption.

Read More: How To Use Ai Masking In Filmora

Fix recognition errors

It should be Filmora, not “humorous humor.” This is common if you are not a native speaker, and names or brands are sometimes not in the dictionary, which makes the AI struggle with certain words. That’s why you might want to manually edit the text.

How to Use Speech to Text in Filmora screenshot 7

Click the caption, click Text, then go to Basic.

Change the text manually to “Filmora,” then play it back again to confirm it’s correct.

Filmora speech to text automatically converts spoken words into text.

How to Use Speech to Text in Filmora screenshot 8

If your audio quality needs a boost to improve recognition, see how to clean it up with the AI voice enhancer in Filmora.

Final Thoughts

That’s how you convert speech to text using Filmora. Pick your mode, set your language and scope, generate, then style and edit as needed. Clean audio and quick manual fixes give you accurate, great looking captions.

Leave a comment