Google cloud Speech -to-text support for 73 distinct languages, and 137 different local variants over 125 languages, the Speech-to-Text API allows you to quickly and accurately convert audio to text. In this video, Anu Shrivastava, Developer program engineer walk you through the best tips and tricks for lowering your word error rate when using the Speech-to-Text API to transcribe audio files into text. Watch along to learn how you can boost your automated speech recognition accuracy without having to train your own custom model.
Click on the video below to watch it in detail:
Chapters:
0:00 - Intro
0:48 - What is Speech-to-Text API?
1:11 - Speech-to-Text API quickstart demo
1:47 - How do you measure output accuracy?
2:53 - Tips for checking accuracy at scale
3:49 - Improving accuracy with the Speech Adaption API
6:42 - Wrap up
Extra Credit
- Speech-to-Text API Documentation → https://goo.gle/3Bo4JEQ
- Measuring and Improving Speech Accuracy Lab → https://goo.gle/3cOD2eR