SpeechRecognition Python library

Spoken Language Processing in Python

Daniel Bourke

Machine Learning Engineer/YouTube Creator

Why the SpeechRecognition library?

Some existing python libraries

CMU Sphinx
Kaldi
SpeechRecognition
Wav2letter++ by Facebook

Getting started with SpeechRecognition

Install from PyPi:

$ pip install SpeechRecognition

Compatible with Python 2 and 3
We'll use Python 3

Using the Recognizer class

# Import the SpeechRecognition library
import speech_recognition as sr

# Create an instance of Recognizer
recognizer = sr.Recognizer()

# Set the energy threshold
recognizer.energy_threshold = 300

Using the Recognizer class to recognize speech

Recognizer class has built-in functions which interact with speech APIs
- recognize_bing()
- recognize_google()
- recognize_google_cloud()
- recognize_wit()

Input: audio_file

Output: transcribed speech from audio_file

SpeechRecognition Example

Focus on recognize_google()
Recognize speech from an audio file with SpeechRecognition:

# Import SpeechRecognition library
import speech_recognition as sr

# Instantiate Recognizer class
recognizer = sr.Recognizer()

# Transcribe speech using Goole web API
recognizer.recognize_google(audio_data=audio_file
                            language="en-US")

Learning speech recognition on DataCamp is awesome!

Your turn!

Spoken Language Processing in Python