SpeechRecognition Python library

Spoken Language Processing in Python

Daniel Bourke

Machine Learning Engineer/YouTube Creator

Why the SpeechRecognition library?

Some existing python libraries

  • CMU Sphinx
  • Kaldi
  • SpeechRecognition
  • Wav2letter++ by Facebook
Spoken Language Processing in Python

Getting started with SpeechRecognition

Install from PyPi:

$ pip install SpeechRecognition
  • Compatible with Python 2 and 3
  • We'll use Python 3
Spoken Language Processing in Python

Using the Recognizer class

# Import the SpeechRecognition library
import speech_recognition as sr

# Create an instance of Recognizer recognizer = sr.Recognizer()
# Set the energy threshold recognizer.energy_threshold = 300
Spoken Language Processing in Python

Using the Recognizer class to recognize speech

  • Recognizer class has built-in functions which interact with speech APIs
    • recognize_bing()
    • recognize_google()
    • recognize_google_cloud()
    • recognize_wit()

Input: audio_file

Output: transcribed speech from audio_file

Spoken Language Processing in Python

SpeechRecognition Example

  • Focus on recognize_google()

  • Recognize speech from an audio file with SpeechRecognition:

# Import SpeechRecognition library
import speech_recognition as sr

# Instantiate Recognizer class recognizer = sr.Recognizer()
# Transcribe speech using Goole web API recognizer.recognize_google(audio_data=audio_file language="en-US")
Learning speech recognition on DataCamp is awesome!
Spoken Language Processing in Python

Your turn!

Spoken Language Processing in Python

Preparing Video For Download...