Spoken Language Processing in Python
Daniel Bourke
Machine Learning Engineer/YouTube Creator
import speech_recognition as sr
# Setup recognizer instance
recognizer = sr.Recognizer()
# Read in audio file
clean_support_call = sr.AudioFile("clean-support-call.wav")
# Check type of clean_support_call
type(clean_support_call)
<class 'speech_recognition.AudioFile'>
recognizer.recognize_google(audio_data=clean_support_call)
AssertionError: ``audio_data`` must be audio data
# Convert from AudioFile to AudioData
with clean_support_call as source:
    # Record the audio
    clean_support_call_audio = recognizer.record(source)
# Check the type
type(clean_support_call_audio)
<class 'speech_recognition.AudioData'>
# Transcribe clean support call
recognizer.recognize_google(audio_data=clean_support_call_audio)
hello I'd like to get some help setting up my account please
duration and offset are both None by default
# Leave duration and offset as default
with clean_support_call as source:
    clean_support_call_audio = recognizer.record(source,
                                                 duration=None,
                                                 offset=None)
# Get first 2 seconds of clean support call
with clean_support_call as source:
    clean_support_call_audio = recognizer.record(source,
                                                 duration=2.0)
hello I'd like to get
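The slides show duration but never a non-default offset. The sketch below illustrates, using only the standard-library wave module, what a window defined by offset and duration (both in seconds) means in terms of audio frames. It is a conceptual illustration, not how speech_recognition implements record(). The helper names make_silent_wav and read_window, and the generated silent audio, are hypothetical and exist only for this example.

```python
import io
import wave


def make_silent_wav(seconds, sample_rate=16000):
    """Build an in-memory mono 16-bit WAV of silence (hypothetical fixture)."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(1)       # mono
        wav_file.setsampwidth(2)       # 16-bit samples, 2 bytes per frame
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(b"\x00\x00" * int(seconds * sample_rate))
    buffer.seek(0)
    return buffer


def read_window(wav_source, offset=None, duration=None):
    """Return (frames, sample_rate) for a window of audio.

    offset: seconds to skip from the start (None means start at 0).
    duration: seconds to read (None means read to the end).
    """
    with wave.open(wav_source, "rb") as wav_file:
        sample_rate = wav_file.getframerate()
        total_frames = wav_file.getnframes()

        # Skip ahead by `offset` seconds, clamped to the end of the file.
        start = min(int((offset or 0) * sample_rate), total_frames)
        wav_file.setpos(start)

        # Read `duration` seconds, or whatever is left if that is shorter.
        remaining = total_frames - start
        if duration is None:
            count = remaining
        else:
            count = min(int(duration * sample_rate), remaining)
        return wav_file.readframes(count), sample_rate


# First 2 seconds of a 5-second clip (mirrors duration=2.0 above)
first_two, rate = read_window(make_silent_wav(5), duration=2.0)

# Skip 4 seconds, then ask for 2: only 1 second is left in the clip
tail, _ = read_window(make_silent_wav(5), offset=4.0, duration=2.0)
```

With the defaults (offset=None, duration=None) the whole clip is read, which matches the default behaviour described in the slides. A window that runs past the end of the audio simply returns what is left.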