Spoken Language Processing in Python
Daniel Bourke
Machine Learning Engineer/YouTube Creator
Many different kinds of audio files
Digital sound is measured by its sampling frequency (kHz)
Audiobooks and spoken language are typically sampled at between 8 and 16 kHz
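To make the kHz figures concrete, here is a minimal sketch of the arithmetic, assuming 1 kHz means 1,000 samples per second. The helper name `samples_in` and the two-second duration are illustrative, not part of the course material.

```python
def samples_in(duration_seconds, sample_rate_hz):
    """Return how many samples a clip of the given length contains."""
    return int(duration_seconds * sample_rate_hz)

# A hypothetical two-second clip at the two ends of the speech range
low_rate_samples = samples_in(2, 8_000)    # 8 kHz
high_rate_samples = samples_in(2, 16_000)  # 16 kHz

print(low_rate_samples, high_rate_samples)
```

Doubling the sampling frequency doubles the number of values stored for the same stretch of speech.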
We can't see audio files, so we have to transform them first
import wave
good-morning.wav
# Import audio file as wave object
good_morning = wave.open("good-morning.wav", "r")
# Convert wave object to bytes
good_morning_soundwave = good_morning.readframes(-1)
# View the wav file in byte form
good_morning_soundwave
b'\xfd\xff\xfb\xff\xf8\xff\xf8\xff\xf7\...
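The raw bytes above are hard to read directly. One possible next step, sketched here with the standard library only, is to decode them into integers. Because `good-morning.wav` is not available here, the sketch first writes a synthetic one-second tone to a temporary file. The file name `example-tone.wav`, the 16 kHz rate, and the 16-bit mono format are all assumptions made for illustration.

```python
import array
import math
import os
import tempfile
import wave

SAMPLE_RATE = 16_000  # assumed rate, within the speech range
path = os.path.join(tempfile.mkdtemp(), "example-tone.wav")

# Build one second of a 440 Hz sine wave as signed 16-bit integers
tone = array.array(
    "h",
    (int(10_000 * math.sin(2 * math.pi * 440 * n / SAMPLE_RATE))
     for n in range(SAMPLE_RATE)),
)

# Write the tone as a mono, 16-bit WAV file
with wave.open(path, "wb") as wav_out:
    wav_out.setnchannels(1)
    wav_out.setsampwidth(2)
    wav_out.setframerate(SAMPLE_RATE)
    wav_out.writeframes(tone.tobytes())

# Read it back the same way as on the slide
with wave.open(path, "rb") as wav_in:
    frame_rate = wav_in.getframerate()
    n_frames = wav_in.getnframes()
    raw_bytes = wav_in.readframes(-1)

# Decode the bytes into signed 16-bit integers
samples = array.array("h")
samples.frombytes(raw_bytes)

duration_seconds = n_frames / frame_rate
print(frame_rate, n_frames, duration_seconds)
```

The integer values in `samples` can then be plotted or processed further. Decoding with the `"h"` type code only fits 16-bit audio, so a file with a different sample width would need a different type code.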