Introduction to audio data in Python

Spoken Language Processing in Python

Daniel Bourke

Machine Learning Engineer/YouTube Creator

Dealing with audio files in Python

Different kinds of audio files
- mp3
- wav
- m4a
- flac
Digital sound is measured by its sampling frequency (kHz)
- 1 kHz = 1,000 pieces of information per second
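
To make the kHz arithmetic concrete, here is a minimal sketch that turns a frequency and a clip length into a count of pieces of information. The helper name `samples_in_clip` and the 48 kHz, 2-second figures are illustrative assumptions, not values from the course.

```python
def samples_in_clip(frequency_khz: float, duration_seconds: float) -> int:
    """Return how many pieces of information a clip holds per channel."""
    # 1 kHz = 1,000 pieces of information per second
    samples_per_second = frequency_khz * 1000
    return int(round(samples_per_second * duration_seconds))


# Hypothetical example: a 2-second clip recorded at 48 kHz
two_seconds_at_48khz = samples_in_clip(48, 2)
print(two_seconds_at_48khz)
```

The count grows linearly with both frequency and duration, which is why higher-frequency recordings take up more space for the same length of speech.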

import wave

Audio file saved as good-morning.wav

# Import audio file as wave object
good_morning = wave.open("good-morning.wav", "r")

# Convert wave object to bytes
good_morning_soundwave = good_morning.readframes(-1)

# View the wav file in byte form
good_morning_soundwave

b'\xfd\xff\xfb\xff\xf8\xff\xf8\xff\xf7\...
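
Since good-morning.wav may not be at hand, the same steps can be tried on a file you generate yourself. The sketch below is an illustration under stated assumptions: the file name `demo-tone.wav`, the 8 kHz frame rate, and the 440 Hz tone are all made up for the example. It writes a short tone with `wave`, then reads it back as bytes.

```python
import math
import os
import struct
import tempfile
import wave

# Hypothetical settings chosen for the demo, not taken from the course
FRAME_RATE = 8000        # frames per second (8 kHz)
DURATION_SECONDS = 1
TONE_HZ = 440

demo_path = os.path.join(tempfile.mkdtemp(), "demo-tone.wav")

# Write a mono, 16-bit sine tone so there is a wav file to open
with wave.open(demo_path, "wb") as writer:
    writer.setnchannels(1)
    writer.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
    writer.setframerate(FRAME_RATE)
    frames = b"".join(
        struct.pack("<h", int(10000 * math.sin(2 * math.pi * TONE_HZ * n / FRAME_RATE)))
        for n in range(FRAME_RATE * DURATION_SECONDS)
    )
    writer.writeframes(frames)

# Import the audio file as a wave object and convert it to bytes
with wave.open(demo_path, "rb") as demo:
    frame_rate = demo.getframerate()
    n_frames = demo.getnframes()
    sample_width = demo.getsampwidth()
    demo_soundwave = demo.readframes(n_frames)

print(frame_rate, n_frames, sample_width, len(demo_soundwave))
```

The length of the byte string equals the number of frames times the sample width times the number of channels, so the raw bytes are not yet one value per piece of information.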
