Spoken Language Processing in Python
Daniel Bourke
Machine Learning Engineer/YouTube Creator
Verschillende soorten audiobestanden
Digitale geluiden gemeten in frequentie (kHz)
Audioboeken en spraak liggen tussen 8 en 16 kHz
We kunnen audiobestanden niet zien, dus we moeten ze eerst omzetten
import wave
good-morning.wav# Import audio file as wave object
good_morning = wave.open("good-morning.wav", "r")
# Convert wave object to bytes
good_morning_soundwave = good_morning.readframes(-1)
# View the wav file in byte form
good_morning_soundwave
b'\xfd\xff\xfb\xff\xf8\xff\xf8\xff\xf7\...
Spoken Language Processing in Python