Audio parameters with PyDub
Every audio file you work with will have a number of characteristics associated with them, such as, channels, frame rate (or sample rate), sample width and more.
Knowing these parameters is useful to ensure your audio files are compatible with various API requirements for speech transcription.
For example, many APIs recommend a minimum frame rate (wav_file.frame_rate
) of 16,000 Hz.
When you create an instance of AudioSegment
, PyDub
automatically infers these parameters from your audio files and saves them as attributes.
In this exercise, we'll explore these attributes.
Diese Übung ist Teil des Kurses
Spoken Language Processing in Python
Interaktive Übung
Versuche dich an dieser Übung, indem du diesen Beispielcode vervollständigst.
# Import audio file
wav_file = AudioSegment.from_file(file="wav_file.wav")
# Find the frame rate
print(wav_file.____)