OpenAI's text-to-speech (TTS)
OpenAI now provides models that generate human-like speech from text input, known as text-to-speech (TTS). There are several voices to choose from, and the response can be streamed to a local file or to a downstream application.
This exercise is part of the course Multi-Modal Systems with the OpenAI API.
Exercise instructions
- Create the text-to-speech request for "Hi! How's your day going?", using the "ballad" voice.
- Stream the response to an .mp3 file.
Hands-on interactive exercise
Complete this sample code to finish the exercise.
from openai import OpenAI

# Placeholder: supply your own API key here
client = OpenAI(api_key="")
# Create the text-to-speech request
response = client.audio.speech.create(
    model="gpt-4o-mini-tts",
    ____,
    input="Hi! How's your day going?"
)
# Stream the response to an MP3 file
response.____("output.mp3")
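
For reference, here is a minimal sketch of how a text-to-speech request can be wrapped in a reusable function. It uses the OpenAI Python SDK's `with_streaming_response` variant of `audio.speech.create` rather than the plain call in the exercise template, so the audio is written to disk in chunks. The helper name `save_speech` and its default arguments are assumptions made for illustration; they are not part of the SDK or the course.

```python
from pathlib import Path


def save_speech(client, text, voice="ballad", path="output.mp3",
                model="gpt-4o-mini-tts"):
    """Request speech for `text` and stream the audio into `path`.

    `client` is expected to be an `openai.OpenAI` instance.
    `save_speech` is a hypothetical helper, not an SDK function.
    """
    # with_streaming_response returns a context manager; the body is only
    # read when stream_to_file pulls it down chunk by chunk.
    with client.audio.speech.with_streaming_response.create(
        model=model,
        voice=voice,
        input=text,
    ) as response:
        response.stream_to_file(path)
    return Path(path)
```

With a configured client (`from openai import OpenAI; client = OpenAI()`), a call such as `save_speech(client, "Hi! How's your day going?")` should leave the audio in `output.mp3`. Passing the client in as an argument keeps the helper easy to test with a stand-in object and avoids hard-coding an API key.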