OpenAI's text-to-speech (TTS)

OpenAI now provides models that generate human-like speech from text input, known as text-to-speech (TTS). Several voices are available to choose from, and the audio response can be streamed to a local file or passed on to a downstream application.
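As a rough illustration of that workflow, the sketch below separates building the request from sending it. The helper names `build_speech_request` and `synthesize_to_file` are hypothetical and not part of the OpenAI library, and the "alloy" voice and the sample text are placeholder choices. It assumes the `openai` Python package (version 1.x) is installed and that the API key is available in the `OPENAI_API_KEY` environment variable.

```python
from pathlib import Path


def build_speech_request(text: str, voice: str = "alloy",
                         model: str = "gpt-4o-mini-tts") -> dict:
    """Collect the keyword arguments for a text-to-speech request.

    Hypothetical helper: the defaults here are illustrative assumptions.
    """
    if not text.strip():
        raise ValueError("input text must not be empty")
    return {"model": model, "voice": voice, "input": text}


def synthesize_to_file(text: str, out_path: str, voice: str = "alloy") -> Path:
    """Send the request and stream the returned audio to out_path."""
    # Imported lazily so the request builder works without the SDK installed.
    from openai import OpenAI

    # With no api_key argument, the client reads OPENAI_API_KEY from the environment.
    client = OpenAI()
    request = build_speech_request(text, voice=voice)

    # The streaming variant writes audio chunks to disk as they arrive
    # instead of holding the whole response in memory first.
    with client.audio.speech.with_streaming_response.create(**request) as response:
        response.stream_to_file(out_path)
    return Path(out_path)
```

Calling `synthesize_to_file("Welcome to the course!", "welcome.mp3")` would then be expected to write the spoken audio to `welcome.mp3`, provided the key is valid and the chosen voice is supported by the model.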

This exercise is part of the course

Multi-Modal Systems with the OpenAI API

Exercise instructions

  • Create the text-to-speech request for "Hi! How's your day going?", using the "ballad" voice.
  • Stream the response to an .mp3 file.

Interactive exercise

Complete the sample code to successfully finish this exercise.

from openai import OpenAI

client = OpenAI(api_key="")

# Create the text-to-speech request
response = client.audio.speech.create(
  model="gpt-4o-mini-tts",
  ____,
  input="Hi! How's your day going?"
)

# Stream the response to an MP3 file
response.____("output.mp3")