Skip to content

Create speech

POST https://api.fastapi.ai/v1/audio/speech

Generates audio from the input text.

Request body


model string Required
One of the available TTS models: tts-1 or tts-1-hd


input string Required
The text to generate audio for. The maximum length is 4096 characters.


voice string Required
The voice to use when generating the audio. Supported voices are alloy, ash, coral, echo, fable, onyx, nova, sage and shimmer. Previews of the voices are available in the Text to speech guide.


response_format string Optional Defaults to mp3
The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.


speed number Optional Defaults to 1
The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.

Returns


The audio file content.

Example

Request

bash
curl https://api.fastapi.ai/v1/audio/speech \
  -H "Authorization: Bearer $FAST_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tts-1",
    "input": "The quick brown fox jumped over the lazy dog.",
    "voice": "alloy"
  }' \
  --output speech.mp3

那年我双手插兜, 让bug稳如老狗