Create speech

POST https://api.fastapi.ai/v1/audio/speech

Generates audio from the input text.

Request body

model string Required
One of the available TTS models: tts-1 or tts-1-hd.

input string Required
The text to generate audio for. The maximum length is 4096 characters.

voice string Required
The voice to use when generating the audio. Supported voices are alloy, ash, coral, echo, fable, onyx, nova, sage and shimmer. Previews of the voices are available in the Text to speech guide.

response_format string Optional Defaults to mp3
The format of the returned audio. Supported formats are mp3, opus, aac, flac, wav, and pcm.

speed number Optional Defaults to 1
The speed of the generated audio. Accepts a value from 0.25 to 4.0.
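
The documented limits on input and speed can be checked locally before a request is sent. The following is a minimal sketch, not part of the API: validate_speech_request is a hypothetical helper name, and the length check counts characters according to the shell's current locale, which may differ from how the server counts them.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the API): checks a request against the
# limits listed in the parameter descriptions above.
# Usage: validate_speech_request INPUT [SPEED]
validate_speech_request() {
  local input="$1" speed="${2:-1}"

  # input: at most 4096 characters (counted in the current locale).
  if [ "${#input}" -gt 4096 ]; then
    echo "input exceeds 4096 characters" >&2
    return 1
  fi

  # speed: a plain decimal number between 0.25 and 4.0 inclusive.
  # awk does the floating-point comparison that [ ] cannot.
  if ! awk -v s="$speed" \
      'BEGIN { exit !(s ~ /^[0-9]*\.?[0-9]+$/ && s + 0 >= 0.25 && s + 0 <= 4.0) }'; then
    echo "speed must be between 0.25 and 4.0" >&2
    return 1
  fi
}
```

The function returns 0 when both checks pass and 1 otherwise, so it can guard a curl call with `validate_speech_request "$text" "$speed" && curl ...`.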

Returns

The audio file content.

Example

Request

bash

curl https://api.fastapi.ai/v1/audio/speech \
  -H "Authorization: Bearer $FAST_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tts-1",
    "input": "The quick brown fox jumped over the lazy dog.",
    "voice": "alloy"
  }' \
  --output speech.mp3
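
The request above uses only the required parameters. As a rough sketch of how the optional response_format and speed fields might be included, the helper below assembles the JSON body in the shell. The name build_speech_body is hypothetical, and the escaping covers only backslashes and double quotes, so input containing newlines or other control characters would need a proper JSON encoder.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the API): prints a JSON request body.
# Usage: build_speech_body MODEL VOICE INPUT [RESPONSE_FORMAT] [SPEED]
build_speech_body() {
  local model="$1" voice="$2" input="$3" format="${4:-mp3}" speed="${5:-1}"

  # Escape backslashes first, then double quotes, so the text stays valid
  # inside a JSON string. Control characters are not handled in this sketch.
  input=$(printf '%s' "$input" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g')

  printf '{"model":"%s","input":"%s","voice":"%s","response_format":"%s","speed":%s}\n' \
    "$model" "$input" "$voice" "$format" "$speed"
}

# Example use with the endpoint documented above (left commented out because
# it needs network access and a valid key in $FAST_API_KEY):
#
# curl https://api.fastapi.ai/v1/audio/speech \
#   -H "Authorization: Bearer $FAST_API_KEY" \
#   -H "Content-Type: application/json" \
#   -d "$(build_speech_body tts-1-hd nova 'Hello there.' wav 1.25)" \
#   --output speech.wav
```

Matching the --output file extension to response_format keeps the saved file consistent with its contents.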
