Create speech
POST https://api.fastapi.ai/v1/audio/speech
Generates audio from the input text.
Request body
model string Required
One of the available TTS models: tts-1 or tts-1-hd
input string Required
The text to generate audio for. The maximum length is 4096 characters.
voice string Required
The voice to use when generating the audio. Supported voices are alloy, ash, coral, echo, fable, onyx, nova, sage and shimmer. Previews of the voices are available in the Text to speech guide.
response_format string Optional Defaults to mp3
The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.
speed number Optional Defaults to 1
The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.
Returns
The audio file content.
Example
Request
bash
curl https://api.fastapi.ai/v1/audio/speech \
-H "Authorization: Bearer $FAST_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "tts-1",
"input": "The quick brown fox jumped over the lazy dog.",
"voice": "alloy"
}' \
--output speech.mp3