Create speech
POST https://api.fastapi.ai/v1/audio/speech
Generates audio from the input text.
Request body
model
string Required
One of the available TTS models: tts-1 or tts-1-hd.
input
string Required
The text to generate audio for. The maximum length is 4096 characters.
voice
string Required
The voice to use when generating the audio. Supported voices are alloy, ash, coral, echo, fable, onyx, nova, sage, and shimmer. Previews of the voices are available in the Text to speech guide.
response_format
string Optional Defaults to mp3
The format to return the audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.
speed
number Optional Defaults to 1
The speed of the generated audio. Select a value from 0.25 to 4.0. The default is 1.0.
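The constraints listed for the request body can be checked on the client before a request is sent. The following is a minimal sketch, not part of any official client: the helper name build_speech_payload and its constants are hypothetical, and it assumes the 0.25 to 4.0 speed range is inclusive at both ends.

```python
# Illustrative sketch only: build_speech_payload is a hypothetical helper,
# not part of any official client library.

MODELS = {"tts-1", "tts-1-hd"}
VOICES = {"alloy", "ash", "coral", "echo", "fable", "onyx", "nova", "sage", "shimmer"}
FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}
MAX_INPUT_CHARS = 4096


def build_speech_payload(model, input_text, voice, response_format="mp3", speed=1.0):
    """Return a request-body dict after checking the documented constraints."""
    if model not in MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    if len(input_text) > MAX_INPUT_CHARS:
        raise ValueError(f"input exceeds {MAX_INPUT_CHARS} characters")
    if voice not in VOICES:
        raise ValueError(f"unsupported voice: {voice!r}")
    if response_format not in FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    # Assumption: both ends of the documented range are allowed.
    if not 0.25 <= speed <= 4.0:
        raise ValueError("speed must be between 0.25 and 4.0")
    return {
        "model": model,
        "input": input_text,
        "voice": voice,
        "response_format": response_format,
        "speed": speed,
    }
```

Calling build_speech_payload("tts-1", "Hello", "alloy") returns a dict with the documented defaults filled in (mp3 and 1.0), ready to be serialized as the JSON request body.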
Returns
The audio file content.
Example
Request
bash
curl https://api.fastapi.ai/v1/audio/speech \
  -H "Authorization: Bearer $FAST_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tts-1",
    "input": "The quick brown fox jumped over the lazy dog.",
    "voice": "alloy"
  }' \
  --output speech.mp3
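For callers who prefer a script to curl, the same request can be sketched with only the Python standard library. The function names build_speech_request and save_speech are hypothetical. The URL, headers, and body mirror the curl example. Error handling is left to the default behavior of urllib.

```python
import json
import urllib.request

# Endpoint URL as given in this reference.
SPEECH_URL = "https://api.fastapi.ai/v1/audio/speech"


def build_speech_request(api_key):
    """Build (but do not send) the same POST request as the curl example."""
    body = {
        "model": "tts-1",
        "input": "The quick brown fox jumped over the lazy dog.",
        "voice": "alloy",
    }
    return urllib.request.Request(
        SPEECH_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def save_speech(request, path):
    """Send the request and write the returned audio bytes to path."""
    # urlopen raises urllib.error.HTTPError on 4xx/5xx responses.
    with urllib.request.urlopen(request) as response, open(path, "wb") as out:
        out.write(response.read())
```

With the key exported as FAST_API_KEY, calling save_speech(build_speech_request(os.environ["FAST_API_KEY"]), "speech.mp3") (after import os) should write the same file as the curl command. The response body is written as raw bytes because the endpoint returns the audio file content directly.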