Create translation

POST https://api.fastapi.ai/v1/audio/translations

Translates audio into English.

Request body

file file Required
The audio file object (not file name) translate, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.

model string Required
ID of the model to use. Only whisper-1 (which is powered by our open source Whisper V2 model) is currently available.

prompt string Optional
An optional text to guide the model's style or continue a previous audio segment. The prompt should be in English.

response_format string Optional Defaults to json
The format of the output, in one of these options: json, text, srt, verbose_json, or vtt.

temperature number Optional Defaults to 0
The sampling temperature, between 0 and 1. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. If set to 0, the model will use log probability to automatically increase the temperature until certain thresholds are hit.

Returns

Returns the Audio objects.

The translated text.

Example

Request

bash

curl https://api.fastapi.ai/v1/audio/translations \
  -H "Authorization: Bearer $FAST_API_KEY" \
  -H "Content-Type: multipart/form-data" \
  -F file="@/path/to/file/german.m4a" \
  -F model="whisper-1"

Response

bash

{
  "text": "Hello, my name is Wolfgang and I come from Germany. Where are you heading today?"
}

Create translation ​

Request body ​

Returns ​

Example ​