Create translation

POST https://api.fastapi.ai/v1/audio/translations

Translates audio into English.

Request body


file file Required
The audio file object (not the file name) to translate, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.


model string Required
ID of the model to use. Only whisper-1 (which is powered by our open source Whisper V2 model) is currently available.


prompt string Optional
Optional text to guide the model's style or to continue a previous audio segment. The prompt should be in English.


response_format string Optional Defaults to json
The format of the output, in one of these options: json, text, srt, verbose_json, or vtt.


temperature number Optional Defaults to 0
The sampling temperature, between 0 and 1. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. If set to 0, the model will use log probability to automatically increase the temperature until certain thresholds are hit.

Returns


The translated text.

Example

Request

bash
curl https://api.fastapi.ai/v1/audio/translations \
  -H "Authorization: Bearer $FAST_API_KEY" \
  -H "Content-Type: multipart/form-data" \
  -F file="@/path/to/file/german.m4a" \
  -F model="whisper-1"

Response

json
{
  "text": "Hello, my name is Wolfgang and I come from Germany. Where are you heading today?"
}
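
The optional parameters listed under Request body can be sent as extra form fields alongside file and model. The following is a minimal sketch, not an official client: the helper name translate_audio and the DRY_RUN switch are hypothetical, introduced here only for illustration, and the call assumes FAST_API_KEY is set in the environment as in the request above.

```bash
# Hypothetical helper (not part of the API): translate_audio FILE [RESPONSE_FORMAT] [TEMPERATURE]
# Set DRY_RUN=1 to print the curl command instead of sending the request.
translate_audio() {
  file=$1
  format=${2:-json}        # documented default
  temperature=${3:-0}      # documented default

  # Accept only the response formats listed in the reference.
  case $format in
    json|text|srt|verbose_json|vtt) ;;
    *) echo "unsupported response_format: $format" >&2; return 1 ;;
  esac

  # Each -F adds one form field; with -F, curl sends the body as multipart/form-data.
  set -- https://api.fastapi.ai/v1/audio/translations \
    -H "Authorization: Bearer $FAST_API_KEY" \
    -F "file=@$file" \
    -F "model=whisper-1" \
    -F "response_format=$format" \
    -F "temperature=$temperature"

  if [ "${DRY_RUN:-0}" = 1 ]; then
    echo curl "$@"
  else
    curl "$@"
  fi
}
```

For example, translate_audio /path/to/file/german.m4a srt 0.2 would request subtitle output at a slightly higher sampling temperature. Running it with DRY_RUN=1 first shows the exact command without contacting the server.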