Provide your bearer token in the Authorization header when making requests to protected resources. Example: Authorization: Bearer ********************
Example:
Bearer {{YOUR_API_KEY}}
Body Params multipart/form-data
file
file
required
The audio file object to be transcribed (not the file name) is in the format of flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
model
string
required
The model ID to be used. Currently, only Whisper-1 is available.
language
string
optional
Input the language of the audio. Providing input language in ISO-639-1 format can improve accuracy and latency.
prompt
string
optional
An optional text to guide the style of the model or continue with the previous audio paragraph. The prompt should match the audio language.
response_format
string
optional
Default is JSON The format for transcription output can be selected from JSON, Text, SRT, Verbose_JSON, or VTT.
temperature
number
optional
Default is 0 Sampling temperature, between 0 and 1. A higher value like 0.8 will make the output more random, while a lower value like 0.2 will make it more concentrated and deterministic. If set to 0, the model will automatically increase the temperature using logarithmic probability until a specific threshold is reached.
{"text":"Imagine the wildest idea that you've ever had, and you're curious about how it might scale to something that's a 100, a 1,000 times bigger. This is a place where you can get to do that."}