语音转文本 / gpt-4o-transcribe

主站接口①

https://api2.aigcbest.top/v1

POST

/audio/transcriptions

官方文档：https://platform.openai.com/docs/api-reference/audio/createTranscription

请求参数

Authorization

在 Header 添加参数

Authorization

，其值为在 Bearer 之后拼接 Token

示例：

Authorization: Bearer ********************

Header 参数

Content-Type

string

必需

示例值:

multipart/form-data

string

必需

示例值:

application/json

Authorization

string

可选

示例值:

Bearer {{YOUR_API_KEY}}

Body 参数multipart/form-data

file

必需

要转录的音频文件，采用以下格式之一：mp3、mp4、mpeg、mpga、m4a、wav 或 webm。

示例值:

file:///Users/jun/Downloads/response.wav

model

string

必需

要使用的模型的 ID。仅whisper-1当前可用。

示例值:

gpt-4o-transcribe

prompt

string

可选

可选文本，用于指导模型的风格或继续之前的音频片段。提示应与音频语言相匹配。

示例值:

eiusmod nulla

response_format

string

可选

示例值:

json

temperature

number

可选

采样温度，介于 0 和 1 之间。较高的值（如 0.8）将使输出更加随机，而较低的值（如 0.2）将使输出更加集中和确定。如果设置为 0，模型将使用对数概率自动升高温度，直到达到特定阈值。

示例值:

language

string

可选

输入音频的语言。以ISO-639-1格式提供输入语言将提高准确性和延迟。

示例代码

Shell

JavaScript

Java

Swift

PHP

Python

HTTP

Objective-C

Ruby

OCaml

Dart

curl --location --request POST 'https://api2.aigcbest.top/v1/audio/transcriptions' \
--header 'Accept: application/json' \
--header 'Authorization: Bearer ' \
--header 'Content-Type: multipart/form-data' \
--form 'file=@"/Users/jun/Downloads/response.wav"' \
--form 'model="gpt-4o-transcribe"' \
--form 'prompt="eiusmod nulla"' \
--form 'response_format="json"' \
--form 'temperature="0"' \
--form 'language=""'

返回响应

🟢200OK

application/json

Body

text

string

必需

示例

{
    "text": "Imagine the wildest idea that you've ever had, and you're curious about how it might scale to something that's a 100, a 1,000 times bigger. This is a place where you can get to do that."
}

修改于 2025-06-26 00:14:53

语音转文本 / whisper-1

音频翻译