Speech Recognition Request (Summary)
POST
https://api.mindcraft.com.cn/v1/audio/transcriptions
Request parameters
Header parameters
Authorization
string
Authentication credentials
Example:
Bearer {{api_key}}
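
The `Authorization` header above can be built as in the following minimal sketch. The environment variable name `MINDCRAFT_API_KEY` and the helper `auth_headers` are assumptions made for illustration; they are not part of the API.

```python
import os


def auth_headers(api_key: str) -> dict:
    """Return the Authorization header in the 'Bearer <api_key>' form shown above."""
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    return {"Authorization": f"Bearer {api_key}"}


# MINDCRAFT_API_KEY is a hypothetical variable name; store the key wherever suits you.
headers = auth_headers(os.environ.get("MINDCRAFT_API_KEY", "YOUR_API_KEY"))
```

Reading the key from the environment keeps it out of source control.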
Body parameters (multipart/form-data)
file
file
Common parameter: the file to upload
model
enum<string>
Model selection
Enum values:
TX_ASR_long_8k_zh, TX_ASR_long_8k_en,
TX_ASR_long_16k_zh, TX_ASR_long_16k_zh-PY, TX_ASR_long_16k_zh_medical, TX_ASR_long_16k_en, TX_ASR_long_16k_yue, TX_ASR_long_16k_ja, TX_ASR_long_16k_ko, TX_ASR_long_16k_vi, TX_ASR_long_16k_ms, TX_ASR_long_16k_id, TX_ASR_long_16k_fil, TX_ASR_long_16k_th, TX_ASR_long_16k_pt, TX_ASR_long_16k_tr, TX_ASR_long_16k_ar, TX_ASR_long_16k_es, TX_ASR_long_16k_hi, TX_ASR_long_16k_fr, TX_ASR_long_16k_de, TX_ASR_long_16k_zh_dialect,
TX_ASRL_long_8k_zh_large, TX_ASRL_long_16k_zh_large, TX_ASRL_long_16k_multi_lang,
TX_ASR_sentence_8k_zh, TX_ASR_sentence_8k_en,
TX_ASR_sentence_16k_zh, TX_ASR_sentence_16k_zh-PY, TX_ASR_sentence_16k_zh_medical, TX_ASR_sentence_16k_en, TX_ASR_sentence_16k_yue, TX_ASR_sentence_16k_ja, TX_ASR_sentence_16k_ko, TX_ASR_sentence_16k_vi, TX_ASR_sentence_16k_ms, TX_ASR_sentence_16k_id, TX_ASR_sentence_16k_fil, TX_ASR_sentence_16k_th, TX_ASR_sentence_16k_pt, TX_ASR_sentence_16k_tr, TX_ASR_sentence_16k_ar, TX_ASR_sentence_16k_es, TX_ASR_sentence_16k_hi, TX_ASR_sentence_16k_fr, TX_ASR_sentence_16k_de, TX_ASR_sentence_16k_zh_dialect,
ALI_ASR_realtime_paraformer-realtime-v1, ALI_ASR_realtime_paraformer-realtime-v2, ALI_ASRL_long_sensevoice-v1,
ZJ_ASR_sentence
Example:
TX_ASR_sentence_16k_tr
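
The `TX_ASR_*` values in the enum follow a visible `TX_ASR_{long|sentence}_{8k|16k}_{language}` pattern, so a client can compose a name and check it against the documented set before sending. The sketch below is one hypothetical way to do that; the helper `tx_asr_model` is not part of the API, and reading `8k`/`16k` as sample rates of 8000/16000 Hz is an assumption drawn from the naming only.

```python
# Language suffixes copied from the TX_ASR_* enum values above.
LANGS_8K = ["zh", "en"]
LANGS_16K = [
    "zh", "zh-PY", "zh_medical", "en", "yue", "ja", "ko", "vi", "ms", "id",
    "fil", "th", "pt", "tr", "ar", "es", "hi", "fr", "de", "zh_dialect",
]

# Rebuild the documented TX_ASR_* names (the TX_ASRL_*, ALI_* and ZJ_* values are not covered).
TX_ASR_MODELS = set()
for mode in ("long", "sentence"):
    for lang in LANGS_8K:
        TX_ASR_MODELS.add(f"TX_ASR_{mode}_8k_{lang}")
    for lang in LANGS_16K:
        TX_ASR_MODELS.add(f"TX_ASR_{mode}_16k_{lang}")


def tx_asr_model(mode: str, sample_rate: int, lang: str) -> str:
    """Compose a TX_ASR_* model name and reject anything outside the documented enum.

    Assumption: the '8k' / '16k' part of the name corresponds to 8000 / 16000 Hz audio.
    """
    name = f"TX_ASR_{mode}_{sample_rate // 1000}k_{lang}"
    if name not in TX_ASR_MODELS:
        raise ValueError(f"not in the documented enum: {name}")
    return name


model = tx_asr_model("sentence", 16000, "tr")
```

Validating locally avoids a round trip for a misspelled model name; the server remains the authority on which models are actually available.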
format
enum<string>
Audio format
Enum values:
wav, pcm, ogg-opus, speex, silk, mp3, m4a, aac, amr, raw
Example:
wav
sample_rate
integer
Required
<= 20
Example:
8000
channel_num
enum<integer>
Required
Enum values:
1, 2
Default:
1
Example:
1
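
For WAV input, `sample_rate` and `channel_num` can be read from the file header instead of being hard-coded. The sketch below uses Python's standard `wave` module; the helper name `wav_params` is hypothetical, and it assumes an uncompressed PCM WAV file, which is what `wave` can read.

```python
import io
import wave


def wav_params(source) -> dict:
    """Read sample_rate and channel_num from a WAV file (str path or binary file object)."""
    with wave.open(source, "rb") as wav:
        channels = wav.getnchannels()
        rate = wav.getframerate()
    # The channel_num enum above only allows 1 or 2.
    if channels not in (1, 2):
        raise ValueError(f"channel_num must be 1 or 2, got {channels}")
    return {"sample_rate": rate, "channel_num": channels}


# Demo: write 10 ms of 16-bit mono silence at 8000 Hz into memory, then read it back.
demo = io.BytesIO()
with wave.open(demo, "wb") as out:
    out.setnchannels(1)
    out.setsampwidth(2)
    out.setframerate(8000)
    out.writeframes(b"\x00\x00" * 80)
demo.seek(0)
params = wav_params(demo)
```

Other formats in the `format` enum (for example `mp3` or `silk`) need a different way to obtain these values.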
language
enum<string>
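Optional
Enum values:
auto, zh, en, yue, ja, ko, ru, it, de, es, ca, id, th, nl, pt, cs, pl, el, ms, tl, bg, hr, da, tr, vi, he, hu, uk, uz, no, ro, sv, fa, ta, az, bn, my, km, hi, kn, lo, ml, mr, mn, ne, pa, si, sw, te, ur, ha
Example: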
auto
speaker
enum<integer>
Optional
Enum values:
0, 1
Example:
1
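
Putting the fields documented so far together, a request could be assembled roughly as follows. This is a sketch using only the Python standard library, not an official client: `encode_multipart` and `build_request` are hypothetical helpers, `YOUR_API_KEY` and `sample.wav` are placeholders, and the response format is not described in this section, so the example stops at building the request rather than parsing a reply.

```python
import urllib.request
import uuid

API_URL = "https://api.mindcraft.com.cn/v1/audio/transcriptions"


def encode_multipart(fields: dict, filename: str, file_bytes: bytes):
    """Encode text fields plus one 'file' part as multipart/form-data."""
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\nContent-Disposition: form-data; name="{name}"'
            f"\r\n\r\n{value}\r\n".encode("utf-8")
        )
    # The upload uses the field name "file", as listed in the body parameters.
    parts.append(
        f'--{boundary}\r\nContent-Disposition: form-data; name="file"; '
        f'filename="{filename}"\r\nContent-Type: application/octet-stream'
        f"\r\n\r\n".encode("utf-8")
    )
    parts.append(file_bytes)
    parts.append(f"\r\n--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def build_request(api_key: str, audio: bytes, filename: str, **fields):
    """Build (but do not send) a POST request for the transcription endpoint."""
    text_fields = {key: str(value) for key, value in fields.items()}
    body, content_type = encode_multipart(text_fields, filename, audio)
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={"Authorization": f"Bearer {api_key}", "Content-Type": content_type},
    )


req = build_request(
    "YOUR_API_KEY",        # placeholder credential
    b"\x00\x00" * 80,      # placeholder audio bytes; read a real file in practice
    "sample.wav",
    model="TX_ASR_sentence_8k_zh",
    format="wav",
    sample_rate=8000,
    channel_num=1,
    language="auto",
    speaker=1,
)
# To send it: with urllib.request.urlopen(req) as resp: print(resp.read().decode())
```

Building the request separately from sending it makes the payload easy to inspect before any network call is made.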
speaker_num
integer