GPTACG_api

Create transcription

POST https://api.gptacg.top/v1/audio/transcriptions

Request parameters

Body parameters (multipart/form-data)

file

Required

The audio file object (not the file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.

model

string

Required

ID of the model to use. Only whisper-1 is currently available.

language

string

Optional

The language of the input audio. Supplying the input language in ISO-639-1 format can improve accuracy and reduce latency.

prompt

string

Optional

Optional text to guide the model's style or to continue a previous audio segment. The prompt should match the audio language.

response_format

string

Optional

Defaults to json
The format of the transcript output. One of: json, text, srt, verbose_json, or vtt.

temperature

number

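Optional

Defaults to 0
The sampling temperature, between 0 and 1. Higher values such as 0.8 make the output more random, while lower values such as 0.2 make it more focused and deterministic. If set to 0, the model uses log probability to raise the temperature automatically until certain thresholds are reached.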
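The constraints listed for these parameters can be checked on the client before a request is sent. The following is a minimal sketch, not part of the API: the helper name `validate_params` is hypothetical, and checking the file extension is only an approximation, since this page does not say how the service detects the audio format.

```python
import os

# Values taken from the parameter descriptions on this page.
SUPPORTED_EXTENSIONS = {"flac", "mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm"}
RESPONSE_FORMATS = {"json", "text", "srt", "verbose_json", "vtt"}


def validate_params(path, model, response_format="json", temperature=0.0):
    """Return a list of problems; an empty list means the inputs look valid."""
    problems = []
    # Assumption: the file extension reflects the actual audio format.
    ext = os.path.splitext(path)[1].lower().lstrip(".")
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported file extension: {ext!r}")
    if not model:
        problems.append("model is required")
    if response_format not in RESPONSE_FORMATS:
        problems.append(f"unknown response_format: {response_format!r}")
    if not 0 <= temperature <= 1:
        problems.append("temperature must be between 0 and 1")
    return problems
```

Returning a list rather than raising on the first error lets a caller report every problem at once.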

Sample code
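A minimal sketch of a request to this endpoint, using only the Python standard library. Several details are assumptions rather than documented behavior: the `Authorization: Bearer` header (this page does not describe authentication), sending the multipart body with POST as is usual for file uploads, and the helper names `build_multipart` and `transcribe`, which are hypothetical. The function expects the default json response format.

```python
import json
import os
import urllib.request
import uuid

API_URL = "https://api.gptacg.top/v1/audio/transcriptions"


def build_multipart(fields, file_name, file_bytes):
    """Encode text fields plus one 'file' part as multipart/form-data."""
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f"{value}\r\n".encode("utf-8")
        )
    parts.append(
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n".encode("utf-8")
        + file_bytes
        + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def transcribe(path, api_key, model="whisper-1", **optional):
    """Upload the audio file at `path` and return the decoded JSON response."""
    with open(path, "rb") as fh:
        audio = fh.read()
    # Optional fields: language, prompt, response_format, temperature.
    fields = {"model": model, **optional}
    body, content_type = build_multipart(fields, os.path.basename(path), audio)
    request = urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Content-Type": content_type,
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
    )
    with urllib.request.urlopen(request) as response:
        # Only valid for the default json format; text, srt and vtt are not JSON.
        return json.loads(response.read().decode("utf-8"))
```

A call might look like `transcribe("talk.mp3", api_key, language="en", temperature=0)`, where `talk.mp3` and `api_key` are placeholders.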

Response

Success (200)

HTTP status code: 200

Content type: JSON (application/json)

text

string

Required

{
  "text": "Imagine the wildest idea that you've ever had, and you're curious about how it might scale to something that's a 100, a 1,000 times bigger. This is a place where you can get to do that."
}
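A response like the one above can be reduced to the transcript string with a small helper. This is a sketch that assumes the default json format; `extract_text` is a hypothetical name, not part of the API.

```python
import json


def extract_text(raw):
    """Return the required 'text' field from a JSON transcription response."""
    payload = json.loads(raw)
    if not isinstance(payload, dict) or "text" not in payload:
        raise ValueError("response is missing the required 'text' field")
    return payload["text"]
```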

Last modified: 2 months ago