代码之家 › 专栏 › 技术社区 › Nelson Teixeira

糟糕的沃森抄本

ibm-watson

Nelson Teixeira · 技术社区 · 5 年前

我使用了以下命令:

curl -X POST -u "apikey:<key>" --header "Content-Type: audio/mp3" --data-binary @./file.mp3 
"https://api.us-south.speech-to-text.watson.cloud.ibm.com/instances/<code>/v1/recognize/model=pt-BR_BroadbandModel"

https://drive.google.com/file/d/1Xuibxksudp55uwaz6oSOccTZ3pP7Dya9/view?usp=sharing

华生不可能有这么糟糕的抄本。我错过了什么?我需要先设置一些参数还是在音频中做一些工作?

我也试过窄带模型。我也试过flac。

0 回复 | 直到 5 年前

-1

optimus 5 年前

沃森ibmapi似乎没有为最终用户正确编码,原因似乎是他们的api设计对于转录过于复杂。它有一个错误,我相信他们的团队还没有破译出来

不过,与谷歌合作是明智的

    pip install --upgrade SpeechRecognition(linux, unix systems)
or  C:\path_to_ python.exe -m pip install --upgrade SpeechRecognition (windows)

这是一个具有所有内置功能的模块不同api创建者(如ibm)的容量仅仅通过使用

import speech_recogntion as sr
r = sr.Recognizer()
with sr.AudioFile("path to audio file") as source:
       #r.adjust_for_ambient_noise() depending on if you have background noise 
      audio = r.record(source)

然后; 识别文件输出其中xxx是列表中的api创建者。说

  google, ibm, azure or bing(with microsoft)
  t = r.recognize_xxx(audio, credentials, ...)

这只是一个粗略的指南

推荐文章

Obehi Inegbedion · Watson Studio在尝试创建免费服务时出错

3 年前

grunter-hokage · 如何从Unity中的Watson对话服务获取$上下文变量?

7 年前

user6269864 · Watson对话中的分类错误

7 年前

grunter-hokage · 从Unity中的Watson对话中获取意图和实体

7 年前

Shaun Yan · 您好,如何使用IBM Watson对话显示与使用IBM Watson的facebook messenger链接的图像?

7 年前

Aurangzeb Rathore · Watson NLU-没有为语言检测提供足够的文本

7 年前

mateuszb · 如何检索已注册回调URL的列表?

7 年前

Federico Bacci · 无法使用IBM Watson SDK为HoloLens构建Unity项目

7 年前

Leo · Watson对话节点。js使用learning\u opt\u out创建工作区

7 年前

RileyZ71 · IBM Watson自然语言理解上传多个文档进行分析

7 年前