Questions tagged [azure-speech]

Azure Speech is a cloud-based speech-to-text service in the Microsoft Azure family of machine learning and artificial intelligence APIs.

140 questions
5
votes
1 answer

How To Save Azure Continuous Speech Recognition Results In A Variable?

I am trying to use Azure Continuous Speech Recognition for a Speech to Text project. Here is the sample code that was provided by Azure: def speech_recognize_continuous_from_file(): """performs continuous speech recognition with input from an…
Arman Ghoreshi
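
One common way to keep continuous recognition results is to append each final phrase to a list from inside the event callback. The sketch below is a minimal illustration, not the asker's code: `TranscriptCollector` and `fake_event` are hypothetical names, and the fake event only mimics the `evt.result.text` shape so the example runs without the Speech SDK or a subscription key.

```python
from dataclasses import dataclass, field
from types import SimpleNamespace
from typing import List


@dataclass
class TranscriptCollector:
    """Accumulates recognized phrases delivered through event callbacks."""
    phrases: List[str] = field(default_factory=list)

    def on_recognized(self, evt) -> None:
        # Assumes the event exposes the final phrase text as `evt.result.text`.
        text = evt.result.text
        if text:  # skip empty results, e.g. stretches of silence
            self.phrases.append(text)

    @property
    def transcript(self) -> str:
        return " ".join(self.phrases)


def fake_event(text: str):
    """Hypothetical stand-in for an SDK event object, for offline use only."""
    return SimpleNamespace(result=SimpleNamespace(text=text))


collector = TranscriptCollector()
# With the real SDK the callback would be registered rather than called by hand:
#   speech_recognizer.recognized.connect(collector.on_recognized)
for chunk in ["Hello world.", "", "This is a test."]:
    collector.on_recognized(fake_event(chunk))

print(collector.transcript)
```

After recognition stops, `collector.phrases` holds the individual phrases and `collector.transcript` the joined text, so the results are available as ordinary variables instead of only being printed.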
4
votes
2 answers

How to enable word level Confidence for MS Azure Speech to Text Service

How can I get a word-level confidence score for each word using the MS Azure speech-to-text service? Currently, I get a confidence value at the sentence level, but I need word-level confidence for further processing.
3
votes
1 answer

azure-cognitiveservices-speech api error while using with AWS Lambda

I am trying to use the Microsoft speech-to-text API (azure-cognitiveservices-speech) in AWS Lambda. In AWS Lambda I am using a Docker image (public.ecr.aws/lambda/python:3.8). When I run the code I get the following error: Speech Recognition…
3
votes
1 answer

How to use Webhook for Microsoft cognitive Speech to Text V3

I'm trying to understand how to use a webhook in Microsoft Speech to Text V3. Following the docs, I was able to create a webhook and ping it. Now whenever a transcription is complete the webhook is called, but the body object in the request is empty…
3
votes
2 answers

Azure Text to speech Throttled due to too many requests websocket error code: 1007 with only 3 concurrent requests

I'm using the Azure text-to-speech service. I have a lot of small SSML files for which I generate the audio, then combine them using FFmpeg. I wrote a script to generate all the small audio files. If the script makes 3 concurrent requests, then after…
otusweb
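
A generic client-side pattern for this kind of batch job is to cap concurrency with a semaphore and retry throttled requests with exponential backoff. The sketch below does not diagnose the 1007 error in the question. `ThrottledError`, `with_retry`, `synthesize_all`, and `make_fake_job` are all hypothetical names, and the fake jobs stand in for real synthesis calls so the example runs offline.

```python
import asyncio
import random


class ThrottledError(Exception):
    """Hypothetical error standing in for a 'too many requests' response."""


async def with_retry(task_factory, semaphore, max_attempts=5, base_delay=0.01):
    """Run one request under a concurrency cap, backing off when throttled."""
    for attempt in range(max_attempts):
        async with semaphore:
            try:
                return await task_factory()
            except ThrottledError:
                if attempt == max_attempts - 1:
                    raise  # give up after the last attempt
        # Sleep outside the semaphore so a waiting task does not hold a slot.
        delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
        await asyncio.sleep(delay)


async def synthesize_all(jobs, max_concurrent=2):
    semaphore = asyncio.Semaphore(max_concurrent)
    return await asyncio.gather(*(with_retry(job, semaphore) for job in jobs))


def make_fake_job(name, failures):
    """Hypothetical job that is throttled `failures` times, then succeeds."""
    state = {"left": failures}

    async def job():
        if state["left"] > 0:
            state["left"] -= 1
            raise ThrottledError(name)
        return f"{name}.wav"

    return job


jobs = [make_fake_job(f"clip{i}", failures=i % 2) for i in range(4)]
results = asyncio.run(synthesize_all(jobs))
print(results)
```

The delays here are tiny so the demo finishes quickly; against a real service the base delay and the concurrency cap would need tuning to the quota of the resource in use.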
3
votes
2 answers

SpeakSsmlAsync returns BadRequest

When calling SpeakSsmlAsync (Microsoft Speech SDK), the following error message is returned: > CANCELED: Reason=Error > CANCELED: ErrorCode=BadRequest > CANCELED: ErrorDetails=[HTTPAPI result code = HTTPAPI_OK. HTTP status code=400.] > CANCELED:…
3
votes
0 answers

Does the "PhraseListGrammar Class" from the Azure "Speech Service" just work when using US english as the speech recognition language?

I'm setting up an application for Android using Unity in which I want to add speech-to-text functionality. For that purpose I'm using the SDK of Azure's Speech service. The problem is that when I say a word, the recognizer tries to guess the word…
3
votes
2 answers

Microsoft Cognitive speech services - Speech to text batch transcription permission error

Hello! I am trying to use MS cognitive services for speech to text transcription. I need to run a sample first before I build it into my production app though. So I am trying out this: Git sample This sample basically just asks you to fill in the…
3
votes
1 answer

Binary file to base64 nodejs

I'm getting a binary audio file when I call the API at tts.speech.microsoft.com and I would like to transform this binary into a base64 string. I've been trying a lot of things, for example: Buffer.from(body, "binary").toString("base64"); does not…
GuillaumeC
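
The question is about Node.js and its excerpt is cut off, so the actual cause is not known here. A frequent pitfall in any language, though, is decoding the response body as text before base64-encoding it. The sketch below illustrates that principle in Python with made-up sample bytes: encoding the raw bytes round-trips, while passing them through a text decode first does not.

```python
import base64

# Made-up sample bytes; several of them are not valid UTF-8, as in real audio.
audio_bytes = bytes([0x52, 0x49, 0x46, 0x46, 0xFF, 0xFE, 0x00, 0x80])

# Correct: base64-encode the raw bytes directly.
encoded = base64.b64encode(audio_bytes).decode("ascii")
round_trip_ok = base64.b64decode(encoded) == audio_bytes

# Lossy: decoding to text first replaces invalid byte sequences,
# so the base64 of the result no longer matches the original audio.
as_text = audio_bytes.decode("utf-8", errors="replace")
corrupted = base64.b64encode(as_text.encode("utf-8")).decode("ascii")

print(round_trip_ok)
print(encoded == corrupted)
```

In an HTTP client this usually means asking for the response as raw bytes (a buffer) instead of a string before encoding it.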
2
votes
1 answer

Azure Batch Transcription: Error when downloading the recording URI. StatusCode: Conflict (fail to download)

I'm trying to use the batch transcription of Microsoft Azure. Following the official documentation, I guess it should work like this: I send the audio to the service (POST), then get the results (GET with the parameters of the POST), but I'm not getting…
2
votes
1 answer

Azures speech to text webhook header X-MicrosoftSpeechServices-Signature algorithm?

I'm trying to implement an authentication middleware in PHP for the webhook callback of Azure's speech-to-text API. I would like to make use of the X-MicrosoftSpeechServices-Signature header for this. The docs state that this is the sha256 encrypted…
Chris
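
The excerpt is truncated, so the exact algorithm is not confirmed here. The sketch below assumes the header carries a Base64-encoded HMAC-SHA256 of the raw request body, keyed with the secret configured for the webhook. It is written in Python to show the check itself, not as a PHP middleware, and `compute_signature`, `is_valid_signature`, the secret, and the body are all made up for illustration.

```python
import base64
import hashlib
import hmac


def compute_signature(secret: str, body: bytes) -> str:
    """Assumed scheme: Base64(HMAC-SHA256(key=secret, message=raw body))."""
    digest = hmac.new(secret.encode("utf-8"), body, hashlib.sha256).digest()
    return base64.b64encode(digest).decode("ascii")


def is_valid_signature(secret: str, body: bytes, header_value: str) -> bool:
    expected = compute_signature(secret, body)
    # Constant-time comparison avoids leaking how many characters matched.
    return hmac.compare_digest(expected, header_value)


secret = "example-secret"            # made-up secret
body = b'{"example": "payload"}'     # made-up raw request body
header = compute_signature(secret, body)

print(is_valid_signature(secret, body, header))
print(is_valid_signature(secret, body + b" ", header))
```

Whatever the exact scheme turns out to be, the signature should be computed over the raw bytes of the request body as received, since re-serializing parsed JSON can change whitespace or key order and break the match.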
2
votes
1 answer

Speaker diarization for 3+ speakers using Azure

Does Azure's batch transcription support speaker diarization for more than 2 speakers? I checked their Rest API documentation and didn't find anything relevant. Are there other ways to do this using Azure cognitive services?
2
votes
2 answers

Microsoft Azure Cognitive Speech to Text - Returns undefined API Error

Currently I am creating an STT application using Node.js v16.x and microsoft-cognitiveservices-speech-sdk v1.17.0. I have created an Azure Speech Cognitive resource in the region westeurope and verified the key is correct. Now I am using the following…
TheDome
2
votes
1 answer

Intercepting audio blobs or saving audio to a file with Azure Speech to Text on Android

Is it possible to save the transcribed audio to a file or even intercept the audio blobs using the Java SDK? Currently I'm trying with com.microsoft.cognitiveservices.speech:client-sdk:1.19.0. I'm looking at the SDK but it doesn't have any clear…
2
votes
0 answers

Azure pronunciation/fluency assessment for non-native english speaker

I am using Azure Speech's fluency assessment. https://learn.microsoft.com/en-in/azure/cognitive-services/speech-service/how-to-pronunciation-assessment?pivots=programming-language-python My question is: the scoring returned keeping a native…
Anuj Gupta