I want to transcribe longer audio files (at least 5 minutes) using REST APIs from Microsoft. There are a lot of different products and names, e.g. Speech service API or Bing Speech API. None of the REST APIs I have tried so far supports transcribing longer audio files.

The documentation states there is a REST API exactly for this case: https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/batch-transcription

What is the endpoint for this service?

strohne

1 Answer

There is a sample available on GitHub here: https://github.com/PanosPeriorellis/Speech_Service-BatchTranscriptionAPI

The endpoint is CRIS's endpoint, as in this code:

private const string HostName = "cris.ai";
// ...
var client = CrisClient.CreateApiV2Client(SubscriptionKey, HostName, Port);

I then found in the documentation that the API is exposed through Swagger, which makes it easier to explore the available methods (switch from 2.0beta to 2.0 at the top of the page).

So to create a new transcription, the path is: /api/speechtotext/v2.0/transcriptions, called with the POST method, so the full endpoint is:

Please note that the subscription key used for transcription must be on the Standard (S0) pricing tier; a key on the Free tier will not work.
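As a rough illustration of the call described above, here is a minimal Python sketch that only builds the POST request without sending it. The host pattern `https://{region}.cris.ai` is taken from the URL quoted in the comments below. The `Ocp-Apim-Subscription-Key` header and the body fields `recordingsUrl`, `locale` and `name` are assumptions based on my reading of the v2.0 Swagger definition, so check them against the Swagger page before relying on them. The function name `build_transcription_request` is purely illustrative.

```python
import json
import urllib.request


def build_transcription_request(region, subscription_key, recording_url, locale="en-US"):
    """Build (but do not send) a POST request that creates a transcription."""
    # Host pattern taken from the URL quoted in the comments (e.g. westeurope.cris.ai).
    url = f"https://{region}.cris.ai/api/speechtotext/v2.0/transcriptions"

    # Assumed v2.0 body fields; verify the exact names in the Swagger definition.
    body = {
        "recordingsUrl": recording_url,  # URL the service can read the audio from
        "locale": locale,
        "name": "Example transcription",
    }

    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumed header name; the key must belong to a Standard (S0) subscription.
            "Ocp-Apim-Subscription-Key": subscription_key,
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_transcription_request(
        "westeurope", "<your-S0-key>", "https://example.com/audio.wav"
    )
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would actually submit it. With a Free tier key, expect the 400 error quoted in the comments below.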

Nicolas R
  • Thank you for the answer. This doesn't help yet. Which URL path etc. is this? I tried `https://westeurope.cris.ai/api/speechtotext/v2.0/transcriptions` with a subscription key from `https://westeurope.cris.ai/Subscriptions` and I get error 400 "Only "Standard" subscriptions for the region of the called service are valid." – strohne Nov 07 '18 at 19:16
  • So it looks like you have to get an API key of "standard" pricing. Your current key was a free one? – Nicolas R Nov 07 '18 at 22:15
  • yes. the problem was that I needed a S0 key, see https://github.com/MicrosoftDocs/azure-docs/issues/17188#issuecomment-437428984 can you add the details about the subscription key to your answer? then I will accept it as the right answer – strohne Nov 13 '18 at 15:36
  • @strohne were you able to transcribe multiple files with a single API call? If so can you explain how? I am only able to successfully transcribe a single file with this API and can't find multi-file documentation. – James May 16 '19 at 16:49
  • @James I only transcribed files one by one with multiple calls. – strohne Jul 12 '19 at 15:20
  • I think I am very late in responding to this question, but there is a way to do batch transcription. Please check https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/batch-transcription for more details. – Harmandeep Singh Kalsi May 13 '20 at 08:51
  • @HarmandeepSinghKalsi at the time the batch transcription didn't work. I haven't tested this lately so it may be more up to date. – James Jul 29 '20 at 17:33
  • Yeah, they have now released v3.0, which has a batch transcription API where you can just give the container path and all the files in it will be considered for transcription. – Harmandeep Singh Kalsi Jul 29 '20 at 18:48