Questions tagged [google-cloud-speech]

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API.

  • The API recognizes over 80 languages and variants, to support your global user base. You can transcribe the text of users dictating to an application’s microphone, enable command-and-control through voice, or transcribe audio files, among many other use cases.
  • Recognize audio uploaded in the request, and integrate with your audio storage on Google Cloud Storage, by using the same technology Google uses to power its own products.

try it

576 questions
74
votes
15 answers

ImportError: No module named google.cloud

I am unable to import google.cloud.speech from google.cloud import speech I have installed it using : pip install --upgrade google-cloud-speech -t dir-name It is giving me below error while importing it from dir-name ImportError: No module named…
nishith
  • 1,223
  • 1
  • 12
  • 21
39
votes
1 answer

Difference between Android Speech to Text API (Recognizer Intent) and Google Cloud Speech API?

So i'm looking into building a speech to text app for fun. I did some research and found an inbuilt Speech to Text API using RecognizerIntent that is free, but also found that google is now offerieng a cloud speech API that the charge for. My…
AndroidDev21921
  • 695
  • 1
  • 8
  • 21
33
votes
1 answer

Unable to get results from Google text to speech api while streaming audio from web

I want to stream audio from the web and convert that to text using Python Google-cloud-speech API. I have integrated that in my Django channels code. For frontend, I have directly copied this code and the backend has this code (please see below).…
21
votes
3 answers

How can I extract the preceding audio (from microphone) as a buffer when silence is detected (JS)?

I'm using the Google Cloud API for Speech-to-text, with a NodeJS back-end. The app needs to be able to listen for voice commands, and transmit them to the back-end as a buffer. For this, I need to send the buffer of the preceding audio when silence…
azhar
  • 1,709
  • 1
  • 19
  • 41
18
votes
3 answers

Google Speech Recognition API Result is Empty

I'm performing an asynchronous request to Google Cloud Speech API, and I do not know how to get the result of operation: Request POST: https://speech.googleapis.com/v1beta1/speech:asyncrecognize Body: { "config":{ "languageCode"…
Bruno
  • 438
  • 1
  • 5
  • 13
17
votes
2 answers

How to get SSML timestamps from Google Cloud text-to-speech API

I want to use SSML markers through the Google Cloud text-to-speech API to request the timing of these markers in the audio stream. These timestamps are necessary in order to provide cues for effects, word/section highlighting and feedback to the…
15
votes
3 answers

User data privacy in Google speech to text api / Google cloud speech solution

What level of data privacy is maintained when I used Google speech to text api or google cloud speech solution. I want to know this because I am working on an application which converts audio files into text and those files may contain user…
14
votes
2 answers

Google Speech-to-text API, InvalidArgument: 400 Must use single channel (mono)

I keep getting this error InvalidArgument: 400 in google Speech-to-text, and the problem seems to be that I an using a 2 channel audio(Stereo), and the API is waiting for a wav in (Mono). If I convert the file in a audio editor it might work, but I…
14
votes
1 answer

How do I stream live audio from the browser to Google Cloud Speech via socket.io?

I have a situation with a React-based app where I have an input for which I wanted to allow voice input as well. I'm okay making this compatible with Chrome and Firefox only, so I was thinking of using getUserMedia. I know I'll be using Google…
Amber B.
  • 1,134
  • 10
  • 20
13
votes
5 answers

Is there a way to generate an SRT file (or similar) using Google Cloud Speech?

In order to generate subtitles for my videos, I converted them to audio files and used the Cloud Speech-to-Text. It works, but it only generates transcriptions, whereas what I need is a *.srt/*.vtt/similar file. What I need is what YouTube does: to…
Lucas Caton
  • 3,027
  • 1
  • 24
  • 34
10
votes
3 answers

How to end Google Speech-to-Text streamingRecognize gracefully and get back the pending text results?

I'd like to be able to end a Google speech-to-text stream (created with streamingRecognize), and get back the pending SR (speech recognition) results. In a nutshell, the relevant Node.js code: // create SR stream const stream =…
noseratio
  • 59,932
  • 34
  • 208
  • 486
10
votes
2 answers

Cannot import com.google.cloud.speech.v1.SpeechGrpc in Android

I'm trying to use Google's Speech API in Android project. The example project works. I'm having trouble to use it in my own android app. build.gradle(Module:app) : apply plugin: 'com.android.application' apply plugin: 'com.google.protobuf' ext { …
9
votes
4 answers

google cloud speech ImportError: cannot import name 'enums'

I'm using google-cloud-speech api for my project . I'm using pipenv for virtual environment i installed google-cloud-speech api with pipenv install google-cloud-speech and pipenv update google-cloud-speech i followed this docs…
Linh Nguyen
  • 3,452
  • 4
  • 23
  • 67
9
votes
1 answer

How to receive long running operation results via Google Cloud Pub/Sub API

When long running operations are kicked off in Google Cloud APIs, the service might return an object with an "Operations" shape. The status of the operation can then be polled through this object and the results obtained when complete. I'd like to…
8
votes
1 answer

Why does my python script not recognize speech from audio file?

I have the following piece of code successfully recognizing short (less than 1 min) test audio file, but failing with recognition another long audiofile (1.5h). from google.cloud import speech def run_quickstart(): speech_client =…
Andriy
  • 1,270
  • 3
  • 17
  • 35
1
2 3
38 39