Questions tagged [google-cloud-speech]

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API.

The API recognizes over 80 languages and variants, to support your global user base. You can transcribe the text of users dictating to an application’s microphone, enable command-and-control through voice, or transcribe audio files, among many other use cases.
Recognize audio uploaded in the request, and integrate with your audio storage on Google Cloud Storage, by using the same technology Google uses to power its own products.

try it

576 questions

votes

15 answers

ImportError: No module named google.cloud

I am unable to import google.cloud.speech from google.cloud import speech I have installed it using : pip install --upgrade google-cloud-speech -t dir-name It is giving me below error while importing it from dir-name ImportError: No module named…

asked Jun 06 '17 at 18:39

nishith

1,223
1
12
21

votes

1 answer

Difference between Android Speech to Text API (Recognizer Intent) and Google Cloud Speech API?

So i'm looking into building a speech to text app for fun. I did some research and found an inbuilt Speech to Text API using RecognizerIntent that is free, but also found that google is now offerieng a cloud speech API that the charge for. My…

android google-cloud-speech

asked Nov 30 '16 at 17:21

AndroidDev21921

votes

1 answer

Unable to get results from Google text to speech api while streaming audio from web

I want to stream audio from the web and convert that to text using Python Google-cloud-speech API. I have integrated that in my Django channels code. For frontend, I have directly copied this code and the backend has this code (please see below).…

django speech-to-text google-speech-api django-channels google-cloud-speech

asked May 09 '19 at 17:13

Lokesh Sanapalli

1,012
3
18
39

votes

3 answers

How can I extract the preceding audio (from microphone) as a buffer when silence is detected (JS)?

I'm using the Google Cloud API for Speech-to-text, with a NodeJS back-end. The app needs to be able to listen for voice commands, and transmit them to the back-end as a buffer. For this, I need to send the buffer of the preceding audio when silence…

javascript node.js web-audio-api google-cloud-speech audiocontext

asked Oct 03 '17 at 11:18

azhar

1,709
1
19
41

votes

3 answers

Google Speech Recognition API Result is Empty

I'm performing an asynchronous request to Google Cloud Speech API, and I do not know how to get the result of operation: Request POST: https://speech.googleapis.com/v1beta1/speech:asyncrecognize Body: { "config":{ "languageCode"…

google-api speech-recognition google-cloud-speech

asked Aug 11 '16 at 21:37

Bruno

votes

2 answers

How to get SSML timestamps from Google Cloud text-to-speech API

I want to use SSML markers through the Google Cloud text-to-speech API to request the timing of these markers in the audio stream. These timestamps are necessary in order to provide cues for effects, word/section highlighting and feedback to the…

markers google-cloud-speech google-text-to-speech ssml

asked Aug 06 '19 at 18:24

James

votes

3 answers

User data privacy in Google speech to text api / Google cloud speech solution

What level of data privacy is maintained when I used Google speech to text api or google cloud speech solution. I want to know this because I am working on an application which converts audio files into text and those files may contain user…

google-speech-api google-cloud-speech

asked Jun 15 '17 at 16:52

Bhargav Deshpande

votes

2 answers

Google Speech-to-text API, InvalidArgument: 400 Must use single channel (mono)

I keep getting this error InvalidArgument: 400 in google Speech-to-text, and the problem seems to be that I an using a 2 channel audio(Stereo), and the API is waiting for a wav in (Mono). If I convert the file in a audio editor it might work, but I…

python google-cloud-speech

asked Mar 11 '19 at 16:39

Jose silvestre Rodriguez Ortiz

votes

1 answer

How do I stream live audio from the browser to Google Cloud Speech via socket.io?

I have a situation with a React-based app where I have an input for which I wanted to allow voice input as well. I'm okay making this compatible with Chrome and Firefox only, so I was thinking of using getUserMedia. I know I'll be using Google…

javascript socket.io google-cloud-speech

asked Jun 21 '18 at 19:18

Amber B.

1,134
10
20

votes

5 answers

Is there a way to generate an SRT file (or similar) using Google Cloud Speech?

In order to generate subtitles for my videos, I converted them to audio files and used the Cloud Speech-to-Text. It works, but it only generates transcriptions, whereas what I need is a *.srt/*.vtt/similar file. What I need is what YouTube does: to…

google-cloud-speech

asked Sep 19 '18 at 04:09

Lucas Caton

3,027
1
24
34

votes

3 answers

How to end Google Speech-to-Text streamingRecognize gracefully and get back the pending text results?

I'd like to be able to end a Google speech-to-text stream (created with streamingRecognize), and get back the pending SR (speech recognition) results. In a nutshell, the relevant Node.js code: // create SR stream const stream =…

javascript node.js async-await speech-recognition google-cloud-speech

asked Oct 28 '20 at 02:11

noseratio

59,932
34
208
486

votes

2 answers

Cannot import com.google.cloud.speech.v1.SpeechGrpc in Android

I'm trying to use Google's Speech API in Android project. The example project works. I'm having trouble to use it in my own android app. build.gradle(Module:app) : apply plugin: 'com.android.application' apply plugin: 'com.google.protobuf' ext { …

java android google-cloud-platform google-speech-api google-cloud-speech

asked Jul 04 '17 at 19:45

tezz

votes

4 answers

google cloud speech ImportError: cannot import name 'enums'

I'm using google-cloud-speech api for my project . I'm using pipenv for virtual environment i installed google-cloud-speech api with pipenv install google-cloud-speech and pipenv update google-cloud-speech i followed this docs…

python import google-speech-api pipenv google-cloud-speech

asked Jun 28 '19 at 02:34

Linh Nguyen

3,452
4
23
67

votes

1 answer

How to receive long running operation results via Google Cloud Pub/Sub API

When long running operations are kicked off in Google Cloud APIs, the service might return an object with an "Operations" shape. The status of the operation can then be polled through this object and the results obtained when complete. I'd like to…

google-app-engine google-cloud-platform google-cloud-pubsub google-cloud-speech

asked Jun 15 '17 at 12:54

ximus

votes

1 answer

Why does my python script not recognize speech from audio file?

I have the following piece of code successfully recognizing short (less than 1 min) test audio file, but failing with recognition another long audiofile (1.5h). from google.cloud import speech def run_quickstart(): speech_client =…

google-cloud-speech

asked Jun 29 '17 at 21:48

Andriy

1,270
3
17
35

2 3

…

38 39 Next