Questions tagged [google-speech-api]

With the Google Speech API you can convert speech to text, either from an audio file or from a live stream.

The Google Cloud Speech API (https://cloud.google.com/speech/) is part of the Google Cloud Platform products (https://cloud.google.com/products/) and provides speech-to-text conversion.

When using a live stream, words are returned in near real time. A stream is limited to 1 minute of audio, or ends when the speaker pauses for about 1 second. When an audio file is processed asynchronously, it can be up to 80 minutes long. See https://cloud.google.com/speech/limits
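As a rough illustration of the limits described above, the following sketch picks a recognition approach from the audio length. It is not part of the Speech API: the function name `pick_recognition_mode` and the returned labels are hypothetical, and the two constants simply restate the 1 minute and 80 minute figures from this description, which may have changed since.

```python
# Limits as quoted in the tag description (assumption: still current;
# check the limits page before relying on them).
STREAMING_LIMIT_SECONDS = 60        # about 1 minute per live stream
ASYNC_LIMIT_SECONDS = 80 * 60       # up to 80 minutes per asynchronous file


def pick_recognition_mode(duration_seconds: float, live: bool) -> str:
    """Return a hypothetical label for how to submit audio of this length."""
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    if live:
        # A live stream longer than the limit has to be reopened periodically.
        if duration_seconds <= STREAMING_LIMIT_SECONDS:
            return "streaming"
        return "streaming-restart"
    # A recorded file that fits the asynchronous limit can be sent whole;
    # anything longer has to be split into smaller files first.
    if duration_seconds <= ASYNC_LIMIT_SECONDS:
        return "asynchronous"
    return "split-file"
```

For example, a 60-minute recording would fall under the asynchronous label, while a 100-minute recording would need to be split first.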

For more, see https://cloud.google.com/speech/docs/

837 questions
74
votes
15 answers

ImportError: No module named google.cloud

I am unable to import google.cloud.speech using: from google.cloud import speech. I installed it with: pip install --upgrade google-cloud-speech -t dir-name. Importing it from dir-name gives the error below: ImportError: No module named…
nishith
41
votes
5 answers

portaudio.h: No such file or directory

I got the following error while trying to install pyaudio using pip3 on Ubuntu 16.04: Collecting pyaudio Downloading PyAudio-0.2.11.tar.gz Installing collected packages: pyaudio Running setup.py install for pyaudio ... error Complete output…
Monojit Sarkar
39
votes
8 answers

Request had invalid authentication credentials. Expected OAuth 2 access token error in cloud speech api

I followed the Google Cloud Speech API quickstart and made the API request using curl -s -H "Content-Type: application/json" \ -H "Authorization: Bearer "$(gcloud auth application-default print-access-token) \ …
Patel
33
votes
1 answer

Unable to get results from Google text to speech api while streaming audio from web

I want to stream audio from the web and convert it to text using the Python google-cloud-speech API. I have integrated that into my Django Channels code. For the frontend, I have directly copied this code, and the backend has this code (please see below).…
25
votes
1 answer

Streaming Audio in FLAC or AMR_WB to the Google Speech API

I need to run the google speech api in somewhat low bandwidth environments. Based on reading about best practices, it seems my best bet is to use the AMR_WB format. However, the following code produces no exceptions, and I get no responses in the…
Wesley
24
votes
3 answers

Google Speech Recognition API: timestamp for each word?

It's possible to use Google's Speech recognition API to get a transcription for an audio file (WAV, MP3, etc.) by doing a request to http://www.google.com/speech-api/v2/recognize?... Example: I have said "one two three for five" in a WAV file.…
Basj
19
votes
2 answers

Is there a way to force Google Speech api to return only words as response?

I am using this Google API: https://www.google.com/speech-api/v2/recognize?output=json&lang="+ language_code+"&key="My key" for speech recognition, and it's working very well. The issue is with numbers, i.e. if I say one two three four the…
Sunil Sunny
19
votes
2 answers

Difference between Google Speech API and Web Speech API

I am working on web speech recognition, and I found that Google provides an API called "Google speech API V2" to developers. But I noticed there is a daily limit on using it. After that I found there is also a native Web Speech API that can…
Kim Wong
17
votes
1 answer

Google Cloud Speech with Javascript

The documentation and tutorial cover the REST API (Google Speech API for Node: https://cloud.google.com/nodejs/apis), so my question is how to use the Cloud Speech API in JavaScript. Has anyone used it on a page with JavaScript? 2020-04-24 EDIT: The accepted…
16
votes
1 answer

How to use google speech recognition api in python?

I have an mp3 file and I want to use Google's speech recognition to get the text out of that file. Any ideas where I can find documentation or examples will be appreciated.
Vivek Anand
16
votes
1 answer

Google Speech API V2

I have followed this link to generate API keys, but the Speech API was not found on the console. The application example I'm following is here: https://github.com/gillesdemey/google-speech-v2/ I keep on getting an error: Your client does not have…
Gourish Goudar
15
votes
1 answer

INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes

I'm using the GCS Speech API for the first time, in a project to convert a series of audio files to text. Each file is around 60 minutes long and has a person talking continuously the whole time. I've installed the GC SDK and I'm using it to…
CIRCLE
15
votes
3 answers

User data privacy in Google speech to text api / Google cloud speech solution

What level of data privacy is maintained when I use the Google speech-to-text API or the Google Cloud Speech solution? I want to know this because I am working on an application that converts audio files into text, and those files may contain user…
13
votes
3 answers

GOOGLE_APPLICATION_CREDENTIALS error

I am trying to use the Google Speech API to convert an audio file to text, but I am facing some problems with the documentation. I have enabled the Speech API and created a service account file. I also got the service account credentials activated.…
JSnow
12
votes
1 answer

How to translate live streaming using google speech api?

These are the Google Speech API docs: https://cloud.google.com/speech/docs/sync-recognize I tried this API for 2 weeks, but still can't achieve my main purpose (translating a live stream). I'm using PHP. (Suggestions in other languages are welcome, I will…
plonknimbuzz