I am trying to use the Google Speech API to do live speech-to-text transcription in a web application. As far as I understand, that requires the streaming recognition method, which is only exposed over gRPC. I know there are multiple client libraries, but none of them makes it possible to stream audio directly from the web app to the Google Speech API; there is no plain JavaScript library for the browser.
I also know it is probably possible to do this by setting up a WebSocket connection between the front end and the backend and then, in my case, using the Node.js client library to stream to the Google Speech API. However, this seems unnecessarily complex.
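To make the workaround concrete, here is a minimal sketch of the relay I have in mind. `relayAudio` is a hypothetical helper, not part of any library. It assumes `socket` behaves like a connection from the `ws` package (`on`, `send`, `close`) and `recognizeStream` behaves like the duplex stream returned by `streamingRecognize()` in the Node.js client library (`on`, `write`, `end`).

```javascript
// Hypothetical helper: relays audio from a browser WebSocket to a
// recognition stream and sends the responses back to the browser.
function relayAudio(socket, recognizeStream) {
  // Forward each binary audio chunk from the browser to the recognizer.
  socket.on('message', (chunk) => {
    recognizeStream.write(chunk);
  });

  // When the browser disconnects, signal the end of the audio.
  socket.on('close', () => {
    recognizeStream.end();
  });

  // Send every recognition response back to the browser as JSON.
  recognizeStream.on('data', (response) => {
    socket.send(JSON.stringify(response));
  });

  // On a recognizer error, close the socket with 1011 (internal error).
  recognizeStream.on('error', (err) => {
    socket.close(1011, String((err && err.message) || err));
  });
}

// Assumed wiring (not run here; needs the `ws` and `@google-cloud/speech`
// packages plus credentials, and the config values are placeholders):
//
//   const client = new speech.SpeechClient();
//   wss.on('connection', (socket) => {
//     const stream = client.streamingRecognize({
//       config: { encoding: 'LINEAR16', sampleRateHertz: 16000, languageCode: 'en-US' },
//       interimResults: true,
//     });
//     relayAudio(socket, stream);
//   });
```

It is not much code, but it still means running and securing an extra server just to pass bytes along, which is what I would like to avoid.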
Is there really no supported way of using the streaming recognition directly from a web app?
Does anyone know how this could be done?
EDIT: I haven't gotten as far as actually sending a stream to the service, which is the core of my question. Let me rephrase it: is there a way to send an audio stream to the Google Speech API directly from the browser/microphone? My app is written in JavaScript (Angular).
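For context on the browser side: the Web Audio API delivers microphone samples as 32-bit floats in the range [-1, 1], while the Speech API's `LINEAR16` encoding expects 16-bit signed PCM. So whichever transport is used, I assume a conversion step along these lines is needed before sending. `floatTo16BitPCM` is my own hypothetical helper name, and this is only a sketch of that one step, not of the streaming itself.

```javascript
// Hypothetical helper: converts Web Audio float samples to 16-bit PCM.
function floatTo16BitPCM(samples) {
  const pcm = new Int16Array(samples.length);
  for (let i = 0; i < samples.length; i++) {
    // Clamp to [-1, 1] so out-of-range samples do not wrap around.
    const s = Math.max(-1, Math.min(1, samples[i]));
    // Scale negatives by 32768 and positives by 32767 to fill the int16 range.
    pcm[i] = s < 0 ? s * 0x8000 : s * 0x7fff;
  }
  return pcm;
}
```

The resulting `Int16Array` (or its underlying `buffer`) is what would be sent as an audio chunk.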
I've used IBM Watson Speech to Text before, and they provide a JavaScript SDK, available through Bower, that can stream microphone audio directly to the service for transcription without passing it through a backend layer.
Regards,
Kjetil