I am interested in running the webkitSpeechRecognition API programmatically. I want to take an audio file that is uploaded to a server and use the webkitSpeechRecognition API on the back-end to recognize the text and return the result to the client.
One possibility is running some form of "embedded" version of Chrome, but I'm not sure how I would pass in the audio input. Another would be to use some form of C++ bindings to access the API, but I'm not sure if this is overly complicated.
Is this possible? How could this be accomplished?