The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.
Questions tagged [speech-to-text]
2372 questions
78
votes
8 answers
Offline Speech Recognition In Android (JellyBean)
It looks as though Google has made offline speech recognition available from Google Now for third-party apps. It is being used by the app named Utter.
Has anyone seen any implementations of how to do simple voice commands with this offline speech…

rmooney
- 6,123
- 3
- 29
- 29
44
votes
5 answers
How do I convert speech to text?
How could I take MP3 and convert the speech to text?
I've got some recorded notes from a conference and from meetings (there is a single voice on the recording, which is my voice). I thought it would be easier and intellectually interesting to…

Johnny Maelstrom
- 47,581
- 5
- 21
- 18
33
votes
1 answer
Unable to get results from Google text to speech api while streaming audio from web
I want to stream audio from the web and convert that to text using Python Google-cloud-speech API. I have integrated that in my Django channels code.
For frontend, I have directly copied this code and the backend has this code (please see below).…

Lokesh Sanapalli
- 1,012
- 3
- 18
- 39
32
votes
4 answers
record/save audio from voice recognition intent
I want to save/record the audio that Google recognition service used for speech to text operation (using RecognizerIntent or SpeechRecognizer).
I experienced many ideas:
onBufferReceived from RecognitionListener: I know, this is not working, just…

Slim
- 528
- 1
- 6
- 9
29
votes
2 answers
good Speech recognition API
I am working on a college project in which I am using speech recognition. Currently I am developing it on Windows 7 and I'm using system.speech API package which comes along with .net and I am doing it on C#.
The problem I am facing is dictation…

swordfish
- 4,899
- 5
- 33
- 61
26
votes
9 answers
Getting started with speech recognition and python
I would like to know where one could get started with speech recognition. Not with a library or anything that is fairly "Black Box'ed" But instead, I want to know where I can Actually make a simple speech recognition script. I have done some…

bs7280
- 1,074
- 3
- 18
- 32
25
votes
5 answers
Is there a way to use the SpeechRecognizer API directly for speech input?
The Android Dev website provides an example of doing speech input using the built-in Google Speech Input Activity. The activity displays a pre-configured pop-up with the mic and passes its results using onActivityResult()
My question:
Is there a…

vladimir.vivien
- 512
- 1
- 6
- 8
25
votes
6 answers
Python pocketsphinx RequestError: missing PocketSphinx module: ensure that PocketSphinx is set up correctly
I am trying to make a Python app that can record audio and translate it into english text using PyAudio, SpeechRecognition and PocketSphinx. I'm running on a Mac OS X El Capitan, version 10.11.2.
Following a tutorial like this one and others, I've…

cheevahagadog
- 4,638
- 3
- 15
- 15
24
votes
3 answers
Google Speech Recognition API: timestamp for each word?
It's possible to use Google's Speech recognition API to get a transcription for an audio file (WAV, MP3, etc.) by doing a request to http://www.google.com/speech-api/v2/recognize?...
Example: I have said "one two three for five" in a WAV file.…

Basj
- 41,386
- 99
- 383
- 673
21
votes
2 answers
x-webkit-speech is deprectated. A JS replacement for simple speech input for ?
I noticed this warning in Chrome devtool's console:
The x-webkit-speech input field attribute is deprecated. Please use the JavaScript API instead.
So what's the best JS API replacement for simple speech input on input fields?
Perhaps there exists…

Web_Designer
- 72,308
- 93
- 206
- 262
20
votes
3 answers
Powershell can speak, but can it write if i speak?
Following is the way to make powershell to speak.
Add-Type -AssemblyName System.Speech
$synthesizer = New-Object -TypeName System.Speech.Synthesis.SpeechSynthesizer
$synthesizer.Speak('Hey, I can speak!')
Actually i would like to do opposite.…

Samselvaprabu
- 16,830
- 32
- 144
- 230
20
votes
1 answer
Open Source Software For Transcribing Speech in Audio Files
Can anyone recommend reliable open source software for transcribing English speech in wav files? The two main programs I've researched are Sphinx and Julius, but I've never been able to get either to work, and the documentation with each on…

Cerin
- 60,957
- 96
- 316
- 522
20
votes
1 answer
Voice Input to Populate Edit Text in android?
I am working on voice input in android. I used the sample from
http://developer.android.com/resources/samples/ApiDemos/src/com/example/android/apis/app/VoiceRecognition.html
And while testing on Xperia X10, I got the "Speak now" dialog but before…

Panache
- 987
- 5
- 16
- 35
20
votes
6 answers
Voice/Speech to text
I need an API or library (preferably free) that will convert voice/speech through a microphone, into text (string).
Additionally, I will need an API or library that can do text-to-speech.
I'd like to use C# and .NET, but other languages will…

charles_har
- 275
- 1
- 4
- 6
20
votes
2 answers
JavaScript Speech-to-Text for blind people
I'm developing a website, and I would like to help blind people to use it by the voice, so I will use:
Text-to-speech, to give some posibilities to the user
Speech-to-text, to allow user to use her voice to select one
I already have some…

sgmonda
- 2,615
- 1
- 19
- 29