Questions tagged [speech]

Speech is the vocalized form of human communication.

There are two main categories of speech-related computer applications:

speech-recognition - problems related to recognizing speech. One of the most common problems is converting human speech to its textual representation (speech-to-text).
speech-synthesis - problems related to producing artificial human speech. See also text-to-speech.

974 questions

125

votes

4 answers

How can I use speech recognition without the annoying dialog on Android phones

Is this possible without modifying the Android APIs? I've found an article about this. There's a comment saying that I should make modifications to the Android APIs, but it didn't say how to do the modification. Can anybody give me some suggestions on how…

android speech-recognition speech

asked Jun 11 '11 at 15:56

Jim31837

1,589
3
12
12

votes

4 answers

What is the difference between System.Speech.Recognition and Microsoft.Speech.Recognition?

There are two similar namespaces and assemblies for speech recognition in .NET. I’m trying to understand the differences and when it is appropriate to use one or the other. There is System.Speech.Recognition from the assembly System.Speech (in…

.net speech-recognition speech ucma2.0 ucs

asked Jun 04 '10 at 19:54

Michael Levy

13,097
15
66
100

votes

16 answers

Voice Recognition Software For Developers

Well the docs finally said it, I need to take it easy on my wrist for a few months. Being that I'm a .NET Developer this could end my livelihood for a little while, something I'm not anxious to do. That said, are there any good handsfree options for…

speech-recognition voice ergonomics speech code-by-voice

asked Sep 17 '08 at 21:45

tekiegreg

1,667
6
25
41

votes

5 answers

Split speech audio file on words in python

I feel like this is a fairly common problem but I haven't yet found a suitable answer. I have many audio files of human speech that I would like to break on words, which can be done heuristically by looking at pauses in the waveform, but can anyone…

python audio speech-recognition speech heuristics

asked Apr 06 '16 at 17:27

user3059201

votes

2 answers

good Speech recognition API

I am working on a college project in which I am using speech recognition. Currently I am developing it on Windows 7, using the System.Speech API package that ships with .NET, and I am writing it in C#. The problem I am facing is dictation…

c# .net speech-recognition speech speech-to-text

asked Mar 29 '11 at 04:15

swordfish

4,899
5
33
61

votes

8 answers

Open source code for voice detection and discrimination

I have 15 audio tapes, one of which I believe contains an old recording of my grandmother and myself talking. A quick attempt to find the right place didn't turn it up. I don't want to listen to 20 hours of tape to find it. The location may not…

speech-recognition speech pyaudioanalysis

asked Apr 22 '11 at 18:07

Croad Langshan

2,646
3
24
37

votes

3 answers

Audio analysis to detect human voice, gender, age and emotion -- any prior open-source work done?

Is there prior open-source work done in the field of 'Audio analysis' to detect human-voice (say in spite of some background noise), determine speaker's gender, possibly determine no. of speakers, age of speaker(s), and the emotion of speakers? My…

speech-recognition analysis speech emotion

asked Feb 21 '11 at 03:39

mike.dinnone

votes

3 answers

Google Speech Recognition API: timestamp for each word?

It's possible to use Google's Speech recognition API to get a transcription for an audio file (WAV, MP3, etc.) by doing a request to http://www.google.com/speech-api/v2/recognize?... Example: I have said "one two three for five" in a WAV file.…

audio speech-recognition speech-to-text speech google-speech-api

asked Dec 04 '15 at 10:39

Basj

41,386
99
383
673

votes

4 answers

Python Speaker Recognition

I have an audio file, a recorded telephone conversation between 2 people, in which I need to separate the voices of the 2 speakers automatically. I am new to speech recognition and I looked at the wave module of Python but failed to find any fruitful…

python voice-recognition speech

asked Sep 05 '11 at 14:07

PJC

votes

5 answers

How can I change the voice synthesizer gender and age in C#?

I would like to change the gender and age of the voice of System.Speech in C#, for example to a 10-year-old girl, but I cannot find any simple example to help me adjust the parameters.

c# speech synthesizer

asked Jun 04 '12 at 12:35

Pablo Gonzalez

votes

3 answers

How to capture audio in javascript?

I am currently using getUserMedia(), which is only working on Firefox and Chrome, yet it got deprecated and works only on https (in Chrome). Is there any other/better way to get the speech input in javascript that works on all platforms? E.g. how do…

javascript audio speech getusermedia voice-recording

asked Jan 15 '16 at 21:58

user2212461

3,105
8
49
87

votes

5 answers

Google Speech Recognition API

I'm trying to use the Google Speech API v2 (at address https://www.google.com/speech-api/v2/recognize?...). I need to use my API key, but when I use it I get error 403 Forbidden. When I use an API key that was in the example project I downloaded it is…

google-api speech-recognition speech

asked May 12 '14 at 12:19

Ron Gross

1,474
5
19
34

votes

2 answers

Can Web Speech API be used in conjunction with Web Audio API?

Is it possible to use the synthesised speech from Web Speech API as a SourceNode inside Web Audio API's audio context?

speech web-audio-api speech-synthesis

asked Sep 19 '13 at 18:56

zya

votes

1 answer

Fastest Speech recognition library C++

I know it's a general question topic, but I still want to know what the fastest speech recognition library in C++ is. Currently I am using Microsoft SAPI with Kinect. It works fine and recognizes words, but it's a bit slow; sometimes it takes 1-2 seconds…

c++ kinect speech-recognition speech sapi

asked Apr 05 '13 at 06:43

Fahad Rauf

votes

5 answers

How can I do real-time voice activity detection in Python?

I am performing a voice activity detection on the recorded audio file to detect speech vs non-speech portions in the waveform. The output of the classifier looks like (highlighted green regions indicate speech): The only issue I face here is making…

python speech-recognition speech-to-text speech pyaudio

asked Mar 24 '20 at 13:38

Nickil Maveli

29,155
8
82
85
