Questions tagged [speech]

Speech is the vocalized form of human communication.

There are two main categories of speech-related computer applications:

974 questions
125
votes
4 answers

How can I use speech recognition without the annoying dialog in android phones

Is this possible without modify the android APIs? I've found a article about this. There's one a comment that I should do modifications to the android APIs. But it didn't say how to do the modification. Can anybody give me some suggestions on how…
Jim31837
  • 1,589
  • 3
  • 12
  • 12
81
votes
4 answers

What is the difference between System.Speech.Recognition and Microsoft.Speech.Recognition?

There are two similar namespaces and assemblies for speech recognition in .NET. I’m trying to understand the differences and when it is appropriate to use one or the other. There is System.Speech.Recognition from the assembly System.Speech (in…
Michael Levy
  • 13,097
  • 15
  • 66
  • 100
47
votes
16 answers

Voice Recognition Software For Developers

Well the docs finally said it, I need to take it easy on my wrist for a few months. Being that I'm a .NET Developer this could end my livelihood for a little while, something I'm not anxious to do. That said, are there any good handsfree options for…
tekiegreg
  • 1,667
  • 6
  • 25
  • 41
36
votes
5 answers

Split speech audio file on words in python

I feel like this is a fairly common problem but I haven't yet found a suitable answer. I have many audio files of human speech that I would like to break on words, which can be done heuristically by looking at pauses in the waveform, but can anyone…
user3059201
  • 775
  • 2
  • 7
  • 11
29
votes
2 answers

good Speech recognition API

I am working on a college project in which I am using speech recognition. Currently I am developing it on Windows 7 and I'm using system.speech API package which comes along with .net and I am doing it on C#. The problem I am facing is dictation…
swordfish
  • 4,899
  • 5
  • 33
  • 61
26
votes
8 answers

Open source code for voice detection and discrimination

I have 15 audio tapes, one of which I believe contains an old recording of my grandmother and myself talking. A quick attempt to find the right place didn't turn it up. I don't want to listen to 20 hours of tape to find it. The location may not…
Croad Langshan
  • 2,646
  • 3
  • 24
  • 37
25
votes
3 answers

Audio analysis to detect human voice, gender, age and emotion -- any prior open-source work done?

Is there prior open-source work done in the field of 'Audio analysis' to detect human-voice (say in spite of some background noise), determine speaker's gender, possibly determine no. of speakers, age of speaker(s), and the emotion of speakers? My…
mike.dinnone
  • 732
  • 2
  • 8
  • 17
24
votes
3 answers

Google Speech Recognition API: timestamp for each word?

It's possible to use Google's Speech recognition API to get a transcription for an audio file (WAV, MP3, etc.) by doing a request to http://www.google.com/speech-api/v2/recognize?... Example: I have said "one two three for five" in a WAV file.…
Basj
  • 41,386
  • 99
  • 383
  • 673
23
votes
4 answers

Python Speaker Recognition

I have an audio file, a recorded telephone conversation of 2 people, that I need to separate the voices of 2 speakers automatically. I am new to speech recognition and I looked at wave module of python but failed to find any fruitful…
PJC
  • 231
  • 1
  • 2
  • 3
23
votes
5 answers

how I can change the voice synthesizer gender and age in C#?

I would like to change the gender and age of the voice of System.Speech in c#. For example, a girl of 10 years but can not find any simple example to help me adjust the parameters.
Pablo Gonzalez
  • 673
  • 2
  • 10
  • 24
20
votes
3 answers

How to capture audio in javascript?

I am currently using getUserMedia(), which is only working on Firefox and Chrome, yet it got deprecated and works only on https (in Chrome). Is there any other/better way to get the speech input in javascript that works on all platforms? E.g. how do…
user2212461
  • 3,105
  • 8
  • 49
  • 87
20
votes
5 answers

Google Speech Recognition API

I'm trying to use the Google Speech API v2 (at address https://www.google.com/speech-api/v2/recognize?...) I need to use my Api Key, but when I use it I get error 403 Forbidden When I use an API key that was on the example project I downloaded it is…
Ron Gross
  • 1,474
  • 5
  • 19
  • 34
20
votes
2 answers

Can Web Speech API be used in conjunction with Web Audio API?

Is it possible to use the synthesised speech from Web Speech API as a SourceNode inside Web Audio API's audio context?
zya
  • 830
  • 11
  • 25
19
votes
1 answer

Fastest Speech recognition library C++

I know its a general question topic, but still i want to know whats the fastest speech recognition library in C++? Currently I am using Microsoft SAPI with kniect. It works fine and recognizes words but its abit slow, some times it takes 1,2 seconds…
Fahad Rauf
  • 705
  • 2
  • 8
  • 17
17
votes
5 answers

How can I do real-time voice activity detection in Python?

I am performing a voice activity detection on the recorded audio file to detect speech vs non-speech portions in the waveform. The output of the classifier looks like (highlighted green regions indicate speech): The only issue I face here is making…
Nickil Maveli
  • 29,155
  • 8
  • 82
  • 85
1
2 3
64 65