Recognizing human voices in audio data.
Questions tagged [voice-detection]
16 questions
17
votes
4 answers
What is a good approach for extracting portions of speech from an arbitrary audio file?
I have a set of audio files that are uploaded by users, and there is no knowing what they contain.
I would like to take an arbitrary audio file, and extract each of the instances where someone is speaking into separate audio files. I don't want to…

stef
13
votes
3 answers
Voice Detection in Android Application
Description
My application records sound from the phone microphone. I am using the standard Android classes (android.media.AudioRecord) to do that. The application has 2 buttons, "Start" and "Stop". When I press the Start button, the application starts recording, and when I…

Viktor Apoyan
2
votes
0 answers
Why does the UI not show until the while loop breaks?
This code comes from an answer to Voice Detection in Android Application.
It works fine, but the content of the page does not load until a voice has been detected, and this snippet also makes the app crash a few times…

kishlay raj
1
vote
0 answers
Voice and Music detection using Nodejs
I'm currently working on a Chrome extension that can cancel the background music of videos played in the browser. I found on this forum a way to detect whether the browser is playing music or not (first step). It works in a browser, but it's not actually working…

clemj13
1
vote
1 answer
Web Audio API: how can I detect speech and record until silence, with or without a Push-To-Talk button
I'm successfully running a client web page that acts as a voice message sender, using the MediaRecorder API:
when the user presses any key, an audio recording starts,
when the key is released, the audio recording is sent, via socket.io, to a server for…

Giorgio Robino
1
vote
0 answers
How to write Short[] to a WAV output file in Android?
I am trying to write Short[] to a WAV audio file using a file output stream, but the file only contains scratchy noise.
The reason I am using short[] rather than byte[] is that I am trying to use an external library which provides Voice Activity…

Rishabh Sharma
1
vote
1 answer
Google Speech API - Is there a way to determine if the audio has human voice or not?
I am making an audio filtering application at work that reads hundreds of audio files and filters them. If the audio has a human voice in it, the application accepts it; if it does not, it deletes the audio file.
I am using ffmpeg to get the…

stupid_sma
1
vote
0 answers
How to run WebRTC library
I would like to use this library: https://github.com/voixen/voixen-vad
They offer some example code at the bottom of the readme.
My question is how do I run it?
What kind of project do I need to create to integrate this library and run the said…

qwe123asd
1
vote
1 answer
Voice Activity Detection (VAD/SAR) with LIUM
I wrote a shell script to train several GMMs for some kinds of voice activity and silence. I used the LIUM speaker diarization toolkit for this. I want to use this to do voice activity detection. The following script extracts MFCC features from a wav…

Johann Hagerer
1
vote
0 answers
Automatically stop recording when the user stops speaking
I want to make an application that records the user's voice but turns recording off when the user stops talking. How can I do that?

user101
0
votes
0 answers
Any idea where to get unvoiced speech data?
There are two main classes of speech, i.e. voiced and unvoiced. I have been able to test my method with voiced speech, e.g. a recorded vowel /a/, but I want to see the output when I use unvoiced speech. Could someone point me to a database of this type of…

Farouk Yahaya
0
votes
1 answer
Can I use voice to open a Flutter application?
I am developing an application to help blind people find things, and I want to add voice operation to it. For example, when the user says “Open”, the application will launch.

Oussama Ridéne
0
votes
1 answer
Command detection with Deep Neural Networks using Kaldi without binding to a language
Has anybody seen any samples of how to set up a simple application to train a deep neural network and then use it to recognize a limited number of voice commands without binding to a particular language? I believe the Kaldi API is quite powerful for this, but there is a lack of…

ChatCloud
0
votes
1 answer
Setuptools error when trying to install a package
I’m completely new to Python and I want to install the package py-webrtcvad on Windows 7, but I'm stuck at the error ImportError: No module named 'pip.utils.setuptools_build'. There is an answer to this problem on Stack Overflow which seems to solve…

Alonso
0
votes
0 answers
How to record a voice while the user is talking and play the recorded voice when the user stops talking?
I am writing a SpeechRecognizer class inside a service class to recognize the voice and to record and play it in the background continuously, but this code is not working.
Can anyone give me an idea of how I can do that?
This is my Activity class:
public…

Somanadh