Questions tagged [voice-detection]

Recognizing human voices in audio data.

16 questions
17
votes
4 answers

What is a good approach for extracting portions of speech from an arbitrary audio file?

I have a set of audio files that are uploaded by users, and there is no knowing what they contain. I would like to take an arbitrary audio file, and extract each of the instances where someone is speaking into separate audio files. I don't want to…
stef
  • 14,172
  • 2
  • 48
  • 70
13
votes
3 answers

Voice Detection in Android Application

Description My application record sound from phone microphone. I am using android standard classes (android.media.AudioRecord) to do that. Application have 2 buttons "Start" and "Stop" when I press start button application start record and when I…
Viktor Apoyan
  • 10,655
  • 22
  • 85
  • 147
2
votes
0 answers

why the UI does not show till the while loop breaks?

this code is answered in Voice Detection in Android Application. It works fine but there is issue that the content of the page does not load until and unless the any voice has been detected and this snippet also makes the app crash few a times…
1
vote
0 answers

Voice and Music detection using Nodejs

im currently working a chrome extension that can cancels background music of videos played in the browser. I found on this forum a way to detect if the browser is playing music or not (first step), it work on a browser but its not actually working…
clemj13
  • 21
  • 1
1
vote
1 answer

Web Audio API: how can I detect speech and record until silence, with or without a Push-To-Talk button

I'm running succesfully a client web page that act as a voice message sender, using MediaRecorder APIs: when the user press any key, start an audio recording, when the key is released, the audio recording is sent, via soketio, to a server for…
1
vote
0 answers

How to write Short[] to wav output file in Android?

I am trying to write Short[] to wav audio file using file output stream but the file only contains scratch sound. The reason i am using short[] rather than byte[] is because i am trying to use an external library which provides Voice Activity…
1
vote
1 answer

Google Speech API - Is there a way to determine if the audio has human voice or not?

I am making an audio filtering application at work that reads over hundreds of audio files and filters them. So, if the audio has human voice in it, it will accept it and if it does not- it will delete the audio file. I am using ffmpeg to get the…
1
vote
0 answers

How to run WebRTC library

I would like to use this library: https://github.com/voixen/voixen-vad They offer some example code at the bottom of the readme. My question is how do I run it? What kind of project do I need to create to integrate this library and run the said…
qwe123asd
  • 11
  • 1
1
vote
1 answer

Voice Activity Detection (VAD/SAR) with LIUM

I wrote a shell script to train several GMMs for some kinds of voice activity and silence. I used LIUM speaker diarization toolkit therefore. I want to use this to do voice activity detection. The following script extracts MFCC features from an wav…
Johann Hagerer
  • 1,048
  • 2
  • 10
  • 28
1
vote
0 answers

automaticly stop recording when user stops speaking

I want to make an application that records user's voice but turns recording off when user stops to talk. How can I do that?
user101
  • 595
  • 1
  • 4
  • 18
0
votes
0 answers

Any idea where to get unvoiced speech data?

As we have 2 main classes of speech..i.e. voiced and unvoiced. I have been able to test my method with voiced. e.g. a recorded vowel /a/. but i want to see the output when i use unvoiced speech. Could someone point me to a database of these type of…
Farouk Yahaya
  • 43
  • 1
  • 8
0
votes
1 answer

Can I use the voice to open flutter application?

I am working on developing an application to help the blind to find things, and I want to add to the application the feature of operating by voice, for example, when the user says “Open” the application will lunch
Oussama Ridéne
  • 252
  • 1
  • 4
  • 14
0
votes
1 answer

Command detection with Deep Neural Networks using Kaldi without binding to a language

Did anybody see any samples how set up simple application to train dnet and then use it to recognize it a limited number of voice commands without binding to a particular language? I believe Kaldi API is quite powerful for it but there is a lack of…
ChatCloud
  • 1,152
  • 2
  • 8
  • 22
0
votes
1 answer

Setuptools error when trying to install a package

I’m completely new to Python and I want to install the package py-webrtcvad in Windows 7, but I'm stuck at the error ImportError: No module named 'pip.utils.setuptools_build'. There is an answer to this problem in stackoverflow which seems to solve…
0
votes
0 answers

how to record a voice when user is talking and how to play recorded voice when user was stop talking ?

I am writing SpeechRecognizer class inside service class to recognize the voice and to record and play it in background continuously, but this code was not working. Can any one give me an Idea how can I do that? This is my Activity class: public…
1
2