Questions tagged [sound-recognition]
49 questions
10
votes
5 answers
Clap sound detection in C#
I would like to be able to detect a clapping sound. Being pretty new to any form of sound recognition, is there any simple way to do this? If not, any good resources for C# sound recognition/detection?
My goal is to automate a client's media center…

jamesrom
- 866
- 3
- 10
- 19
7
votes
1 answer
Expo: Get audio data realtime and send via Socket.IO
App I want to make
I would like to make an audio recognition mobile app like Shazam with
Expo
Expo AV (https://docs.expo.io/versions/latest/sdk/audio)
Tensorflow serving
Socket.IO
I want to send recording data to machine learning based recognition…

Watanabe.N
- 1,549
- 1
- 14
- 37
5
votes
0 answers
Bad types trying to use audio recognition loopback
The Microsoft Speech system has good example code, but I have a problem when adding loopback to record what is playing rather than what is coming through the mic, for example to give a text description of a video without playing it on the speaker. It seems…

NoBugs
- 9,310
- 13
- 80
- 146
5
votes
2 answers
Software: How does software recognize a song?
I was playing with my phone and there is this app on it that, when you use it to record 10 seconds of a song, tells you the title and author of that song. Now, as a software engineer, I can't help but wonder: how does this work?

Emerion
- 820
- 6
- 13
4
votes
3 answers
How to retrieve frame number and frame size from wav file
I'm currently doing a thesis about chord recognition with EPCP, with a WAV file as input, but now I'm stuck at determining the number of frames and the frame size of a single WAV file. I need that data as the parameters for the hammingwindow function in NAudio…

Norman Pratama
- 67
- 2
- 12
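One way to read the question above, sketched in Python rather than NAudio: a WAV "frame" is one sample per channel, while an analysis frame is a fixed-size window of those samples (the span a Hamming window would cover). The helper names `analysis_frame_count` and `describe_wav`, and the default `frame_size` and `hop_size`, are assumptions for illustration and do not come from the question.

```python
import wave


def analysis_frame_count(n_samples, frame_size, hop_size):
    """Number of full windows of frame_size samples, advancing by hop_size."""
    if n_samples < frame_size:
        return 0
    return 1 + (n_samples - frame_size) // hop_size


def describe_wav(path_or_file, frame_size=4096, hop_size=2048):
    """Read the WAV header and report sample frames and analysis frames.

    frame_size and hop_size are hypothetical defaults, not values from the question.
    """
    with wave.open(path_or_file, "rb") as wav:
        n_samples = wav.getnframes()  # sample frames per channel
        return {
            "sample_rate": wav.getframerate(),
            "channels": wav.getnchannels(),
            "sample_frames": n_samples,
            "analysis_frames": analysis_frame_count(n_samples, frame_size, hop_size),
        }
```

`describe_wav` accepts either a file path or an open binary file object, which is what `wave.open` itself accepts.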
4
votes
1 answer
Persian speech to text in android
I want to improve my knowledge and work with sound recognition (speech to text). I found good samples from Google, but I don't know how to do the same with the Persian (Farsi) language. I did it with English, but how about Farsi? Does it need some setting in my…

Mina Dahesh
- 332
- 8
- 21
4
votes
1 answer
How does Shazam / soundhound work?
I am interested in how Shazam or SoundHound works.
How does it record a voice and make a fingerprint that is so similar to the fingerprints in its database that it finds a match?
I am about to start writing some software in C/C++ but not sure…

Wiggler Jtag
- 669
- 8
- 23
3
votes
1 answer
Audio Classification based on FFT
In Python, I'm performing alarm recognition by only checking frequencies and amplitudes. My code takes the FFT of the 1s sound, then compares it with the predetermined frequencies and their amplitudes. Since alarms contain higher frequencies…

Selim Turkoglu
- 149
- 1
- 2
- 11
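The check described in the question above (compare the spectrum of a 1 s clip against predetermined frequencies and amplitudes) might look roughly like the sketch below. It is not the asker's code: it evaluates single DFT bins with the standard library instead of a full FFT, and `tone_amplitude`, `matches_alarm`, `make_tone`, and the 20% `tolerance` are all hypothetical names and values chosen for illustration.

```python
import cmath
import math


def tone_amplitude(samples, sample_rate, freq_hz):
    """Estimate the amplitude of the sinusoid at freq_hz from one DFT bin."""
    n = len(samples)
    acc = 0j
    for i, x in enumerate(samples):
        acc += x * cmath.exp(-2j * math.pi * freq_hz * i / sample_rate)
    return 2.0 * abs(acc) / n


def matches_alarm(samples, sample_rate, expected, tolerance=0.2):
    """expected maps frequency (Hz) to expected amplitude.

    Returns True only if every listed frequency is present within the
    relative tolerance (an assumed value, tune it for real recordings).
    """
    return all(
        abs(tone_amplitude(samples, sample_rate, f) - a) <= tolerance * a
        for f, a in expected.items()
    )


def make_tone(freqs_amps, sample_rate, seconds=1.0):
    """Synthesize a test signal as a sum of sine waves (for demonstration only)."""
    n = int(sample_rate * seconds)
    return [
        sum(a * math.sin(2 * math.pi * f * i / sample_rate) for f, a in freqs_amps.items())
        for i in range(n)
    ]
```

Checking only a handful of known frequencies this way costs one pass over the samples per frequency; with many target frequencies, a full FFT becomes the cheaper option.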
2
votes
0 answers
Tensorflow sound recognition tutorial gives error: op_def = op_dict[node.op] KeyError: 'DecodeWav'
I am trying to import a pretrained tensorflow model (the default sound recognition one in the tutorial) and I keep getting this error.
I tried importing using both a checkpoint file and a .pb file, and as a beginner, I have no idea about this error.…

utsal
- 21
- 2
2
votes
2 answers
"Sound" Recognition in Swift?
I'm working on an application in Swift and I was thinking about a way to get non-speech sound recognition into my project.
I mean is there a way in which I can take in sound inputs and match them against some predefined sounds already incorporated in…

ritvik1512
- 303
- 1
- 3
- 10
1
vote
1 answer
Dictating in MS Visual C#?
I want to make a text editor, but one that is different from other text editors: it will take commands from my voice.
For example, I will say my name into the microphone, "Ibrahim AKGUN", and the program will pick it up and write it down like a virtual secretary :)
This is…

İbrahim Akgün
- 1,527
- 11
- 37
- 63
1
vote
0 answers
Appium CodeceptJS - Mock sound recording - Android
I've been automating testing of mobile apps for a year now, but this is the first time that part of an app records the user's voice (it's actually a 3-step process: first a recording of "silence" to calibrate to the background noise, second a prolonged…

Pristo
- 11
- 1
1
vote
0 answers
Sound recognition of beep to start and stop recording MP3 stream
I'm trying to make a Python script that would record local news from an internet radio (MP3) stream like this one. The news segment starts and ends with two different, distinct beeps.
Until now, my solution was extremely slow and unoptimized:
Use…

J. Dvoršak
- 11
- 2
1
vote
0 answers
Footsteps recognition on Arduino Nano 33 BLE
I'm trying to build an intrusion detection device by training a model on footstep sounds.
I'm trying to do it using TensorFlow.
The problem is that this example does not work at all: I've tried to create a custom word called "footsteps",…

Riccardo
- 47
- 6
1
vote
0 answers
Is there any way to compare two spectrograms?
I am trying to compare two spectrograms of two different spoken words. Is there any way to compare them and tell that the two spectrograms come from different words?
Hello (word I spoke)
Play (word I spoke)

Ashish
- 997
- 6
- 20