Questions tagged [sound-recognition]

49 questions
10
votes
5 answers

Clap sound detection in C#

I would like to be able to detect a clapping sound. Being pretty new to any form of sound recognition, is there any simple way to do this? If not, are there any good resources for C# sound recognition/detection? My goal is to automate a client's media center…
jamesrom
  • 866
  • 3
  • 10
  • 19
7
votes
1 answer

Expo: Get audio data realtime and send via Socket.IO

App I want to make: I would like to make an audio recognition mobile app like Shazam with Expo, Expo AV (https://docs.expo.io/versions/latest/sdk/audio), TensorFlow Serving, and Socket.IO. I want to send recording data to machine learning based recognition…
Watanabe.N
  • 1,549
  • 1
  • 14
  • 37
5
votes
0 answers

Bad types trying to use audio recognition loopback

The Microsoft Speech system has good example code, but I have a problem when adding loopback to record what is playing rather than what is coming through the mic, for example to give a text description of a video while not playing it on the speaker. It seems…
NoBugs
  • 9,310
  • 13
  • 80
  • 146
5
votes
2 answers

Software: How does software recognize a song?

I was playing with my phone, and there is an app on it that, when you use it to record 10 seconds of a song, tells you the title and author of that song. Now, as a software engineer, I can't help but wonder: how does this work?
Emerion
  • 820
  • 6
  • 13
4
votes
3 answers

How to retrieve frame number and frame size from wav file

I'm currently doing a thesis on chord recognition with EPCP, with a WAV file as input, but now I'm stuck at determining the number of frames and the frame size of a single WAV file. I need that data as the parameters for the hammingwindow function in NAudio…
Norman Pratama
  • 67
  • 2
  • 12
4
votes
1 answer

Persian speech to text in Android

I want to improve my knowledge and work with sound recognition (speech to text). I found good samples from Google, but I don't know how to do this with the Persian (Farsi) language. I did it with English, but how about Farsi? Does it need some setting in my…
Mina Dahesh
  • 332
  • 8
  • 21
4
votes
1 answer

How does Shazam / SoundHound work?

I am interested in how Shazam or SoundHound works. How does it record a voice and make a fingerprint that is so similar to those in its database of fingerprints that it finds a match? I am about to start writing some software in C/C++ but not sure…
Wiggler Jtag
  • 669
  • 8
  • 23
3
votes
1 answer

Audio Classification based on FFT

In Python, I'm performing alarm recognition by checking only frequencies and amplitudes. My code takes the FFT of the 1 s sound, then compares it with the predetermined frequencies and their amplitudes. Since alarms contain higher frequencies…
2
votes
0 answers

Tensorflow sound recognition tutorial gives error: op_def = op_dict[node.op] KeyError: 'DecodeWav'

I am trying to import a pretrained TensorFlow model (the default sound recognition one in the tutorial) and I keep getting this error. I tried importing using both a checkpoint file and a .pb file, and as a beginner, I have no idea what this error means.…
utsal
  • 21
  • 2
2
votes
2 answers

"Sound" Recognition in Swift?

I'm working on an application in Swift and I was thinking about a way to get non-speech sound recognition into my project. I mean, is there a way in which I can take in sound inputs and match them against some predefined sounds already incorporated in…
ritvik1512
  • 303
  • 1
  • 3
  • 10
1
vote
1 answer

Dictating in MS Visual C#?

I want to make a text editor, but one that is different from other text editors: it will take commands from my voice. For example, I will say my name, "Ibrahim AKGUN", into the microphone, then the program will capture it and write it down like a virtual secretary :) This is…
İbrahim Akgün
  • 1,527
  • 11
  • 37
  • 63
1
vote
0 answers

Appium CodeceptJS - Mock sound recording - Android

I've been automating testing of mobile apps for a year now, but this is the first time that part of an app records the user's voice (it's actually a 3-step process: first a recording of "silence" to calibrate to the background noise, second a prolonged…
Pristo
  • 11
  • 1
1
vote
0 answers

Sound recognition of beep to start and stop recording MP3 stream

I'm trying to make a Python script that would record local news from an internet radio (MP3) stream like this one. The news segment starts and ends with two distinct beeps. Until now, my solution has been extremely slow and unoptimized: Use…
1
vote
0 answers

Footsteps recognition on Arduino Nano 33 BLE

I'm trying to build an intrusion detection device by training a model on footstep sounds. I'm trying to do it using TensorFlow. The problem is that this example does not work at all: I've tried to create a custom word called "footsteps",…
Riccardo
  • 47
  • 6
1
vote
0 answers

Is there any way to compare two spectrograms?

I am trying to compare the spectrograms of two different spoken words. Is there any way to compare them and say that these two spectrograms are from different words? Hello (word I spoke), Play (word I spoke)
Ashish
  • 997
  • 6
  • 20