Questions tagged [audio-processing]

Audio processing involves the study of mathematical and signal processing techniques to understand or alter the nature of audio signals. The kinds of audio signals under study include speech, music, environmental audio and computer audio. Audio is analyzed in the temporal or spectral domain by applying various filters.

A key concept is to transform the audio into PCM format so that you have access to the raw audio curve. Each channel has its own curve.

Digital audio is represented by a series of points on this curve. Each point is called an audio sample. The numerical value of each sample can be represented as either an integer or a floating-point number.
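As a rough illustration of the integer and floating-point representations, the sketch below converts between signed 16-bit little-endian PCM bytes and floats in the range [-1.0, 1.0). The function names and the choice of 16-bit little-endian samples are assumptions made for this example, not something the tag description prescribes.

```python
import struct

FULL_SCALE = 32768.0  # assumed: signed 16-bit samples span -32768..32767


def pcm16_bytes_to_floats(raw):
    """Decode little-endian signed 16-bit PCM bytes into floats in [-1.0, 1.0)."""
    count = len(raw) // 2  # two bytes per sample; a trailing odd byte is ignored
    ints = struct.unpack("<%dh" % count, raw[:count * 2])
    return [value / FULL_SCALE for value in ints]


def floats_to_pcm16_bytes(samples):
    """Encode floats as little-endian signed 16-bit PCM, clamping out-of-range values."""
    ints = []
    for sample in samples:
        scaled = int(round(sample * FULL_SCALE))
        # Clamp so that +1.0 (which would need 32768) still fits in 16 bits.
        ints.append(max(-32768, min(32767, scaled)))
    return struct.pack("<%dh" % len(ints), *ints)
```

For interleaved multi-channel data, the same decoding applies and the samples of each channel can then be separated by slicing, for example `floats[0::2]` and `floats[1::2]` for stereo.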

Be aware that storing each audio sample's numerical value in memory typically requires several bytes. One byte can store only 2^8 (256) distinct values, which results in noticeable distortion. High-quality audio is typically stored using at least two bytes per audio sample. Two bytes give 2^16 (65,536) possible values for the height of the raw audio curve as the audio wobbles up and down. The more bytes used for storage, the higher the fidelity, because this reduces the gap between adjacent curve height measurements. This is called bit depth. CD-quality audio uses two bytes per audio sample per channel. The other fundamental aspect of digital audio is the sample rate, which determines the number of samples per second.
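The arithmetic behind bit depth and sample rate can be sketched in a few lines. The helper names here are hypothetical, and the CD figures used in the usage note (44,100 samples per second, two channels) are the standard CD audio parameters rather than values stated above.

```python
def quantization_levels(bytes_per_sample):
    """Number of distinct curve heights available at a given sample width."""
    return 2 ** (8 * bytes_per_sample)


def pcm_bytes_per_second(sample_rate, bytes_per_sample, channels):
    """Raw (uncompressed) PCM data rate in bytes per second."""
    return sample_rate * bytes_per_sample * channels


def quantize(value, bytes_per_sample):
    """Snap a value in [-1.0, 1.0) to the nearest height representable at that width."""
    half = quantization_levels(bytes_per_sample) // 2
    # Round to the nearest integer level, then clamp to the signed range.
    level = max(-half, min(half - 1, round(value * half)))
    return level / half
```

With these helpers, `quantization_levels(1)` is 256 and `quantization_levels(2)` is 65536, and `pcm_bytes_per_second(44100, 2, 2)` gives 176400 bytes per second for CD-style stereo audio. Comparing `quantize(0.3, 1)` with `quantize(0.3, 2)` shows the smaller gap between levels at the higher bit depth.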

556 questions

616

votes

13 answers

How can I extract audio from video with ffmpeg?

I tried the following command to extract audio from video: ffmpeg -i Sample.avi -vn -ar 44100 -ac 2 -ab 192k -f mp3 Sample.mp3 but I get the following output libavutil 50.15. 1 / 50.15. 1 libavcodec 52.72. 2 / 52.72. 2 libavformat 52.64. 2…

asked Mar 28 '12 at 17:51

user1269669

6,177
3
14
5

votes

8 answers

Algorithms for determining the key of an audio sample

I am interested in determining the musical key of an audio sample. How would (or could) an algorithm go about trying to approximate the key of a musical audio sample? Antares Autotune and Melodyne are two pieces of software that do this sort of…

algorithm audio analysis sampling audio-processing

asked Jun 29 '10 at 14:58

Alex

4,844
7
44
58

votes

4 answers

How can I Compare 2 Audio Files Programmatically?

I want to compare 2 audio files programmatically. For example: I have a sound file in my iPhone app, and then I record another one. I want to check if the existing sound matches the recorded sound or not ( - similar to voice recognition). How can I…

iphone ipad audio-processing

asked Aug 08 '11 at 08:18

Shishir.bobby

10,994
21
71
100

votes

1 answer

Building audio processing Little Endian SDK with NDK

I am trying to use ndk-build to use native code for audio processing from Little Endian in an Android application (I don't have JNI yet). When I executed ndk-build in jni dir I got ($USER_PATH is path to directory on my computer): Android NDK:…

android c++ android-ndk audio-processing

asked Aug 21 '14 at 09:37

piobab

1,352
2
13
21

votes

3 answers

deeplearning4j - using an RNN/LSTM for audio signal processing

I'm trying to train a RNN for digital (audio) signal processing using deeplearning4j. The idea is to have 2 .wav files: one is an audio recording, the second is the same audio recording but processed (for example with a low-pass filter). The RNN's…

java machine-learning audio-processing deeplearning4j

asked May 06 '17 at 21:44

erikd71

votes

1 answer

How to get below 10ms latency using WASAPI shared mode?

According to Microsoft, starting with Windows 10, applications using shared-mode WASAPI can request buffer sizes smaller than 10ms (see https://msdn.microsoft.com/en-us/library/windows/hardware/mt298187%28v=vs.85%29.aspx). According to the article,…

c++ audio audio-processing wasapi

asked May 26 '16 at 18:16

Sjoerd van Kreel

1,000
6
19

votes

2 answers

How do I use audio sample data from Java Sound?

This question is usually asked as a part of another question but it turns out that the answer is long. I've decided to answer it here so I can link to it elsewhere. Although I'm not aware of a way that Java can produce audio samples for us at this…

java audio javasound audio-processing javax.sound.sampled

asked Nov 09 '14 at 03:28

Radiodef

37,180
14
90
125

votes

3 answers

Perceptual similarity between two audio sequences

I would like to get some sort of distance measure between two pieces of audio. For example, I want to compare the sound of an animal to the sound of a human mimicking that animal, and then return a score of how similar the sounds were. It seems like…

algorithm audio audio-processing

asked Jul 04 '10 at 18:42

Bart

1,077
1
12
22

votes

4 answers

Detecting wind noise

I want to develop an app for detecting wind from the audio stream. I need some expert thoughts here, just to give me guidelines or some links. I know this is not an easy task, but I am planning to put a lot of effort in here. My plan is to detect…

algorithm language-agnostic signal-processing audio-processing

asked Nov 14 '11 at 10:52

Lukap

31,523
64
157
244

votes

1 answer

How to setup for record and playback audio on Mac. VOIP app on Mac

I want to record and playback audio in Mac. Now, I have some problems about the settings for Input/Output/ChannelFormat … I showed you some code I try below. // Setup audio device - (OSStatus) setupAudioDevice { // It's oks …

macos audio voip audio-processing echo-cancellation

asked Aug 08 '16 at 04:29

Long Pham

7,464
3
29
40

votes

4 answers

Bpm audio detection Library

I'm looking for a library that simplifies tempo/bpm audio detection. Something similar to this http://adionsoft.net/bpm/ , but for use on *NIX machines. Any language, but preference goes to php, perl, python.

audio-processing

asked Jan 25 '09 at 17:18

user58815

votes

2 answers

Audio and Signal Processing in Haskell

Do you know of any active attempts at audio synthesis / signal processing in Haskell? Either for live performance or just for offline processing? I am not looking for libraries relying on an external tool (like bindings to SuperCollider). I have found…

haskell signal-processing audio-processing

asked Feb 01 '13 at 15:47

alpheccar

votes

6 answers

Sound sample recognition library/code

I don't want sound-to-text software. What I need is the following: I'll record multiple (say 50+) audio streams (recordings of radio stations). From those recordings, I'll mark interesting audio clips - their length ranges from 2 to 60 seconds -…

audio signal-processing audio-processing

asked May 12 '10 at 09:54

Daniel Mošmondor

19,718
12
58
99

votes

9 answers

Music Recognition and Signal Processing

I want to build something similar to Tunatic or Midomi (try them out if you're not sure what they do) and I'm wondering what algorithms I'd have to use; The idea I have about the workings of such applications is something like this: have a big…

php python audio signal-processing audio-processing

asked Jan 14 '10 at 23:21

Alix Axel

151,645
95
393
500

votes

4 answers

Algorithm to get the Key and Scale from musical notes?

From a series of MIDI notes stored in array (with MIDI note number), does an algorithm exist to get the most likely key or scale implied by these notes?

algorithm midi audio-processing

asked Feb 06 '13 at 17:01

Amr Hesham
