Sound sample recognition library/code

audio signal-processing audio-processing

Daniel Mošmondor · May 12, 2010 · Viewed 18.2k times · Source

I don't want sound-to-text software. What I need is the following:

I'll record multiple (say 50+) audio streams (recordings of radio stations)
from that recordings, I'll mark interesting audio clips - their length ranges from 2 to 60 seconds - there will be few thousands of such audio clips
library should be able to find other instances of same audio clips from recorded sound streams
confidence factor should be reported to used and additional input provided so the recognition could perform better next time

Do you know of such software library? LGPL would be most valuable to me, but I can go for commercial license as well.

Audio clips will contain both music, text, effects, or any combination thereof. So, TEXT recognition is out of the question.

Architecture: c++, C# for glue, CUDA if possible.

Answer

I have not found any libraries (yet), but two interesting papers, which may give you terminology and background to refine your searches:

EDIT: Searching for "Audio fingerprinting" came to a page of implementations, both open source and commercial.

http://wiki.musicbrainz.org/AudioFingerprint
Picard seems to be well established, and could be useful if your clips contain music.