Sound sample recognition library/code

Daniel Mošmondor · May 12, 2010

I don't want sound-to-text software. What I need is the following:

  • I'll record multiple (say 50+) audio streams (recordings of radio stations)
  • From those recordings, I'll mark interesting audio clips, ranging from 2 to 60 seconds in length; there will be a few thousand such clips
  • The library should be able to find other instances of the same audio clips in the recorded streams
  • A confidence factor should be reported to the user, who can provide additional input so that recognition performs better next time

Do you know of such a software library? An LGPL license would be most valuable to me, but I can go for a commercial license as well.

Audio clips will contain music, spoken text, effects, or any combination thereof. So, TEXT recognition is out of the question.

Architecture: C++, C# for glue, CUDA if possible.

Answer

mdma · May 15, 2010

I have not found any libraries (yet), but I did find two interesting papers that may give you terminology and background to refine your searches:

EDIT: Searching for "audio fingerprinting" led me to a page of implementations, both open source and commercial.

Here is an introduction to Audio fingerprinting
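
To make the idea concrete, here is a minimal C++ sketch of the general fingerprint-and-match pattern. It is a toy illustration, not any published algorithm or existing library: the names `fingerprint`, `find_clip`, and `Match` are made up for this example, and the fingerprint is a deliberately crude time-domain one (one bit per frame: did the energy rise compared with the previous frame?). Real systems typically work on spectral features with overlapping frames, so treat this only as a way to see where a confidence factor could come from.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Toy fingerprint (hypothetical, for illustration only): split the signal
// into non-overlapping frames and emit one bit per frame transition,
// 1 if the frame has more energy than the previous frame, else 0.
// Samples that do not fill a whole frame at the end are ignored.
std::vector<std::uint8_t> fingerprint(const std::vector<float>& samples,
                                      std::size_t frame_size) {
    assert(frame_size > 0);
    std::vector<double> energy;
    for (std::size_t start = 0; start + frame_size <= samples.size();
         start += frame_size) {
        double e = 0.0;
        for (std::size_t i = 0; i < frame_size; ++i) {
            const double s = samples[start + i];
            e += s * s;
        }
        energy.push_back(e);
    }
    std::vector<std::uint8_t> bits;
    for (std::size_t f = 1; f < energy.size(); ++f) {
        bits.push_back(energy[f] > energy[f - 1] ? 1 : 0);
    }
    return bits;
}

// Result of a search: where the best match starts (in fingerprint bits)
// and which fraction of the clip's bits agreed there (0.0 to 1.0).
struct Match {
    std::size_t frame_offset;
    double confidence;
    bool found;
};

// Slide the clip fingerprint over the stream fingerprint and keep the
// first offset with the highest fraction of agreeing bits.
Match find_clip(const std::vector<std::uint8_t>& stream,
                const std::vector<std::uint8_t>& clip) {
    Match best{0, 0.0, false};
    if (clip.empty() || clip.size() > stream.size()) {
        return best;  // nothing to compare
    }
    for (std::size_t offset = 0; offset + clip.size() <= stream.size();
         ++offset) {
        std::size_t agree = 0;
        for (std::size_t i = 0; i < clip.size(); ++i) {
            if (stream[offset + i] == clip[i]) {
                ++agree;
            }
        }
        const double conf = static_cast<double>(agree) / clip.size();
        if (!best.found || conf > best.confidence) {
            best = Match{offset, conf, true};
        }
    }
    return best;
}
```

You would fingerprint each recorded stream once, fingerprint each marked clip, and call `find_clip` for every clip and stream pair; the returned `confidence` is the fraction of matching bits, which you could threshold or show to the user. Note that this sketch only finds clips that happen to start on a frame boundary of the stream, and the brute-force scan is quadratic, so a real implementation would need overlapping frames and an index over the fingerprints.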