An Effective and Efficient Method for Query by Humming System Based on Multi-Similarity Measurement Fusion
Because humming or singing is the most natural way for people to search for a specific melody in a large music database, query by humming/singing (QBH) is attracting growing attention from researchers in content-based music information retrieval. Note-based and frame-based similarity measures are the two commonly used approaches to this task. Previous work, however, has focused on only one of the two. In this paper, we propose a novel scheme that takes advantage of two different similarity measurements to improve both retrieval accuracy and retrieval speed. First, Earth Mover's Distance (EMD), which is note-based and much faster, is used to eliminate the most unlikely candidates. Then, Dynamic Time Warping (DTW), which is frame-based and more accurate, is run on the surviving candidates. Finally, fusion strategies for these two similarity measurements are employed to improve the performance of the whole system. Experiments show that our approach achieves 92.9% accuracy on the database used in the MIREX 2006 QBH contest, which is better than the systems that participated in that task.
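The two-stage scheme described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the function names (`emd_1d`, `dtw`, `retrieve`), the data layout (notes as `(pitch, duration)` pairs, frames as a pitch value per frame), the 1-D pitch-only EMD, the `keep` cutoff, and the weighted-sum fusion are all hypothetical. The abstract does not specify the fusion strategies, and key transposition and tempo differences are not handled here.

```python
def emd_1d(notes_a, notes_b):
    """1-D Earth Mover's Distance between two note lists.

    Each note is (pitch, duration). Durations are normalised into weights,
    and the distance is the area between the two cumulative distributions.
    """
    def normalise(notes):
        total = sum(d for _, d in notes)
        return sorted((p, d / total) for p, d in notes)

    a, b = normalise(notes_a), normalise(notes_b)
    points = sorted({p for p, _ in a} | {p for p, _ in b})
    cdf_a = cdf_b = dist = 0.0
    ia = ib = 0
    for k, x in enumerate(points[:-1]):
        while ia < len(a) and a[ia][0] <= x:
            cdf_a += a[ia][1]
            ia += 1
        while ib < len(b) and b[ib][0] <= x:
            cdf_b += b[ib][1]
            ib += 1
        dist += abs(cdf_a - cdf_b) * (points[k + 1] - x)
    return dist


def dtw(x, y):
    """Classic DTW cost between two frame-level pitch sequences."""
    inf = float("inf")
    prev = [0.0] + [inf] * len(y)
    for i in range(1, len(x) + 1):
        cur = [inf] * (len(y) + 1)
        for j in range(1, len(y) + 1):
            cost = abs(x[i - 1] - y[j - 1])
            cur[j] = cost + min(prev[j], cur[j - 1], prev[j - 1])
        prev = cur
    return prev[len(y)]


def retrieve(query, database, keep=2, weight=0.5):
    """Filter with EMD, rescore survivors with DTW, then fuse both scores.

    The fusion is a hypothetical weighted sum of max-scaled distances.
    """
    emd_scores = {n: emd_1d(query["notes"], s["notes"]) for n, s in database.items()}
    survivors = sorted(emd_scores, key=emd_scores.get)[:keep]  # stage 1: fast filter
    dtw_scores = {n: dtw(query["frames"], database[n]["frames"]) for n in survivors}

    def scale(scores):
        top = max(scores.values()) or 1.0  # avoid dividing by zero
        return {n: v / top for n, v in scores.items()}

    e = scale({n: emd_scores[n] for n in survivors})
    d = scale(dtw_scores)
    fused = {n: weight * e[n] + (1 - weight) * d[n] for n in survivors}
    return sorted(survivors, key=fused.get)  # best match first


query = {"notes": [(60, 1.0), (62, 1.0), (64, 2.0)],
         "frames": [60, 60, 62, 62, 64, 64, 64, 64]}
database = {
    "a": {"notes": [(60, 1.0), (62, 1.0), (64, 2.0)],
          "frames": [60, 60, 62, 62, 64, 64, 64, 64]},
    "b": {"notes": [(60, 1.0), (61, 1.0), (65, 2.0)],
          "frames": [60, 60, 61, 61, 65, 65, 65, 65]},
    "c": {"notes": [(72, 2.0), (70, 2.0)],
          "frames": [72, 72, 72, 72, 70, 70, 70, 70]},
}
ranking = retrieve(query, database)
```

In this toy setup the cheap EMD pass discards the most distant song before the more expensive DTW pass runs, which is the source of the speed gain the abstract claims; the fused ranking then orders only the survivors.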
Lei Wang, Shen Huang, Sheng Hu, Jiaen Liang, Bo Xu
Digital Content Technology Research Center, Institute of Automation, Chinese Academy of Sciences, Be
International conference
2008 International Conference on Audio, Language and Image Processing
Zhenjiang
English
471-475
2008-07-07 (date first made available online on the Wanfang platform; not the paper's publication date)