OBJECTIONABLE AUDIO CONTENT RECOGNITION BASED ON IN-CLASS CLUSTERING METHOD
This paper addresses automatic recognition of adult video sequences through detection of their characteristic pornographic sounds; the detector can serve as a verification step, as a supplement to other methods, or as an independent detector. We first analyze the features that characterize erotic sounds. Our statistics and experiments show that subband energies, the μ-spectral centroid, the mean of the short-time zero-crossing rates, and the high short-time zero-crossing rate ratio play important roles in discriminating erotic audio files. Because the data both within and outside the erotic audio class are complex, we propose in-class clustering, which selects the most representative subclass for training and classification. Together these measures increase the recall rate and decrease the false positive rate. Experiments on real data from the Internet show that the proposed method performs well, achieving a recall rate of 85.35% and a false positive rate of 15.46%.
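As a rough illustration of the frame-level features named in the abstract, the sketch below computes short-time zero-crossing rates, a high zero-crossing rate ratio, a spectral centroid, and normalized subband energies with NumPy. It is not the authors' implementation. The frame length, hop size, number of subbands, and the 1.5 threshold factor are all assumptions; the high zero-crossing rate ratio follows a definition common in the speech/music discrimination literature; and the plain spectral centroid stands in for the paper's μ-spectral centroid, whose definition is not given here.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Slice a 1-D signal into overlapping frames (one frame per row)."""
    n = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n)[:, None]
    return x[idx]

def zero_crossing_rate(frames):
    """Fraction of adjacent sample pairs in each frame that change sign."""
    signs = np.sign(frames)
    signs[signs == 0] = 1  # treat exact zeros as positive
    return np.mean(np.abs(np.diff(signs, axis=1)) > 0, axis=1)

def hzcrr(zcr, factor=1.5):
    """Share of frames whose ZCR exceeds `factor` times the mean ZCR.
    The factor 1.5 is an assumed value, not taken from the paper."""
    return float(np.mean(zcr > factor * zcr.mean()))

def spectral_centroid(frames, sr):
    """Magnitude-weighted mean frequency of each Hann-windowed frame, in Hz."""
    window = np.hanning(frames.shape[1])
    mag = np.abs(np.fft.rfft(frames * window, axis=1))
    freqs = np.fft.rfftfreq(frames.shape[1], d=1.0 / sr)
    return (mag @ freqs) / (mag.sum(axis=1) + 1e-12)

def subband_energies(frames, n_bands=4):
    """Per-frame energy in `n_bands` equal-width bands, normalized to sum to 1."""
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    bands = np.array_split(power, n_bands, axis=1)
    e = np.stack([b.sum(axis=1) for b in bands], axis=1)
    return e / (e.sum(axis=1, keepdims=True) + 1e-12)

if __name__ == "__main__":
    sr = 16000                                   # assumed sampling rate
    t = np.arange(sr) / sr
    x = np.sin(2 * np.pi * 1000 * t + 0.3)       # synthetic 1 kHz test tone
    frames = frame_signal(x, frame_len=400, hop=200)
    zcr = zero_crossing_rate(frames)
    print("mean ZCR:", zcr.mean())
    print("HZCRR:", hzcrr(zcr))
    print("mean centroid (Hz):", spectral_centroid(frames, sr).mean())
    print("mean subband energies:", subband_energies(frames).mean(axis=0))
```

In a pipeline of the kind the abstract describes, per-clip statistics of these values (for example the mean ZCR and the HZCRR) would form the feature vector passed to a classifier such as an SVM.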
erotic sounds; in-class clustering; audio classification; feature extraction; support vector machine (SVM)
Ziqiang Shi, Boyang Gao, Tieran Zheng, Jiqing Han
School of Computer Science and Technology, Harbin Institute of Technology, Harbin
International conference
Beijing
English
712-716
2009-11-06 (date first posted online on the Wanfang platform; not necessarily the paper's publication date)