会议专题

Semi-supervised Learning of Bottleneck Feature for Music Genre Classification

  A good representation of the audio is important for music genre classification.Deep neural networks(DNN)enable a better approach to learn the representation of audio.The representation learned from DNN,which is known as bottleneck feature,is widely used for speech and audio related application.However,in general,it needs a large amount of transcribed data to learn an effective bottleneck feature extractor.While,in reality,the amount of transcribed data is often limited.In this paper,we investigate a semi-supervised learning to train the bottleneck feature for music data.Then,the bottleneck feature is used for music genre classification.Since the target dataset contains few data,which cannot be used train a reliable bottleneck DNN,we train the DNN bottleneck extractor on a large out-of-domain un-transcribed dataset in semi-supervised way.Experimental results show that with the learned bottleneck feature,the proposed system can perform better than the state-of-the-art best methods on GTZAN dataset.

Bottleneck DNN Semi-supervised Multilingual Cross-lingual

Jia Dai Wenju Liu Hao Zheng Wei Xue Chongjia Ni

NLPR,Institute of Automation,Chinese Academy of Sciences,Beijing,China;University of Chinese Academy School of Mathematic and Quantitative Economics,Shandong University of Finance and Economics,Shandon

国际会议

第七届全国模式识别学术会议(The 7th Chinese Conference on Pattern Recognition,CCPR2016)

成都

英文

552-562

2016-11-03(万方平台首次上网日期,不代表论文的发表时间)