会议专题

Excited Commentator Speech Detection with Unsupervised Model Adaptation for Soccer Highlight Extraction

Soccer highlight detection is an active research topic in recent years. In this paper, we present our effort to detect an important audio keyword -excited commentator speech, which contributes to a state-ofthe-art soccer highlight extraction system. We propose an approach of using statistical classifier based on Gaussian mixture models (GMMs) with unsupervised model adaptation. The excited speech and normal speech are modeled as two GMMs, and are updated to compensate for the acoustic mismatch between training and test data via Maximum a posteriori (MAP) adaptation, starting from the pre-trained GMMs. The adaptation is operated in an unsupervised mode, since the correct classification of the test data is not known, and a first pass of detection using old GMMs is performed to produce hypothesized classification results. Experimental results demonstrate the effectiveness of the proposed approach. Based on the excited speech detection alone, we can recall 87% of the goal events.

Yi Sun Zhijian Ou Wei Hu Yimin Zhang

Department of Electronic Engineering, Tsinghua University, Beijing 100084, China Intel China Research Center, Beijing 100080, China

国际会议

第十届中国虚拟现实年会

上海

英文

747-751

2010-10-20(万方平台首次上网日期,不代表论文的发表时间)