SPORTS AUDIO SEGMENTATION AND CLASSIFICATION
The audio stream is an important component of a sports video. In this paper, we present a system for audio segmentation and classification, which can segment and classify the sports audio stream into speech, non-speech very well. The novel point in our research is that we apply the segmentation and clustering method which is often used in speaker diarization system for broadcast news to the analysis of sports videos. After the segmentation and Bayesian Information Criterion (BIC) clustering is performed, Gaussian Mixture Model (GMM) is used in the classifier to identify the kind of sound for each segment. Experiments on a database composed of 6 hour audio stream in the Eurosport TV program show that the average accuracy can reach 87.3% on segmentation and classification. This research is very useful for analyzing the content of sports videos in detail.
audio segmentation and classification sports audio GMM content analysis
Jun Huang Yuan Dong Jiqing Liu Chengyu Dong Haila Wang
Beijing University of Posts and Telecommunications, Beijing France Telecom Research & Development Center, Beijing
国际会议
北京
英文
379-383
2009-11-06(万方平台首次上网日期,不代表论文的发表时间)