Integrating Acoustic and Lexical Features in Topic Segmentation of Chinese Broadcast News Using Maximum Entropy Approach

Abstract:

This paper studies how to integrate multi-modal features for automatic topic segmentation of Mandarin broadcast news. The feature integration problem is formulated within the Maximum Entropy (MaxEnt) scheme for topic boundary classification: the model maximizes entropy while respecting all known constraints (i.e., the contributions of the individual features). We consider two types of features: (1) acoustic features, which reflect the editorial prosody of broadcast news, including pause duration, speaker change, and speech type; and (2) lexical features extracted from speech recognition transcripts, which capture semantic shifts between topics, including two local cohesiveness features and a new boundary indicator based on overall cohesiveness. Unlike the local lexical features, the new overall cohesiveness feature maximizes the lexical cohesiveness of all topic fragments and reflects the fact that topic transitions in broadcast news are smooth and the distributional variations are subtle. Experiments show a clear performance improvement in topic segmentation of Chinese broadcast news when acoustic and lexical features are fused within the MaxEnt scheme.
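As a rough illustration of the classification step described in the abstract, the sketch below trains a binary MaxEnt model to label candidate positions as topic boundaries or non-boundaries. For two classes, a MaxEnt model has the same form as logistic regression, which is what the code implements. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names `train_maxent` and `boundary_probability`, the three toy features (pause duration in seconds, a speaker-change flag, and a lexical similarity score across the candidate boundary), the toy data values, and the use of plain gradient ascent with L2 regularization.

```python
import math


def sigmoid(z):
    """Logistic function; maps a score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def train_maxent(samples, labels, epochs=2000, lr=0.5, l2=1e-3):
    """Fit a binary MaxEnt (logistic regression) model by gradient ascent.

    samples: list of feature vectors (hypothetical features, see below)
    labels:  list of 0/1 values, 1 meaning "topic boundary"
    Returns (weights, bias).
    """
    n_features = len(samples[0])
    n = len(samples)
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(samples, labels):
            p = sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))
            err = y - p  # gradient of the log-likelihood w.r.t. the score
            for j, xi in enumerate(x):
                grad_w[j] += err * xi
            grad_b += err
        # Average gradient step with an L2 penalty on the weights only.
        for j in range(n_features):
            weights[j] += lr * (grad_w[j] / n - l2 * weights[j])
        bias += lr * grad_b / n
    return weights, bias


def boundary_probability(weights, bias, x):
    """Probability that the candidate position x is a topic boundary."""
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


# Toy data (invented for illustration only):
# [pause duration in seconds, speaker change (0/1), lexical similarity]
SAMPLES = [
    [1.8, 1, 0.10],
    [1.5, 1, 0.15],
    [2.0, 0, 0.05],
    [0.2, 0, 0.80],
    [0.3, 0, 0.70],
    [0.4, 1, 0.75],
]
LABELS = [1, 1, 1, 0, 0, 0]

if __name__ == "__main__":
    w, b = train_maxent(SAMPLES, LABELS)
    # A long pause with a speaker change and low lexical similarity.
    print(boundary_probability(w, b, [1.9, 1, 0.08]))
    # A short pause, same speaker, high lexical similarity.
    print(boundary_probability(w, b, [0.25, 0, 0.78]))
```

In this sketch the acoustic and lexical cues are simply concatenated into one feature vector, so the learned weights determine how much each cue contributes to the boundary decision. The paper's actual feature definitions and training procedure are not given in the abstract and may differ.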

Authors: Lei Xie, Yulian Yang, Zhi-Qiang Liu, Wei Feng, Zihan Liu

Affiliations: School of Computer Science, Northwestern Polytechnical University, Xi'an, China; School of Creative Media, City University of Hong Kong, Kowloon, Hong Kong SAR, China

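Conference type: International conference

Conference name: 2010 International Conference on Audio, Language and Image Processing (ICALIP 2010)

Conference location: Shanghai

Conference language: English

Pages: 407-413

Online publication date: 2010-11-23 (date first posted on the Wanfang platform; not the paper's publication date)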
