Methods to Improve Gaussian Mixture Model for Language Identification

摘要：

Multiple-language phone recognition and n-gram language modeling produce the best performance in formal language identification (LID) evaluations, but this method needs many kinds of data that were phonetically transcribed utterances. This phone-based system isnt easily to apply for dialects or minority languages identification, because it is difficulty to obtain kinds of utterances which were orthographically or phonetically transcribed. Gaussian mixture model with the Universal Background Model (GMM UBM), which has been successfully employed in speaker verification, is an effective approach to solve this problem. The GMM -UBM LID system, which didnt obtain utterances that were orthographically or phonetically transcribed, reduces the time requirement for both training and testing. In this paper we proposed a new combination method utilizing Output Scores Fusion techniques for acoustic scores and language model scores to improve the performance of the GMMUBM based LID system. Experiment results show that the combination method as described above is another efficient method for language identification problems.

关键词： GMM-UBM Language Model Scores Fusion LDA GMM Rcognizer

作者: Yonghua Xu Jian Yang Jiang Chen

作者单位: School of Information Science and Engineering Yun Nan University Kunming, 650091, China

会议类型: 国际会议

会议名称: 2010 International Conference on Measuring Technology and Mechatronics Automation(ICMTMA 2010)(2010年检测技术与机电自动化国际会议)

会议地点: 长沙

会议语种:英文

页码: 1781-1784

在线出版日期: 2010-03-13（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Methods to Improve Gaussian Mixture Model for Language Identification