会议专题

Methods to Improve Gaussian Mixture Model for Language Identification

Multiple-language phone recognition and n-gram language modeling produce the best performance in formal language identification (LID) evaluations, but this method needs many kinds of data that were phonetically transcribed utterances. This phone-based system isnt easily to apply for dialects or minority languages identification, because it is difficulty to obtain kinds of utterances which were orthographically or phonetically transcribed. Gaussian mixture model with the Universal Background Model (GMM UBM), which has been successfully employed in speaker verification, is an effective approach to solve this problem. The GMM -UBM LID system, which didnt obtain utterances that were orthographically or phonetically transcribed, reduces the time requirement for both training and testing. In this paper we proposed a new combination method utilizing Output Scores Fusion techniques for acoustic scores and language model scores to improve the performance of the GMMUBM based LID system. Experiment results show that the combination method as described above is another efficient method for language identification problems.

GMM-UBM Language Model Scores Fusion LDA GMM Rcognizer

Yonghua Xu Jian Yang Jiang Chen

School of Information Science and Engineering Yun Nan University Kunming, 650091, China

国际会议

2010 International Conference on Measuring Technology and Mechatronics Automation(ICMTMA 2010)(2010年检测技术与机电自动化国际会议)

长沙

英文

1781-1784

2010-03-13(万方平台首次上网日期,不代表论文的发表时间)