A Chain of Gaussian Mixture Model for Text-independent Speaker Recognition

Abstract:

Text-independent speaker recognition is more flexible than text-dependent recognition. However, because of differences in phonetic content, text-independent methods usually achieve lower performance than text-dependent methods. To combine the flexibility of text-independent methods with the high performance of text-dependent methods, we propose a new modeling technique, named a chain of Gaussian Mixture Model, which encodes the temporal correlation of the training utterance in the chain structure. A special decoding network is then used to evaluate the test utterance and find the best phonetically matched segments between the test utterance and the training utterance. The experimental results indicate that the proposed method significantly improves system performance, especially for short test utterances.

Authors: Yanxiang Chen, Ming Liu

Affiliations: College of Computer Science & Information, Hefei University of Technology, Hefei, Anhui 230009, China; Department of Electrical & Computer Engineering, University of Illinois at Urbana-Champaign, Urbana,

Conference type: International conference

Conference name: 2009 Oriental COCOSDA International Conference on Speech Database and Assessments (2009 国际语音交互标准数据评估技术大会)

Conference location: Beijing

Conference language: English

Pages: 100-103

Online publication date: 2009-08-10 (date first posted on the Wanfang platform; not the paper's publication date)
