A novel stream-weight method for the multi-stream speech recognition system

摘要：

A multi-stream speech recognition system is based on the combination of multiple complementary feature streams. Utilizing the fusion scheme of multi-stream, better performance was achieved in speech recognition system. The stream-weight method plays a very important role in the fusion collaborative scheme. The stream weights should be selected to be proportional to the feature stream reliability and informativeness. The posterior probability estimate is a measure of reliability,and the classification error is a measure of informativeness. The larger separation between class distributions in a given stream implies better discriminative power. The intra-class distances are an estimate of the class variance. The inter-and intra-class distances are combined to yield and estimate of the misclassification error for each stream. An unsupervised stream weight estimation method for multi-stream speech recognition system based on the computation of intra-and inter-class distances in each stream is proposed here. Experiments are conducted using Chinese Academy of Science speech database. Applying the new streamweigh algorithm, we achieve better fusion performance compared with some traditional fusion methods, and the word error rate was decreased by 6％.

关键词： multistream framework stream weighst interclass distance intra-class distance hidden markov models linear predictive cepstral coefficient mel-frequency cepstral coefficient

作者: Hongyu Guo Xiaoqun Zhao Hongmiao Guo Hongyu Guo

作者单位: School of Electronics and Information Engineering Tongji University Shanghai, China Engineering Training Center YanShan University Qinhuandao , Hebei Province, China School of Information Shanghai Ocean University Shanghai, China

会议类型: 国际会议

会议名称: 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems(2010 IEEE 智能计算与智能系统国际会议 ICIS 2010)

会议地点: 厦门

会议语种:英文

页码: 179-182

在线出版日期: 2010-10-29（万方平台首次上网日期，不代表论文的发表时间）

会议专题

A novel stream-weight method for the multi-stream speech recognition system