Fast Speech Keyword Recognition Based on Improved Filler Model

摘要：

　　Most traditional template matching based keyword recognition methods dont need training data,just rely on frame matching.However,the recognition speed is relatively slow and it cant be used in practice.The LVCSR-based method needs to convert the speech signal into text signal before recognition,which has an important impact on the final recognition performance.In this paper,we propose a method based on the filler model framework,which selects the syllable instead of using words as the modelling unit.The search space of our method is composed of all the syllables rather than words.By fixing a part of the Hidden Markov Model(HMM)state probability matrix parameters,our method can obtain important model parameters for a more sufficient training.Meanwhile,a two-stage model training strategy is proposed to reduce the artificial markings of training speech and Linear Discriminant Analysis(LDA)is introduced to improve the efficiency of system identification.Experimental results show that our method can effectively improve the detection rate of keywords and achieve similar detection time under the same conditions.

关键词： spoken keywords detection filler model HMM LDA

作者: Yang Wang Jie Yang Le Zhang

作者单位: College of Information Engineering,Wuhan University of Technology;Key Laboratory of Fiber Optic Sens Department of Electronic & Electrical Engineering,University of Sheffield,Sheffield,United Kingdom

会议类型: 国际会议

会议名称: 2017 IEEE 2nd Advanced Information Technology,Electronic and Automation Control Conference(IAEAC 2017)(2017 IEEE 第2届先进信息技术、电子与自动化控制国际会议)

会议地点: 重庆

会议语种:英文

页码: 530-534

在线出版日期: 2017-03-25（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Fast Speech Keyword Recognition Based on Improved Filler Model