An interactive way to acquire Internet documents for language model adaptation of speech recognition systems

摘要：

In this paper, a new method for language model adaptation based on users feedback in the field of speech recognition is described. Different from other methods, the proposed method conducts corpus collection and language model adaptation in an interactive way. The user can input a small quantity of texts to describe the topic or the basic idea of the speech and evaluate some of the obtained texts as good or useless. The system can learn from the interaction information and acquire textual corpus which is more relevant to the topic of the speech. Experimental results show that for a given speech recognition system using this approach the recognition accuracy is increased by 7 percentage points compared to the same system using traditional adaptation method without interaction.

关键词： speech recognition corpus acquiring users feedback language model adaptation

作者: Hong Zhang Xiangdong Wang Yueliang Qian Shouxun Lin

作者单位: Institute of Computing Technology, Chinese Academy of Sciences Beijing, China, 100190

会议类型: 国际会议

会议名称: 2011 Third International Conference on Intelligent Human-Machine Systems and Cybernetics 第三届智能人机系统与控制论国际会议 IHMSC 2011

会议地点: 杭州

会议语种:英文

页码: 97-100

在线出版日期: 2011-08-26（万方平台首次上网日期，不代表论文的发表时间）

会议专题

An interactive way to acquire Internet documents for language model adaptation of speech recognition systems