会议专题

A Two-pass Architecture for Detecting Reading Miscues

In this paper, we present a CALL system with novel two-pass architecture for sentence reading miscues detection. The research is concentrated on the effect of the language model (LM) of the system, which is necessary for recognizing what is actually spoken by the speaker. We compared the two situations of using LM or not in a one-pass baseline system at first, and found that LM can lead to relatively 60% improvement of miscue detection rate and 80% reduction of false alarm rate. However, the LM still has bad effect on detecting speech errors because the reading miscues are abnormal word sequences and can be easily depressed by it. So we propose to rescore these instances in a second-pass decoding without LM. By means of the second-pass, the miscue detection rate can be improved by 9.6% relatively and the false alarm rate can be reduced by 15.8% relatively.

Changliang Liu Fuping Pan Fengpei Ge Bin Dong Qingwei Zhao Yonghong Yan

ThinkIT Speech Lab, Institute of Acoustics, Chinese Academy of Science

国际会议

2008 International Conference on Audio,Language and Image Processing(2008国际声音、语言、图像过程大会)

镇江

英文

701-707

2008-07-07(万方平台首次上网日期,不代表论文的发表时间)