Effective Modeling of Acoustic Confusions for Mandarin CALL System
Acoustic confusions degrade the accuracy of pronunciation assessment severely in Computer Assisted Language Learning (CALL) systems.This paper presents our recent study on effective modeling of the acoustic confusions.We change the traditional Mandarin syllable structure,which is composed of initial and final,to a novel phoneme structure.Several phoneme splitting strategies are investigated,and the question list used for building and merging decision tree is studied.Experiments show that the optimal phoneme splitting strategy outperforms the traditional initial-final structure in our CALL system,with relative 11.05% ASER improvement for nasal finals.This idea may be extended to improve the performance of automatic speech recognition (ASR).
Fengpei Ge Fuping Pan Changliang Liu Bin Dong Yonghong Yan
ThinkIT laboratory,Institute of Acoustics,Chinese Academy of Sciences Beijing 100190,P.R.China
国际会议
9th International Conference on Signal Processing(第九届国际信号处理学术会议)(ICSP08)
北京
英文
2008-10-26(万方平台首次上网日期,不代表论文的发表时间)