Intonation Evaluation of English Utterances using Synthesized Speech for Computer-Assisted Language Learning
In this paper, we describe a system that evaluates the intonation of English utterances spoken by native speakers of Japanese, using synthesized speech to enable rapid development of a CALL system. Evaluating the intonation of a learner's utterance requires reference utterances, which should ideally be spoken by native English speakers. However, collecting native speakers' utterances for every sentence in the system is costly. We therefore examined an intonation evaluation method that uses synthesized speech generated by text-to-speech systems instead of real speech. The intonation evaluation system calculates scores between a learner's utterance and the corresponding utterances by the teachers. We investigated a method of combining multiple scores. In addition, we incorporated a feature for rhythm evaluation into the intonation evaluation. As a result, the correlation between the scores given by human evaluators and those given by the system improved. Furthermore, we analyzed the tendencies of the system's intonation evaluation by limiting the set of evaluated utterances, in order to find out what degrades system performance.
CALL; prosody; intonation; Mahalanobis distance; multiple regression
Tomoaki KONNO, Masashi ITO, Motoyuki SUZUKI, Akinori ITO, Shozo MAKINO
Graduate School of Engineering, Tohoku University, Sendai, Miyagi, Japan; Institute of Technology and Science, The University of Tokushima, Tokushima, Tokushima, Japan
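International conference
Beijing
English
2008-10-19 (date first made available online on the Wanfang platform; not the paper's publication date)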