Intonation Evaluation of English Utterances using Synthesized Speech for Computer-Assisted Language Learning

摘要：

In this paper, we describe a system for intonation evaluation of English utterance by Japanese native speakers using synthesized speech for rapid development of a CALL system. To evaluate the intonation of learners utterance, we need reference utterances, for which English native speakers utterances should be used. However, it is costly to gather native speakers utterances for all sentences in the system. Therefore, we examined an intonation evaluation method using synthesized speech generated by text-to-speech systems instead of real speech. Intonation evaluation system calculates scores between a learners utterance and corresponding utterances by the teachers. We investigated a method of combining multiple scores. In addition, we incorporated a feature for rhythm evaluation into intonation evaluation. As a result, we obtained improvement of correlation between scores by human evaluators and the system. Furthermore, we analyzed a tendency of intonation evaluation by the system through limiting evaluation utterances to find out what degrades the system performance.

关键词： CALL prosody intonation Mahalanobis distance multiple regression

作者: Tomoaki KONNO Masashi ITO Motoyuki SUZUKI Akinori ITO Shozo MAKINO

作者单位: Graduate School of Engineering,Tohoku University Sendai,Miyagi,Japan Institute of Technology and Science,The University of Tokushima Tokushima,Tokushima,Japan

会议类型: 国际会议

会议名称: The 2008 IEEE International Conference on Natural Language Processing and Knowledge Engineering(IEEE NLP-KE 2008)(2008IEEE自然语言处理与知识工程国际会议)

会议地点: 北京

会议语种:英文

在线出版日期: 2008-10-19（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Intonation Evaluation of English Utterances using Synthesized Speech for Computer-Assisted Language Learning