Speech timing and cross-linguistic studies towards computational human modeling

摘要：

In this paper, we introduce Japanese segmental duration characteristics and computational modeling that we have been studying for around three decades in speech synthesis. A series of experimental results are also shown on loudness dependence in the duration perception. These computational duration modeling and perceptual studies on duration error sensitivity to loudness give some insights for computational human modeling of spoken language capability. As a first trial to figure out how these findings could be efficiently employed in other field like language learning, we introduce our current efforts on the objective evaluation of 2nd language speaking skill and the research consortium of AESOP (Asian English Speech cOrpus Project) where researchers in Asian countries have started to work together.

作者: Yoshinori Sagisaka Hiroaki Kato Minoru Tsuzaki Shizuka Nakamura Chatchawarn Hansakunbuntheung

作者单位: GITI / Language and Speech Science Research Laboratories, Waseda University NICT / ATR Spoken Langua NICT / ATR Media Information Science Laboratories Kyoto City University of Arts GITI / Language and Speech Science Research Laboratories, Waseda University

会议类型: 国际会议

会议名称: 2009 Oriental COCOSDA International Conference on Speech Database and Assessments(2009 国际语音交互标准数据评估技术大会)

会议地点: 北京

会议语种:英文

页码: 1-8

在线出版日期: 2009-08-10（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Speech timing and cross-linguistic studies towards computational human modeling