An Investigation on the Mandarin Prosody of a Parallel Multi-Speaking Rate Speech Corpus

Abstract:

In this paper, the prosody of a parallel multi-speaking rate Mandarin read speech corpus is investigated. The corpus contains four parallel speech datasets uttered by a female professional announcer at speech rates (SRs) of 4.40 (fast), 3.82 (normal), 2.97 (median) and 2.45 (slow) syllables/second. Using the previously proposed unsupervised joint prosody labeling and modeling (PLM) method, the relationships between SR and various prosodic features, including pause duration, patterns of three high-level prosodic constituents, and break labels, are investigated. The analyses reported in this study could inform the development of prosody generation mechanisms for text-to-speech and of prosody modeling for automatic speech recognition at various SRs.

Authors: Chen-Yu Chiang, Cheng-Chang Tang, Hsiu-Min Yu, Yih-Ru Wang, Sin-Horng Chen

Affiliations: Department of Communication Engineering, National Chiao Tung University, Taiwan; Language Center, Chung Hua University, Taiwan

Conference type: International conference

Conference name: 2009 Oriental COCOSDA International Conference on Speech Database and Assessments

Conference location: Beijing

Conference language: English

Pages: 148-153

Online publication date: 2009-08-10 (date first posted on the Wanfang platform; not the paper's publication date)
