会议专题

HMM-based Speech Synthesis with a Flexible Mandarin Stress Adaptation Model

Expressive speech synthesis has recently received much attention. Stress is one key issue which may improve the expressiveness of the synthetic speech. However, rare work was done in Mandarin stress prediction and expression. This paper presents a HMM-based expressive speech synthesis system which supports Mandarin stress synthesis. Mandarin stress was automatically predicted with textual features only using a Maximum Entropy Model. The linear adaptation model was extracted from a large corpus by analyzing their stress related acoustic features. The advantage of the proposed model is it can be easily modified to build a system with another speaking style or emotion. Experiments show that the proposed stress adaptation system can convey stress effectively and generate high expressive speech. The overall performance of the synthetic speech is also improved.

prosody Mandarin stress HMM-based speech synthesis expressive synthesis

Ya Li Shifeng Pan Jianhua Tao

National Laboratory of Pattern Recognition,Institute of Automation, Chinese Academy of Sciences, 100190, Beijing, China

国际会议

2010 IEEE 10th International Conference on Signal Processing(第十届信号处理国际会议 ICSP 2010)

北京

英文

625-628

2010-08-24(万方平台首次上网日期,不代表论文的发表时间)