HMM Based Speech Synthesis with Global Variance Training Method

摘要：

Although Hidden Markov Model based speech synthesis has been proved to have good performance, there are still some factors which degrade the quality of synthesized speech: vocoder, model accuracy and oversmoothing. Experimental results show that oversmoothing in frequency domain mainly affect the quality of synthesized speech whereas over-smoothing in time domain can nearly be ignored. Time domain over-smoothing is generally caused by model structure accuracy problem and frequency domain over-smoothing is caused by training algorithm accuracy problem. MLestimation based parameter training algorithm causes distortion of perception in speech synthesis. The talk will introduce a Global Variance (FV) based Training method into the HTS training structure. The new method tries to enlarge the variance of the spectrum and FO generation. The experiments show that the method improves the synthesizing performance both in voice quality and the expressiveness.

作者: Jianhua Tao

作者单位: Institute of Automation Chinese Academy of Sciences, China

会议类型: 国际会议

会议名称: 2010 4th International Universal Communication Symposium(第四届国际普遍交流学术研讨会 IUCS 2010)

会议地点: 北京

会议语种:英文

页码: 47

在线出版日期: 2010-10-18（万方平台首次上网日期，不代表论文的发表时间）

会议专题

HMM Based Speech Synthesis with Global Variance Training Method