A State Duration Generation Algorithm Considering Global Variance for HMM-based Speech Synthesis

摘要：

The speech parameter generation algorithm considering global variance (GV) for HMM-based speech synthesis proved to be effective against the over-smoothing problem. In this paper this idea is extended to the generation of state duration. A GV model on syllable duration is proposed and a state duration generation algorithm considering this GV model is presented in details. By improving the GV likelihood on syllable duration, the over-averaging effect on generated state duration is much alleviated. Experimental results are promising which show that the proposed method outperforms the conventional one and the naturalness of synthetic speech is improved.

作者: Shifeng Pan Jianhua Tao Yang Wang

作者单位: National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Science, Beijing

会议类型: 国际会议

会议名称: 2011亚太信号与信息处理协会年度峰会(APSIPAASC 2011)

会议地点: 西安

会议语种:英文

页码: 1-5

在线出版日期: 2011-10-18（万方平台首次上网日期，不代表论文的发表时间）

会议专题

A State Duration Generation Algorithm Considering Global Variance for HMM-based Speech Synthesis