A State Duration Generation Algorithm Considering Global Variance for HMM-based Speech Synthesis
The speech parameter generation algorithm considering global variance (GV) for HMM-based speech synthesis proved to be effective against the over-smoothing problem. In this paper this idea is extended to the generation of state duration. A GV model on syllable duration is proposed and a state duration generation algorithm considering this GV model is presented in details. By improving the GV likelihood on syllable duration, the over-averaging effect on generated state duration is much alleviated. Experimental results are promising which show that the proposed method outperforms the conventional one and the naturalness of synthetic speech is improved.
Shifeng Pan Jianhua Tao Yang Wang
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Science, Beijing
国际会议
2011亚太信号与信息处理协会年度峰会(APSIPAASC 2011)
西安
英文
1-5
2011-10-18(万方平台首次上网日期,不代表论文的发表时间)