Control of Fundamental Frequency Contours Using the Generation Process Model in HMM–Based Speech Synthesis
A method was proposed to increase the naturalness of prosody generated with speech synthesis based on hidden Markov models (HMMs). This method adds a constraint to the fundamental frequency contours (F0 contours) during the HMM-based speech synthesis. The constraint adopted is the generation process model of F0 contours (F0 model). The method first extracts the F0 model parameters from the original F0 contour (generated by the HMM-based speech synthesis) and then optimizes them successively by referring to the pre-trained HMMs. The experimental results show that the proposed method can improve naturalness when the F0 control by the original method is inadequate.
HMM-based speech synthesis generation process model F0 contour
Tetsuya Matsuda Keikichi Hirose Nobuaki Minematsu
Graduate School of Information Science and Technology, The University of Tokyo 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
国际会议
2010 IEEE 10th International Conference on Signal Processing(第十届信号处理国际会议 ICSP 2010)
北京
英文
617-620
2010-08-24(万方平台首次上网日期,不代表论文的发表时间)