Selecting Optimal Non-uniform Units for Hierarchical Unit Selection
For concatenative speech synthesis based on non-uniform unit selection, the key to improve the synthetic quality is the careful designing of measuring criteria respect to the units adopted. With our previous hierarchical non-uniform unit selection framework 1, two measurements for selecting optimal non-uniform units during searching at different layers are proposed in this paper, including inter-syllable pitch control and spectra distance by phonetic context. These measures are used as components of our cost function, especially for boundaries in front of syllables starting with voiceless consonants. Experiment shows it outperforms our previous system.
Jun Xu Dezhi Huang Yuan Dong Lianhong Cai Haila Wang
Department of Computer Science and Technology,Tsinghua University, Beijing, China Speech and Natural Language Processing Unit, France Telecom R&D Beijing, China Department of Computer Science and Technology, Tsinghua University, Beijing, China
国际会议
2008 International Conference on Audio,Language and Image Processing(2008国际声音、语言、图像过程大会)
镇江
英文
1610-1614
2008-07-07(万方平台首次上网日期,不代表论文的发表时间)