会议专题

Selecting Optimal Non-uniform Units for Hierarchical Unit Selection

For concatenative speech synthesis based on non-uniform unit selection, the key to improve the synthetic quality is the careful designing of measuring criteria respect to the units adopted. With our previous hierarchical non-uniform unit selection framework 1, two measurements for selecting optimal non-uniform units during searching at different layers are proposed in this paper, including inter-syllable pitch control and spectra distance by phonetic context. These measures are used as components of our cost function, especially for boundaries in front of syllables starting with voiceless consonants. Experiment shows it outperforms our previous system.

Jun Xu Dezhi Huang Yuan Dong Lianhong Cai Haila Wang

Department of Computer Science and Technology,Tsinghua University, Beijing, China Speech and Natural Language Processing Unit, France Telecom R&D Beijing, China Department of Computer Science and Technology, Tsinghua University, Beijing, China

国际会议

2008 International Conference on Audio,Language and Image Processing(2008国际声音、语言、图像过程大会)

镇江

英文

1610-1614

2008-07-07(万方平台首次上网日期,不代表论文的发表时间)