A CORPUS-BASED CONCATENATIVE MANDARIN SINGING VOICE SYNTHESIS SYSTEM

摘要：

A Mandarin singing voice synthesis (SVS) system is proposed in this paper. It generates a Mandarin song of an artificial singer based on the lyric and the music score information embedded in a MIDI file of the song. To get good quality of the song, two modules are presented, i.e., the synthesis unit selection module and the prosody and amplitude modification module. In the synthesis unit selection module, the corpus that complies with the lyric and closest to the music score information is selected. Then, an adaptive filter based prosody and amplitude modification algorithms are employed on the selected synthesis units. Through the proposed method, the system can synthesis any Mandarin singing voice on-the-fly by providing it the corpus of all syllables for male and female respectively. To increase the efficiency of the system, a preprocessing is also taken on the corpus. Finally, a subjective evaluation based on MOS is taken on the system and the synthesized sounds show good quality.

关键词： Singing voice synthesis Prosody modification Amplitude modification

作者: SHU-SEN ZHOU QING-CAI CHEN DAN-DAN WANG XIAO-HONG YANG

作者单位: Shenzhen Graduate School, Harbin Institute of Technology, Shenzhen 518055, China

会议类型: 国际会议

会议名称: 2008 International Conference on Machine Learning and Cybernetics(2008机器学习与控制论国际会议)

会议地点: 昆明

会议语种:英文

页码: 2695-2699

在线出版日期: 2008-07-12（万方平台首次上网日期，不代表论文的发表时间）

会议专题

A CORPUS-BASED CONCATENATIVE MANDARIN SINGING VOICE SYNTHESIS SYSTEM