AUTOMATIC SEGMENTATION AND LABELING BASED ON BOUNDARY TYPE FOR MANDARIN CHINESE SPEECH
This paper presents an automatic segmentation and labeling method for mandarin Chinese speech synthesis corpus. In order to improve the accuracy of segmentation, two types of HMM models are utilized to produce the INITIAL/FINAL and syllable boundaries. Three feature detection algorithms are applied to boundary refinement for speech boundaries of voiced/unvoiced/silence. Experimental results show the proposed method can improve the performance of the segmentation system significantly.
Speech segmentation HMM Phoneme boundary-type V/U/S detection Measurement of the distance between segments
PING-MU HUANG CHENG-LI SUN JUN GUO
School of Information Engineering, Beijing University of Posts and Telecommunications, China
国际会议
2008 International Conference on Machine Learning and Cybernetics(2008机器学习与控制论国际会议)
昆明
英文
2497-2502
2008-07-12(万方平台首次上网日期,不代表论文的发表时间)