The Phoneme Automatic Segmentation Algorithms Study of Tibetan Lhasa Words Continuous Speech Stream
In this paper, we adopt two methods for phoneme segmentation of speech when building a Tibetan corpus: one is the traditional manual segmentation method, and the other is an automatic segmentation method based on a monophone HMM model. Experiments are performed to analyze the segmentation accuracy of both methods. The results show that the automatic segmentation method based on the monophone HMM model helps shorten the cycle of building a Tibetan corpus; especially when segmenting and labeling a large corpus, it saves considerable time and manpower, and it greatly improves the accuracy and consistency of the annotation information in the speech corpus.
Tibetan Corpus Phoneme automatic segmentation
ZHANG Jin-xi YU Hong-zhi MA Ning LI Zhao-yao
Key Lab of China's National Linguistic Information Technology, Northwest University for Nationalities, Lanzhou, 730030, China
International conference
2013 2nd International Conference on Systems Engineering and Modeling (ICSEM-13)
Beijing
English
574-577
2013-04-19 (date first posted online on the Wanfang platform; not the paper's publication date)