The Phoneme Automatic Segmentation Algorithms Study of Tibetan Lhasa Words Continuous Speech Stream

摘要：

　　In this paper,we adopt two methods to voice phoneme segmentation when building Tibetan corpus:One is the traditional artificial segmentation method,one is the automatic segmentation method based on the Mono prime HMM model.And experiments are performed to analyze the accuracy of both methods of segmentations.The results showed:Automatic segmentation method based tone prime HMM model helps to shorten the cycle of building Tibetan corpus,especially in building a large corpus segmentation and labeling a lot of time and manpower cost savings,and have greatly improved the accuracy and consistency of speech corpus annotation information.

关键词： Tibetan Corpus Phoneme automatic segmentation

作者: ZHANG Jin-xi YU Hong-zhi MA Ning LI Zhao-yao

作者单位: Key Lab of Chinas National Linguistic Information Technology, Northwest University for Nationalities, Lanzhou, 730030, China

会议类型: 国际会议

会议名称: 2013 2nd International Conference on Systems Engineering and Modeling(ICSEM-13)(2013年第二届系统工程与建模国际会议)

会议地点: 北京

会议语种:英文

页码: 574-577

在线出版日期: 2013-04-19（万方平台首次上网日期，不代表论文的发表时间）

会议专题

The Phoneme Automatic Segmentation Algorithms Study of Tibetan Lhasa Words Continuous Speech Stream