Uyghur Morpheme-based Language Models and ASR
Uyghur language is an agglutinative language in which words are formed by suffixes attaching to a stem (or root). Because of the explosive nature in vocabulary of the agglutinative languages, several morpheme-based language models are built and experiments are implemented. Morpheme is the smallest meaning bearing unit. In this research, morpheme is referred to any of prefix, stem, or suffix. As a result, a large vocabulary ASR system is built on the basis of Julius system. Several ASR results on language models based on different units (word, morpheme, and syllable) are compared.
Uyghur morpheme segmenter language modeling ASR
Mijit Ablimit Graham Neubig Masato Mimura Shinsuke Mori Tatsuya Kawahara Askar Hamdulla
School of Informatics, Kyoto University, Kyoto, Japan School of Informatics, Xinjiang University, Kyoto, Japan
国际会议
2010 IEEE 10th International Conference on Signal Processing(第十届信号处理国际会议 ICSP 2010)
北京
英文
581-584
2010-08-24(万方平台首次上网日期,不代表论文的发表时间)