会议专题

Uyghur Morpheme-based Language Models and ASR

Uyghur language is an agglutinative language in which words are formed by suffixes attaching to a stem (or root). Because of the explosive nature in vocabulary of the agglutinative languages, several morpheme-based language models are built and experiments are implemented. Morpheme is the smallest meaning bearing unit. In this research, morpheme is referred to any of prefix, stem, or suffix. As a result, a large vocabulary ASR system is built on the basis of Julius system. Several ASR results on language models based on different units (word, morpheme, and syllable) are compared.

Uyghur morpheme segmenter language modeling ASR

Mijit Ablimit Graham Neubig Masato Mimura Shinsuke Mori Tatsuya Kawahara Askar Hamdulla

School of Informatics, Kyoto University, Kyoto, Japan School of Informatics, Xinjiang University, Kyoto, Japan

国际会议

2010 IEEE 10th International Conference on Signal Processing(第十届信号处理国际会议 ICSP 2010)

北京

英文

581-584

2010-08-24(万方平台首次上网日期,不代表论文的发表时间)