A Novel Interpolated N-gram Language Model Based on Class Hierarchy
In this paper, we propose a novel interpolated language model that combines the interpolation and the backing-off along hierarchical classes based on class hierarchy. And the corresponding approach to the estimation of interpolation coefficients is also presented. We use the Minimum Discriminative Information (MDI) method to cluster the vocabulary into a word-clustering tree hierarchically. The tree is used to balance the generalization ability of classes’ and word specificity when estimating the likelihood of a n-gram event. Experiments are performed on Reuter’s corpus using a vocabulary of 27,000 words. Results show a reduction on the test perplexity over the standard Modified KN n-gram approach by 12%.
Language model class hierarchy cluster interpolate back-off
Zhenyu Lv Wenju Liu Zhanlei Yang
Institute of Automation Chinese Academy of Sciences Beijing, China
国际会议
大连
英文
1-5
2009-09-24(万方平台首次上网日期,不代表论文的发表时间)