Real-Word Spelling Correction using GoogleWeb 1T n-gram with Backoff
We present a method for correcting real-word spelling errors using the GoogleWeb 1T n-gram data set and a normalized and modied version of the Longest Common Subsequence (LCS) string matching algo- rithm. Our method is focused mainly on how to improve the correction recall (the fraction of er- rors corrected) while keeping the correction preci- sion (the fraction of suggestions that are correct) as high as possible. Evaluation results on a standard data set show that our method performs very well.
Real-word spelling correction Google web 1T n-gram
Aminul ISLAM Diana INKPEN
Department of Computer Science, SITE University of Ottawa Ottawa, ON, Canada
国际会议
大连
英文
1-8
2009-09-24(万方平台首次上网日期,不代表论文的发表时间)