会议专题

Real-Word Spelling Correction using GoogleWeb 1T n-gram with Backoff

We present a method for correcting real-word spelling errors using the GoogleWeb 1T n-gram data set and a normalized and modi ed version of the Longest Common Subsequence (LCS) string matching algo- rithm. Our method is focused mainly on how to improve the correction recall (the fraction of er- rors corrected) while keeping the correction preci- sion (the fraction of suggestions that are correct) as high as possible. Evaluation results on a standard data set show that our method performs very well.

Real-word spelling correction Google web 1T n-gram

Aminul ISLAM Diana INKPEN

Department of Computer Science, SITE University of Ottawa Ottawa, ON, Canada

国际会议

International Conference on Natural Language Processing and Knowledge Engineering(IEEE自然语言处理与知识工程国际会议 IEEE NLP-KE 2009)

大连

英文

1-8

2009-09-24(万方平台首次上网日期,不代表论文的发表时间)