The Detection of Similar Instance Based on Fingerprint
Detecting similar instance is a research hot spot of Example-Based Machine Translation.The method of Vector Space Model is one of the mainstream detection methods.However,there are two disadvantages for it: detection speed is very slow and synonym substitution is not accurate.To solve these problems,fingerprint retrieval algorithm is introduced to improve the detection speed.A concept of replacement cost is put forward to measure the accuracy of substitution between synonyms.The result shows that this method can not only improve the detection speed but also produce a certain improvement to the accuracy of the similarity calculation.
EBMT Simhash Fingerprint Similarity Replacement cost
Cai YiHao Xu Dong
No.88, Wenhua Dong road, Lixia District, Jinan, Shandong Province, P.R.China
国际会议
重庆
英文
711-714
2015-03-21(万方平台首次上网日期,不代表论文的发表时间)