Study on the Method of Text Copy Detection Based on Language Cadence
Abstract-An analysis of the language cadence of text shows that every paragraph can be distinguished by its language cadence. A text copy detection method based on language cadence is therefore proposed. Because punctuation marks delimit the basic language cadence, a unique language cadence code can be created for every paragraph in a document and used for duplicate detection. By comparing language cadence codes, copy detection can be carried out at the paragraph level and detection precision can be improved.
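The abstract does not give the exact encoding, so the following is only a minimal sketch of the idea under stated assumptions: a paragraph's cadence code is taken to be the sequence of (segment length, punctuation mark) pairs, and two codes are compared with a generic sequence-similarity ratio. The punctuation set, the function names `cadence_code` and `cadence_similarity`, and the use of `difflib.SequenceMatcher` are all illustrative choices, not the authors' method.

```python
from difflib import SequenceMatcher
from typing import List, Tuple

# Assumed punctuation set (ASCII and common full-width marks);
# the abstract does not list the marks actually used.
PUNCTUATION = ",.;:!?，。；：！？、"


def cadence_code(paragraph: str) -> List[Tuple[int, str]]:
    """Encode a paragraph as (segment length, punctuation mark) pairs.

    Segment length is the number of non-whitespace characters since the
    previous mark. Text after the last mark is ignored in this sketch.
    """
    code = []
    length = 0
    for ch in paragraph:
        if ch in PUNCTUATION:
            code.append((length, ch))
            length = 0
        elif not ch.isspace():
            length += 1
    return code


def cadence_similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] between the cadence codes of two paragraphs."""
    return SequenceMatcher(None, cadence_code(a), cadence_code(b)).ratio()
```

Very short paragraphs can share a code even when their wording differs (for example, `"Hi, you."` and `"No, sir."` under this encoding), so a sketch like this would only be meaningful on paragraphs with enough punctuation to make the code distinctive.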
Keywords: text copy detection; language cadence; punctuation
Chen Fan, Feng Zhiyong, Zhao Geng
School of Computer Science and Technology, Tianjin University Information Science & Technology Depar School of Computer Science and Technology, Tianjin University Hebei University of Technology, Tianjin, China
International conference
Harbin
English
140-143
2011-01-18 (date first made available online on the Wanfang platform; not the paper's publication date)