A Detection Algorithm for the Illegal Coping of Chinese Theses
Copy detection technology for documents can automatically detect overlap information among digital documents. It is a powerful tool to protect intellectual property and improve the efficiency of information retrieval. This thesis presents a set of algorithm for copy detections and its mathematical model to solve the copy detections of Chinese theses. On the document-structure-based analysis, we apply word-frequency technology to design a set of algorithm to identify the illegal coping of Chinese theses.
copy detection vector space model feature extraction text similarity
Yang Huanhai
Shandong Institute of Business and Technology, Yantai, 264005, China
国际会议
烟台
英文
157-160
2010-08-06(万方平台首次上网日期,不代表论文的发表时间)