会议专题

CODE COMPARISON SYSTEM BASED ON ABSTRACT SYNTAX TREE

The code comparison technology plays a very important part in the work of plagiarism detection and software evaluation. Software plagiarism mainly appears as copy-and-paste or with a little modification after this, which will not change the function of the code, such as replacing the name of methods or variables, reordering the sequence of the statements etc. This paper introduces a plagiarism detection tool named CCS (Code Comparison System) which is based on the Abstract Syntax Tree (AST). According to the syntax tree’s characteristics, CCS calculates their hash values, transforms their storage forms, and then compares them node by node. As a result, the efficiency improves. Moreover, CCS preprocesses a large amount of source code in its database for potential use, which also accelerate the course of plagiarism detection. CCS also takes special measurement to reduce mistakes when calculating the hash values of the operations like subtraction and division. It performs well in the code comparison field, and is able to help with the copyright protecting of the source code.

code comparison code plagiarism syntax tree Hash value CCS

Baojiang Cui Jiansong Li Tao Guo Jianxin Wang Ding Ma

Beijing University of Posts and Telecommunications, Beijing, China China Information Technology Security Evaluation Center, Beijing, China Beijing Forestry University, Beijing, China

国际会议

2010 3rd IEEE International Conference on Broadband Network & Multimedia Technology(2010年第三届IEEE宽带网络与多媒体国际会议 IC-BNMT 2010)

北京

英文

668-673

2010-10-26(万方平台首次上网日期,不代表论文的发表时间)