Suffix Tree Based Plagiarism Detection Method for C Code
In the field of computer-aided instruction,plagiarism in homework and tests is widespread To ensure teaching qulity of programming and to ensure fairness and in tests,we need an efficient method of code plagiarism detection. Here,we propose a method based on suffix tree for C code similarity detection focus on common plagiarism means. Firstly,take advantage of open source tool —ANTLR generated C language lexical and syntax analyzer,do lexical and syntax analysis with C code and generated suffixjtree of c code,secondly ,Optimize syntax tree and then generate the sequence of suffix tree . Finally,we use the improved algorithm GST (Greedy-String-Tiling) string matching algorithm to match the sequence of suffix tree and calculated similarity.
Plagiraism detection Code similarity Suffix tree ANTLR String matching
Li Yanchen Liu Dongsheng
Computer and Information Engineering College,Inner Mongolia Normal University (IMNU)Hohhot 010022 Ch Computer and Information Engineering College,Inner Mongolia Normal University (IMNU) Hohhot 010022 C
国际会议
南宁
英文
210-213
2010-12-10(万方平台首次上网日期,不代表论文的发表时间)