PLAGIARISM DETECTION IN CHINESE BASED ON CHUNK AND PARAGRAPH WEIGHT
Aiming at the Chinese academic paper plagiarism detection, proposed chunk based plagiarism detection algorithm with chunk extraction method based on character or word. Taken account of that different part of document has different importance, proposed two paragraph weight algorithms and defined three paragraph weight functions. The best chunk lengths are determined by experiments. Experiments show that using paragraph weight can enhance the detection effect.
Plagiarism detection Tezt chunk Paragraph weight
TAO WANG XIAO-ZHONG FAN JIE LIU
Department of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
国际会议
2008 International Conference on Machine Learning and Cybernetics(2008机器学习与控制论国际会议)
昆明
英文
2574-2579
2008-07-12(万方平台首次上网日期,不代表论文的发表时间)