Plagiarism Detection Algorithm for Source Code in Computer Science Education

被引:5
|
作者
Liu, Xin [1 ]
Xu, Chan [1 ]
Ouyang, Boyu [1 ]
机构
[1] Xiangtan Univ, Coll Informat Engn, Xiangtan, Peoples R China
关键词
Code Denoising; Coding Standardize; Improved LCS Algorithm; Longest Common Subsequence; Plagiarism Detection Algorithm;
D O I
10.4018/IJDET.2015100102
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Nowadays, computer programming is getting more necessary in the course of program design in college education. However, the trick of plagiarizing plus a little modification exists among some students' home works. It's not easy for teachers to judge if there's plagiarizing in source code or not. Traditional detection algorithms cannot fit this condition. The author designed an effective and complete method to detect source code plagiarizing according to the popular way of students' plagiarizing. There are two basic concepts of the algorithm. One is to standardize the source code via filtration against to remove the majority noises intentionally blended by plagiarists. The other one is an improved Longest Common Subsequence algorithm for text matching, using statement as the unit for matching. The authors also designed an appropriate HASH function to increase the efficiency of matching. Based on the algorithm, a system was designed and proved to be practical and sufficient, which runs well and meet the practical requirement in application.
引用
收藏
页码:29 / 39
页数:11
相关论文
共 50 条
  • [1] Benefits and drawbacks of source code plagiarism detection in engineering education
    Pawelczak, Dieter
    [J]. PROCEEDINGS OF 2018 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON) - EMERGING TRENDS AND CHALLENGES OF ENGINEERING EDUCATION, 2018, : 1048 - 1056
  • [2] Automatic Source Code Plagiarism Detection
    Kustanto, Cynthia
    Liem, Inggriani
    [J]. SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGS, 2009, : 481 - 486
  • [3] Source Code Representations for Plagiarism Detection
    Duracik, Michal
    Krsak, Emil
    Hrkut, Patrik
    [J]. LEARNING TECHNOLOGY FOR EDUCATION CHALLENGES, LTEC 2018, 2018, 870 : 61 - 69
  • [4] Detecting Computer Code Plagiarism in Higher Education
    Konecki, Mario
    Orehovacki, Tihomir
    Lovrencic, Alen
    [J]. PROCEEDINGS OF THE ITI 2009 31ST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2009, : 409 - 414
  • [5] Source code plagiarism detection: The Unix way
    Petrik, Juraj
    Chuda, Daniela
    Steinmuller, Branislav
    [J]. 2017 IEEE 15TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI), 2017, : 467 - 471
  • [6] VisualCues: Visually Explaining Source Code In Computer Science Education
    Biegel, Benjamin
    Baltes, Sebastian
    Prevos, Bob
    Diehl, Stephan
    [J]. PROCEEDINGS 2015 IEEE SYMPOSIUM ON VISUAL LANGUAGES AND HUMAN-CENTRIC COMPUTING (VL/HCC), 2015, : 89 - 93
  • [7] A State of Art on Source Code Plagiarism Detection
    Agrawal, Mayank
    Sharma, Dilip Kumar
    [J]. PROCEEDINGS ON 2016 2ND INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2016, : 236 - 241
  • [8] A Source Code Similarity System for Plagiarism Detection
    Duric, Zoran
    Gasevic, Dragan
    [J]. COMPUTER JOURNAL, 2013, 56 (01): : 70 - 86
  • [9] Scalable Source Code Plagiarism Detection Using Source Code Vectors Clustering
    Duracik, Michal
    Krsak, Emil
    Hrkut, Patrik
    [J]. PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 499 - 502
  • [10] Source Code Plagiarism
    Sraka, Dejan
    Kaucic, Branko
    [J]. PROCEEDINGS OF THE ITI 2009 31ST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2009, : 461 - 466