Academic Source Code Plagiarism Detection by Measuring Program Behavioral Similarity

被引:22
|
作者
Cheers, Hayden [1 ]
Lin, Yuqing [1 ]
Smith, Shamus P. [1 ]
机构
[1] Univ Newcastle, Sch Elect Engn & Comp, Callaghan, NSW 2308, Australia
关键词
Plagiarism; Tools; Semantics; !text type='Java']Java[!/text; Robustness; Taxonomy; Task analysis; Source code plagiarism detection; behavioral similarity; source code similarity; SOFTWARE; SEMANTICS; GRAPH;
D O I
10.1109/ACCESS.2021.3069367
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Source code plagiarism is a long-standing issue in tertiary computer science education. Many source code plagiarism detection tools have been proposed to aid in the detection of source code plagiarism. However, existing detection tools are not robust to pervasive plagiarism-hiding transformations and can be inaccurate in the detection of plagiarised source code. This article presents BPlag, a behavioural approach to source code plagiarism detection. BPlag is designed to be both robust to pervasive plagiarism-hiding transformations and accurate in the detection of plagiarised source code. Greater robustness and accuracy is afforded by analyzing the behavior of a program, as behavior is perceived to be the least susceptible aspect of a program impacted upon by plagiarism-hiding transformations. BPlag applies symbolic execution to analyses execution behavior and represents a program in a novel graph-based format. Plagiarism is then detected by comparing these graphs and evaluating similarity scores. BPlag is evaluated for robustness, accuracy and efficiency against five commonly used source code plagiarism detection tools. It is then shown that BPlag is more robust to plagiarism-hiding transformations and more accurate in the detection of plagiarised source code, but is less efficient than the compared tools.
引用
收藏
页码:50391 / 50412
页数:22
相关论文
共 50 条
  • [1] A Source Code Similarity System for Plagiarism Detection
    Duric, Zoran
    Gasevic, Dragan
    [J]. COMPUTER JOURNAL, 2013, 56 (01): : 70 - 86
  • [2] A SOURCE CODE AND NON-SOURCE CODE PLAGIARISM DETECTION RESEARCH FOR C PROGRAM
    Zhong Mei
    Li Yanchen
    Liu Dongsheng
    [J]. 2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012, : 543 - 547
  • [3] A NEW SIMILARITY MEASURE FOR IN-CLASS SOURCE CODE PLAGIARISM DETECTION
    Ohno, Asako
    Murao, Hajime
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (11B): : 4237 - 4247
  • [4] Semantic Similarity Search for Source Code Plagiarism Detection: An Exploratory Study
    Ebrahim, Fahad
    Joy, Mike
    [J]. PROCEEDINGS OF THE 2024 CONFERENCE INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, VOL 1, ITICSE 2024, 2024, : 360 - 366
  • [5] Source Code Plagiarism Detection Using Biological String Similarity Algorithms
    Rahal, Imad
    Wielga, Colin
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2014, 13 (03)
  • [6] The Study of Plagiarism Detection for Program Code
    Jiang, Hao
    Jiang, Zhemin
    [J]. ADVANCES IN COMPUTER SCIENCE AND EDUCATION APPLICATIONS, PT II, 2011, 202 : 128 - 133
  • [7] Automatic Source Code Plagiarism Detection
    Kustanto, Cynthia
    Liem, Inggriani
    [J]. SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGS, 2009, : 481 - 486
  • [8] Source Code Representations for Plagiarism Detection
    Duracik, Michal
    Krsak, Emil
    Hrkut, Patrik
    [J]. LEARNING TECHNOLOGY FOR EDUCATION CHALLENGES, LTEC 2018, 2018, 870 : 61 - 69
  • [9] Review of Code Similarity and Plagiarism Detection Research Studies
    Lee, Gunwoo
    Kim, Jindae
    Choi, Myung-seok
    Jang, Rae-Young
    Lee, Ryong
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [10] Source code plagiarism detection: The Unix way
    Petrik, Juraj
    Chuda, Daniela
    Steinmuller, Branislav
    [J]. 2017 IEEE 15TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI), 2017, : 467 - 471