Source Code Representations for Plagiarism Detection

被引:1
|
作者
Duracik, Michal [1 ]
Krsak, Emil [1 ]
Hrkut, Patrik [1 ]
机构
[1] Univ Zilina, Fac Management Sci & Informat, Univ 8215-1, Zilina 01026, Slovakia
关键词
Source code; Representations; Hash; Characteristic vector;
D O I
10.1007/978-3-319-95522-3_6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
At the present time the plagiarism is a growing problem due to a lot of easily accessible resources, and many papers deal with this topic. New algorithms are constantly being created, but there are not currently manny of systems, that we could use for plagiarism detection. Our aim is to explore plagiarism on a large scale. This paper focuses on selecting the appropriate representation of the source code, that is very important when searching for plagiarism. There is an overview of the current representation possibilities. We focus on representation source code using AST. Comparison of the tree structures is time-consuming operation. We will try to find how effectively represent AST in order to facilitate comparison. There are two ways to represent AST. Representation by hashing or using characteristic vectors. We present the experiment and results on which we choose the appropriate form of the representation.
引用
收藏
页码:61 / 69
页数:9
相关论文
共 50 条
  • [21] Classification feature sets for source code plagiarism detection in Java
    Hosam E.
    Hadhoud M.
    Atiya A.
    Fayek M.
    [J]. Journal of Engineering and Applied Science, 2022, 69 (1):
  • [22] Benefits and drawbacks of source code plagiarism detection in engineering education
    Pawelczak, Dieter
    [J]. PROCEEDINGS OF 2018 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON) - EMERGING TRENDS AND CHALLENGES OF ENGINEERING EDUCATION, 2018, : 1048 - 1056
  • [23] Dolos: Language-agnostic plagiarism detection in source code
    Maertens, Rien
    Van Petegem, Charlotte
    Strijbol, Niko
    Baeyens, Toon
    Jacobs, Arne Carla
    Dawyndt, Peter
    Mesuere, Bart
    [J]. JOURNAL OF COMPUTER ASSISTED LEARNING, 2022, 38 (04) : 1046 - 1061
  • [24] Software Source Code Plagiarism and Direction Detection Based on PDG
    Shu, Bo
    Du, Xiaojun
    [J]. MECHATRONICS, ROBOTICS AND AUTOMATION, PTS 1-3, 2013, 373-375 : 1172 - 1177
  • [25] Material Survey on Source Code Plagiarism Detection in Programming Courses
    Alexandra-Cristina, Cimpeanu
    Olteanu, Alexandru
    [J]. 2022 INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2022), 2022, : 387 - 389
  • [26] Evaluating the robustness of source code plagiarism detection tools to pervasive plagiarism-hiding modifications
    Hayden Cheers
    Yuqing Lin
    Shamus P. Smith
    [J]. Empirical Software Engineering, 2021, 26
  • [27] Evaluating the robustness of source code plagiarism detection tools to pervasive plagiarism-hiding modifications
    Cheers, Hayden
    Lin, Yuqing
    Smith, Shamus P.
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2021, 26 (05)
  • [28] A NEW SIMILARITY MEASURE FOR IN-CLASS SOURCE CODE PLAGIARISM DETECTION
    Ohno, Asako
    Murao, Hajime
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (11B): : 4237 - 4247
  • [29] Semantic Similarity Search for Source Code Plagiarism Detection: An Exploratory Study
    Ebrahim, Fahad
    Joy, Mike
    [J]. PROCEEDINGS OF THE 2024 CONFERENCE INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, VOL 1, ITICSE 2024, 2024, : 360 - 366
  • [30] Suitability of BlackBox Dataset for Style Analysis in Detection of Source Code Plagiarism
    Mirza, Olfat M.
    Joy, Mike
    Cosma, Georgina
    [J]. 2017 SEVENTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH 2017), 2017, : 90 - 94