Source Code Representations for Plagiarism Detection

被引:1
|
作者
Duracik, Michal [1 ]
Krsak, Emil [1 ]
Hrkut, Patrik [1 ]
机构
[1] Univ Zilina, Fac Management Sci & Informat, Univ 8215-1, Zilina 01026, Slovakia
关键词
Source code; Representations; Hash; Characteristic vector;
D O I
10.1007/978-3-319-95522-3_6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
At the present time the plagiarism is a growing problem due to a lot of easily accessible resources, and many papers deal with this topic. New algorithms are constantly being created, but there are not currently manny of systems, that we could use for plagiarism detection. Our aim is to explore plagiarism on a large scale. This paper focuses on selecting the appropriate representation of the source code, that is very important when searching for plagiarism. There is an overview of the current representation possibilities. We focus on representation source code using AST. Comparison of the tree structures is time-consuming operation. We will try to find how effectively represent AST in order to facilitate comparison. There are two ways to represent AST. Representation by hashing or using characteristic vectors. We present the experiment and results on which we choose the appropriate form of the representation.
引用
收藏
页码:61 / 69
页数:9
相关论文
共 50 条
  • [1] Automatic Source Code Plagiarism Detection
    Kustanto, Cynthia
    Liem, Inggriani
    [J]. SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGS, 2009, : 481 - 486
  • [2] Source code plagiarism detection: The Unix way
    Petrik, Juraj
    Chuda, Daniela
    Steinmuller, Branislav
    [J]. 2017 IEEE 15TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI), 2017, : 467 - 471
  • [3] A State of Art on Source Code Plagiarism Detection
    Agrawal, Mayank
    Sharma, Dilip Kumar
    [J]. PROCEEDINGS ON 2016 2ND INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2016, : 236 - 241
  • [4] A Source Code Similarity System for Plagiarism Detection
    Duric, Zoran
    Gasevic, Dragan
    [J]. COMPUTER JOURNAL, 2013, 56 (01): : 70 - 86
  • [5] Scalable Source Code Plagiarism Detection Using Source Code Vectors Clustering
    Duracik, Michal
    Krsak, Emil
    Hrkut, Patrik
    [J]. PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 499 - 502
  • [6] Source Code Plagiarism
    Sraka, Dejan
    Kaucic, Branko
    [J]. PROCEEDINGS OF THE ITI 2009 31ST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2009, : 461 - 466
  • [7] Improving Source Code Plagiarism Detection: Lessons Learned
    Misic, Marko J.
    Protic, Jelica Z.
    Tomasevic, Milo V.
    [J]. 2017 25TH TELECOMMUNICATION FORUM (TELFOR), 2017, : 856 - 863
  • [8] CPDP: A Robust Technique for Plagiarism Detection in Source Code
    Muddu, Basavaraju
    Asadullah, Allahbaksh
    Bhat, Vasudev
    [J]. 2013 7TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC), 2013, : 39 - 45
  • [9] A SOURCE CODE AND NON-SOURCE CODE PLAGIARISM DETECTION RESEARCH FOR C PROGRAM
    Zhong Mei
    Li Yanchen
    Liu Dongsheng
    [J]. 2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012, : 543 - 547
  • [10] Review of source-code plagiarism detection in academia
    Novak, Matija
    [J]. 2016 39TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2016, : 796 - 801