Version History Based Source Code Plagiarism Detection in Proprietary Systems

Cited by: 0
Authors
Maskeri, Girish [1]
Karnam, Deepthi [1]
Viswanathan, Sree Aurovindh [1]
Padmanabhuni, Srinivas [1]
Affiliations
[1] Infosys Ltd, Infosys Labs, Bangalore, Karnataka, India
Keywords
Plagiarism; Author Information; Version History; CCFINDER
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
While the advent of open source code search tools has made the source code of thousands of open source software (OSS) projects readily accessible, thereby increasing legitimate reuse, it has also opened up the possibility of unconscientious employees plagiarizing code from OSS repositories. Plagiarism in proprietary software can not only lead to costly lawsuits but also undermine the credibility of the organization. Hence, detecting plagiarism in proprietary software is an urgent need. Although a number of techniques exist for detecting plagiarism in student project assignments, they do not scale well to large proprietary software, especially when code snippets are plagiarized from the large number of available open source projects. In this paper, we propose a novel approach that applies Mining Software Repositories (MSR) techniques to the problem of plagiarism detection. We create a programming style profile for each maintenance engineer by mining the version history and use it to detect source code commits that are likely to be plagiarized. Such suspected code fragments can then be analyzed using any existing plagiarism detection technique to confirm the plagiarism and identify the original code.
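The abstract does not say which style features or thresholds the authors use, so the following is only a minimal sketch of the general idea, under stated assumptions: each engineer's past commits are reduced to a few hypothetical style features (average line length, share of tab-indented lines, share of comment lines, share of identifiers containing an underscore), the per-feature mean and standard deviation form the profile, and a new commit is flagged when its mean absolute z-score exceeds an arbitrary threshold. The function names, the features, the 0.05 floor, and the 2.0 threshold are all illustrative and are not taken from the paper.

```python
import re
from statistics import mean, pstdev

# Hypothetical identifier pattern; real tooling would use a language-aware lexer.
IDENT = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def style_features(code):
    """Return illustrative style features for one commit (assumed, not the paper's)."""
    lines = [ln for ln in code.splitlines() if ln.strip()]
    if not lines:
        return [0.0, 0.0, 0.0, 0.0]
    idents = IDENT.findall(code)
    n_idents = max(len(idents), 1)
    comment_marks = ("//", "#", "/*", "*")
    return [
        mean(len(ln) for ln in lines),                                     # average line length
        sum(ln.startswith("\t") for ln in lines) / len(lines),             # tab-indented share
        sum(ln.lstrip().startswith(comment_marks) for ln in lines) / len(lines),  # comment share
        sum("_" in name for name in idents) / n_idents,                    # underscore-name share
    ]


def build_profile(history):
    """Summarize an engineer's past commits as (mean, std) per feature."""
    feats = [style_features(code) for code in history]
    return [(mean(col), pstdev(col)) for col in zip(*feats)]


def deviation(profile, code, floor=0.05):
    """Mean absolute z-score of a commit against a profile.

    The floor keeps a zero-variance feature from dividing by zero (an assumption).
    """
    feats = style_features(code)
    return mean(abs(f - mu) / max(sd, floor) for f, (mu, sd) in zip(feats, profile))


def is_suspicious(profile, code, threshold=2.0):
    """Flag a commit whose style deviates strongly from the author's profile."""
    return deviation(profile, code) > threshold
```

In this sketch, a commit flagged by `is_suspicious` would only be a candidate; as the abstract notes, confirmation is left to an existing plagiarism detection technique run on the suspected fragment.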
Pages: 609-612
Page count: 4