Version History Based Source Code Plagiarism Detection in Proprietary Systems

Cited by: 0
Authors
Maskeri, Girish [1]
Karnam, Deepthi [1]
Viswanathan, Sree Aurovindh [1]
Padmanabhuni, Srinivas [1]
Affiliation
[1] Infosys Ltd, Infosys Labs, Bangalore, Karnataka, India
Keywords
Plagiarism; Author Information; Version History; CCFINDER;
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
While the advent of open source code search tools has made the source code of thousands of open source software (OSS) projects readily accessible, thereby increasing legitimate reuse, it has also opened up the possibility of unconscientious employees plagiarizing code from OSS repositories. Plagiarism in proprietary software would not only lead to costly lawsuits, but also undermine the credibility of the organization. Hence, detecting plagiarism in proprietary software is an urgent need. Though a number of techniques exist for detecting plagiarism in student project assignments, they do not scale well to large proprietary software, especially when code snippets are plagiarized from the large number of available open source projects. In this paper we propose a novel approach that applies Mining Software Repositories (MSR) based techniques to the problem of plagiarism detection. We create a programming style profile for each maintenance engineer by mining the version history and use that profile to detect source code commits that are likely to be plagiarized. Such suspected code fragments can then be analyzed using any of the existing plagiarism detection techniques to confirm the plagiarism and ascertain the original code.
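The profile-then-flag idea in the abstract can be illustrated with a minimal sketch. The abstract does not say which style features or decision rule the authors use, so everything below is an assumption made for illustration: the three features (average line length, comment-line ratio, camelCase share of identifiers), the z-score rule, the threshold of 3.0, the `min_sd` floor, and all function names are hypothetical.

```python
import re
from statistics import mean, pstdev


def style_features(code: str) -> list[float]:
    """Map a code fragment to a small, hypothetical style vector."""
    lines = [ln for ln in code.splitlines() if ln.strip()]
    if not lines:
        return [0.0, 0.0, 0.5]
    avg_len = mean([len(ln) for ln in lines])
    comments = sum(ln.lstrip().startswith(("//", "#", "/*")) for ln in lines)
    comment_ratio = comments / len(lines)
    idents = re.findall(r"[A-Za-z_][A-Za-z0-9_]*", code)
    camel = sum(1 for i in idents if re.search(r"[a-z][A-Z]", i))
    snake = sum(1 for i in idents if "_" in i.strip("_"))
    # Share of camelCase among identifiers that reveal a naming style.
    camel_ratio = camel / (camel + snake) if camel + snake else 0.5
    return [avg_len, comment_ratio, camel_ratio]


def build_profile(history: list[str]) -> list[tuple[float, float]]:
    """Per-feature (mean, std) over one engineer's past commits."""
    vectors = [style_features(code) for code in history]
    return [(mean(col), pstdev(col)) for col in zip(*vectors)]


def deviation(profile, code: str, min_sd: float = 0.05) -> float:
    """Largest absolute z-score of the commit against the profile."""
    feats = style_features(code)
    return max(abs(f - mu) / max(sd, min_sd)
               for f, (mu, sd) in zip(feats, profile))


def is_suspicious(profile, code: str, threshold: float = 3.0) -> bool:
    """Flag a commit whose style departs strongly from the profile."""
    return deviation(profile, code) > threshold
```

A commit flagged this way is only a candidate: as the abstract notes, it would still need to be passed to an existing plagiarism detection technique to confirm the plagiarism and identify the original code.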
Pages: 609 - 612
Page count: 4
Related papers
50 in total
  • [11] Improving Source Code Plagiarism Detection: Lessons Learned
    Misic, Marko J.
    Protic, Jelica Z.
    Tomasevic, Milo V.
    [J]. 2017 25TH TELECOMMUNICATION FORUM (TELFOR), 2017, : 856 - 863
  • [12] CPDP: A Robust Technique for Plagiarism Detection in Source Code
    Muddu, Basavaraju
    Asadullah, Allahbaksh
    Bhat, Vasudev
    [J]. 2013 7TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC), 2013, : 39 - 45
  • [13] A SOURCE CODE AND NON-SOURCE CODE PLAGIARISM DETECTION RESEARCH FOR C PROGRAM
    Zhong Mei
    Li Yanchen
    Liu Dongsheng
    [J]. 2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012, : 543 - 547
  • [14] Review of source-code plagiarism detection in academia
    Novak, Matija
    [J]. 2016 39TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2016, : 796 - 801
  • [15] Using graph databases in source code plagiarism detection
    Novak, Matija
    Levak, Iva
    [J]. CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS, CECIIS 2022, 2022, : 465 - 470
  • [16] SOURCE CODE PLAGIARISM DETECTION METHOD USING ONTOLOGIES
    Smeureanu, Ion
    Iancu, Bogdan
    [J]. INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY, 2013, : 594 - 597
  • [17] The Source Code Plagiarism Detection based on Function Sub-string Matching
    Xiao JingZhong
    Xiao Li
    [J]. 2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 1, 2011, : 397 - 400
  • [18] Efficient clustering-based source code plagiarism detection using PIY
    Tony Ohmann
    Imad Rahal
    [J]. Knowledge and Information Systems, 2015, 43 : 445 - 472
  • [19] Efficient clustering-based source code plagiarism detection using PIY
    Ohmann, Tony
    Rahal, Imad
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 43 (02) : 445 - 472
  • [20] Design Patterns based Pre-processing of Source Code for Plagiarism Detection
    Asadullah, Allahbaksh
    Basavaraju, M.
    Stern, Ilan
    Bhat, Vasudev D.
    [J]. 2012 19TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE WORKSHOPS (APSECW), VOL. 2, 2012, : 128 - 135