Automatic generation of plagiarism detection among student programs

被引:0
|
作者
Roxas, Rachel Edita [1 ]
Lim, Nathalie Rose [1 ]
Bautista, Natasja [1 ]
机构
[1] De La Salle Univ, Coll Comp Studies, Software Technol Dept, Manila 1004, Philippines
关键词
plagiarism detection; transformations; similarity measures;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A system for the automatic generation of plagiarism detectors that rind similar programs in a set of student programs is presented. Existing plagiarism detectors are either applied to a programming language or a pre-defined set of programming languages. The general purpose one usually employs string matching to perform similarity measures that are based on plagiarism detection among documents in general, and not in programs in particular, thus, losing much of the structure and logic of programs in the process. On the other hand, plagiarism detectors for specific languages only cater to that particular set of languages. This study provides a means for the user to specify the programming language of the student programs to be analyzed. Moreover, an automatic plagiarism detector system must be immune to the transformations that students perform on copied programs. These transformations are usually dependent on several factors namely: the type of programming problems and correspondingly, the complexity of the project to be implemented by the students, and also the programming language paradigm of the programs. Thus, the similarity measures employed by the system should be determined by these factors and can be specified by the professor. He/she has the option to specify how the similarities among the student programs will be captured. The system provides an interface for the specification of the particular programming language in which the student programs are implemented, and a knowledge-base of similarity measures that the user would like to include in the analysis of the student programs. Hence, the system provides flexibility in the programming language of the student programs to be analyzed and the similarity measures that the professor wishes to employ. Initial qualitative and quantitative evaluations illustrate a flexible, convenient and cost-effective tool for building plagiarism detectors for effective detection of programs in various imperative and procedural programming languages. The approach also addresses some of the changes that students perform on copied programs which JPlag fails to handle, thus, allowing for improved accuracy in terms of the reduction of false-positives, increasing the chance of catching plagiarized programs. These changes include modification of control structures, use of temporary variables and sub-expressions, in-lining and re-factoring of methods, and redundancy (variables or methods that were not used). Comprehensive tests on other programming languages under various programming language paradigms such as object-oriented, logic and functional languages, considering the different changes that the students employ to copied programs (such as the tests done in JPlag) are also recommended for empirical evaluation.
引用
收藏
页码:244 / 253
页数:10
相关论文
共 50 条
  • [1] AUTOMATIC STUDENT PLAGIARISM DETECTION: FUTURE PERSPECTIVES
    Mozgovoy, Maxim
    Kakkonen, Tuomo
    Cosma, Georgina
    [J]. JOURNAL OF EDUCATIONAL COMPUTING RESEARCH, 2010, 43 (04) : 511 - 531
  • [2] Plagiarism Meets Paraphrasing: Insights for the Next Generation in Automatic Plagiarism Detection
    Barron-Cedeno, Alberto
    Vila, Marta
    Antonia Marti, M.
    Rosso, Paolo
    [J]. COMPUTATIONAL LINGUISTICS, 2013, 39 (04) : 917 - 948
  • [3] Automatic Generation of Summary Obfuscation Corpus for Plagiarism Detection
    Miranda-Jimenez, Sabino
    Stamatatos, Efstathios
    [J]. ACTA POLYTECHNICA HUNGARICA, 2017, 14 (03) : 99 - 112
  • [4] DETECTING PLAGIARISM IN STUDENT PASCAL PROGRAMS
    JANKOWITZ, HT
    [J]. COMPUTER JOURNAL, 1988, 31 (01): : 1 - 8
  • [5] Automatic Generation of Benchmarks for Plagiarism Detection Tools using Grammatical Evolution
    Cebrian, Manuel
    Alfonseca, Manuel
    Ortega, Alfonso
    [J]. GECCO 2007: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2007, : 2253 - 2253
  • [6] Plagiarism Detection in SQL Student Assignments
    Scerbakov, Nikolai
    Schukin, Alexander
    Sabinin, Oleg
    [J]. TEACHING AND LEARNING IN A DIGITAL WORLD, 2018, 716 : 110 - 115
  • [7] Electronic detection of plagiarism in student work
    Barlow, J
    Bean, L
    Hott, DD
    [J]. 7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XVI, PROCEEDINGS: SYSTEMICS AND INFORMATION SYSTEMS, TECHNOLOGIES AND APPLICATION, 2003, : 26 - 29
  • [8] Automatic plagiarism detection in obfuscated text
    Alaa Saleh Altheneyan
    Mohamed El Bachir Menai
    [J]. Pattern Analysis and Applications, 2020, 23 : 1627 - 1650
  • [9] Automatic plagiarism detection in obfuscated text
    Altheneyan, Alaa Saleh
    Menai, Mohamed El Bachir
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (04) : 1627 - 1650
  • [10] Automatic Source Code Plagiarism Detection
    Kustanto, Cynthia
    Liem, Inggriani
    [J]. SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGS, 2009, : 481 - 486