A Fuzzy R Code Similarity Detection Algorithm

被引:0
|
作者
Bartoszuk, Maciej [1 ]
Gagolewski, Marek [2 ,3 ]
机构
[1] Polish Acad Sci, Syst Res Inst, Interdisciplinary PhD Studies Program, PL-00901 Warsaw, Poland
[2] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
[3] Warsaw Univ Technol, Fac Math & Informat Sci, PL-00662 Warsaw, Poland
关键词
R; antiplagiarism detection; code cloning; fuzzy proximity relations; aggregation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
R is a programming language and software environment for performing statistical computations and applying data analysis that increasingly gains popularity among practitioners and scientists. In this paper we present a preliminary version of a system to detect pairs of similar R code blocks among a given set of routines, which bases on a proper aggregation of the output of three different [0, 1]-valued (fuzzy) proximity degree estimation algorithms. Its analysis on empirical data indicates that the system may in future be successfully applied in practice in order e. g. to detect plagiarism among students' homework submissions or to perform an analysis of code recycling or code cloning in R's open source packages repositories.
引用
收藏
页码:21 / 30
页数:10
相关论文
共 50 条
  • [31] Similarity Code File Detection Model Based on Frequent Itemsets
    Jiang, Jian-hong
    Wang, Ke
    2018 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND NETWORK TECHNOLOGY (CCNT 2018), 2018, 291 : 254 - 262
  • [32] Code similarity detection through control statement and program features
    Sudhamani, M.
    Rangarajan, Lalitha
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 132 : 63 - 75
  • [33] A method for efficient malicious code detection based on conceptual similarity
    Kim, Sungsuk
    Choi, Chang
    Choi, Junho
    Kim, Pankoo
    Kim, Hanil
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2006, PT 4, 2006, 3983 : 567 - 576
  • [34] Syntax Trees and Information Retrieval to Improve Code Similarity Detection
    Karnalim, Oscar
    Simon
    PROCEEDINGS OF THE TWENTY-SECOND AUSTRALASIAN COMPUTING EDUCATION CONFERENCE, ACE'20, 2020, : 48 - 55
  • [35] Detection of electrocardiogram beats using a fuzzy similarity index
    Uebeyli, Elif Derya
    EXPERT SYSTEMS, 2007, 24 (02) : 87 - 96
  • [36] Fuzzy Similarity Measures for Detection and Classification of Defects in CFRP
    Pellicano, Diego
    Palamara, Isabella
    Cacciola, Matteo
    Calcagno, Salvatore
    Versaci, Mario
    Morabito, Francesco Carlo
    IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2013, 60 (09) : 1917 - 1927
  • [37] Staged Method of Code Similarity Analysis for Firmware Vulnerability Detection
    Wang, Yisen
    Shen, Jianjing
    Lin, Jian
    Lou, Rui
    IEEE ACCESS, 2019, 7 : 14171 - 14185
  • [38] A community detection algorithm based on structural similarity
    Guo, Xuchao
    Hao, Xia
    Liu, Yaqiong
    Zhang, Li
    Wang, Lu
    2017 2ND INTERNATIONAL SEMINAR ON ADVANCES IN MATERIALS SCIENCE AND ENGINEERING, 2017, 231
  • [39] The Community Detection Algorithm based on Transferring the Similarity
    Niu, Dongdong
    Chen, Hongchang
    Liu, Lixiong
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 841 - 845
  • [40] A fuzzy vehicle detection based on contour size similarity
    Wu, BF
    Lin, CT
    2005 IEEE INTELLIGENT VEHICLES SYMPOSIUM PROCEEDINGS, 2005, : 496 - 501