A Fuzzy R Code Similarity Detection Algorithm

被引:0
|
作者
Bartoszuk, Maciej [1 ]
Gagolewski, Marek [2 ,3 ]
机构
[1] Polish Acad Sci, Syst Res Inst, Interdisciplinary PhD Studies Program, PL-00901 Warsaw, Poland
[2] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
[3] Warsaw Univ Technol, Fac Math & Informat Sci, PL-00662 Warsaw, Poland
关键词
R; antiplagiarism detection; code cloning; fuzzy proximity relations; aggregation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
R is a programming language and software environment for performing statistical computations and applying data analysis that increasingly gains popularity among practitioners and scientists. In this paper we present a preliminary version of a system to detect pairs of similar R code blocks among a given set of routines, which bases on a proper aggregation of the output of three different [0, 1]-valued (fuzzy) proximity degree estimation algorithms. Its analysis on empirical data indicates that the system may in future be successfully applied in practice in order e. g. to detect plagiarism among students' homework submissions or to perform an analysis of code recycling or code cloning in R's open source packages repositories.
引用
收藏
页码:21 / 30
页数:10
相关论文
共 50 条
  • [1] Research on Algorithm of Program Code Similarity Detection
    Li Qinqin
    Zhang Chunhai
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS, ELECTRONICS AND CONTROL (ICCSEC), 2017, : 1289 - 1292
  • [2] PARALLELIZATION OF GST ALGORITHM FOR SOURCE CODE SIMILARITY DETECTION
    Misic, Marko J.
    Nikolov, Dusan V.
    Protic, Jelica Z.
    Tomasevic, Milo V.
    2016 24TH TELECOMMUNICATIONS FORUM (TELFOR), 2016, : 921 - 924
  • [3] Binary Code Similarity Detection
    Liu, Zian
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 1056 - 1060
  • [4] Choosing Code Segments to Exclude from Code Similarity Detection
    Simon
    Karnalim, Oscar
    Sheard, Judy
    Dema, Ilir
    Karkare, Amey
    Leinonen, Juho
    Liut, Michael
    McCauley, Renee
    ITICSE-WGR'20: PROCEEDINGS OF THE WORKING GROUP REPORTS ON INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, 2020, : 1 - 19
  • [5] Investigation of Decision Support System for Indian Penal Code Section Using Similarity Algorithm and Fuzzy Logic
    Srivastav, Ambrish
    Prajapat, Shaligram
    ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2023, 2024, 1453 : 652 - 667
  • [6] Scalable Source Code Similarity Detection in Large Code Repositories
    Alomari, Firas
    Harbi, Muhammed
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2019, 6 (22) : 1 - 11
  • [7] Binary Code Similarity Detection: State and Future
    Li, Zhenshan
    Liu, Hao
    Shan, Ruijie
    Sun, Yanbin
    Jiang, Yu
    Hu, Ning
    2023 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING, CLOUDNET, 2023, : 408 - 412
  • [8] A Survey of Binary Code Similarity Detection Techniques
    Ruan, Liting
    Xu, Qizhen
    Zhu, Shunzhi
    Huang, Xujing
    Lin, Xinyang
    ELECTRONICS, 2024, 13 (09)
  • [9] Code Similarity Detection Based on Siamese Network
    Wu, Yi
    Wang, Wei
    2021 IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SOFTWARE ENGINEERING (ICICSE 2021), 2021, : 47 - 51
  • [10] A Source Code Similarity System for Plagiarism Detection
    Duric, Zoran
    Gasevic, Dragan
    COMPUTER JOURNAL, 2013, 56 (01): : 70 - 86