Source Code Clone Detection Using Unsupervised Similarity Measures

Cited by: 0
Author
Martinez-Gil, Jorge [1]
Affiliation
[1] Software Competence Center Hagenberg GmbH, Softwarepark 32a, A-4232 Hagenberg, Austria
Keywords
Software Engineering; Clone Detection; Similarity Measures; Code Similarity; Metrics; Graph
DOI
10.1007/978-3-031-56281-5_2
Chinese Library Classification number
TP39 [Computer applications]
Subject classification code
081203; 0835
Abstract
Assessing similarity in source code has gained significant attention in recent years because of its importance in software engineering tasks such as clone detection, code search, and code recommendation. This work presents a comparative analysis of unsupervised similarity measures for source code clone detection. The goal is to survey the current state-of-the-art techniques along with their strengths and weaknesses. To that end, we compile the existing unsupervised strategies and evaluate their performance on a benchmark dataset, to guide software engineers in selecting appropriate methods for their specific use cases. The source code of this study is available at https://github.com/jorge-martinez-gil/codesim
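As a rough illustration of what an unsupervised similarity measure for clone detection can look like, the sketch below computes a token-level Jaccard similarity between two code snippets. It is not taken from the paper or its repository: the tokenizer regex, the function names `tokenize`, `jaccard_similarity`, and `is_clone`, and the 0.7 threshold are all assumptions made for illustration.

```python
import re


def tokenize(code: str) -> set:
    """Split source text into a set of identifier, number, and symbol tokens.

    Assumption: a simple language-agnostic regex, not a real lexer.
    """
    return set(re.findall(r"[A-Za-z_]\w*|\d+|[^\s\w]", code))


def jaccard_similarity(code_a: str, code_b: str) -> float:
    """Return |A & B| / |A | B| over the token sets of the two snippets."""
    tokens_a, tokens_b = tokenize(code_a), tokenize(code_b)
    if not tokens_a and not tokens_b:
        return 1.0  # two empty snippets are treated as identical
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def is_clone(code_a: str, code_b: str, threshold: float = 0.7) -> bool:
    """Flag a pair as a clone candidate; the threshold is a hypothetical value."""
    return jaccard_similarity(code_a, code_b) >= threshold


if __name__ == "__main__":
    # Token sets {a, +, b} and {a, +, c} share 2 of 4 distinct tokens.
    print(jaccard_similarity("a + b", "a + c"))  # 0.5
```

No labeled training data is needed, which is what makes a measure like this unsupervised. In practice the threshold would have to be tuned against a benchmark dataset.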
Pages: 21-37
Page count: 17
Related papers
50 in total
  • [1] Indexing source code and clone detection
    Tronicek, Zdenek
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 144
  • [2] DroidCC: A Scalable Clone Detection Approach for Android Applications to Detect Similarity at Source Code Level
    Akram, Junaid
    Shi, Zhendong
    Mumtaz, Majid
    Ping, Luo
    [J]. 2018 IEEE 42ND ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2018: 100 - 105
  • [3] A systematic literature review on source code similarity measurement and clone detection: Techniques, applications, and challenges
    Zakeri-Nasrabadi, Morteza
    Parsa, Saeed
    Ramezani, Mohammad
    Roy, Chanchal
    Ekhtiarzadeh, Masoud
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 204
  • [4] Gapped Code Clone Detection with Lightweight Source Code Analysis
    Murakami, Hiroaki
    Hotta, Keisuke
    Higo, Yoshiki
    Igaki, Hiroshi
    Kusumoto, Shinji
    [J]. 2013 IEEE 21ST INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC), 2013: 93 - 102
  • [5] Cross-Language Code Similarity and Applications in Clone Detection and Code Search
    Mathew, George Varghese
    [J]. ProQuest Dissertations and Theses Global, 2022
  • [6] Code Clone Detection using Wavelets
    Karus, Siim
    Kilgi, Karl
    [J]. 2015 IEEE 9TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC), 2015: 8 - 14
  • [7] Clone detection in source code by frequent itemset techniques
    Wahler, V
    Seipel, D
    Von Gudenberg, JW
    Fischer, G
    [J]. FOURTH IEEE INTERNATIONAL WORKSHOP ON SOURCE CODE ANALYSIS AND MANIPULATION, PROCEEDINGS, 2004: 128 - 135
  • [8] Source Code Plagiarism Detection Using Biological String Similarity Algorithms
    Rahal, Imad
    Wielga, Colin
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2014, 13 (03)
  • [9] Source code similarity detection using adaptive local alignment of keywords
    Ji, Jeong-Hoon
    Park, Soo-Hyun
    Woo, Gyun
    Cho, Hwan-Gue
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2007: 179 - 180
  • [10] A Source Code Similarity System for Plagiarism Detection
    Duric, Zoran
    Gasevic, Dragan
    [J]. COMPUTER JOURNAL, 2013, 56 (01): 70 - 86