Source Code Clone Detection Using Unsupervised Similarity Measures

被引:0
|
作者
Martinez-Gil, Jorge [1 ]
机构
[1] Software Competence Ctr Hagenberg GmbH, Softwarepk 32a, A-4232 Hagenberg, Austria
关键词
Software Engineering; Clone Detection; Similarity Measures; Code Similarity; METRICS; GRAPH;
D O I
10.1007/978-3-031-56281-5_2
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Assessing similarity in source code has gained significant attention in recent years due to its importance in software engineering tasks such as clone detection and code search and recommendation. This work presents a comparative analysis of unsupervised similarity measures for identifying source code clone detection. The goal is to overview the current state-of-the-art techniques, their strengths, and weaknesses. To do that, we compile the existing unsupervised strategies and evaluate their performance on a benchmark dataset to guide software engineers in selecting appropriate methods for their specific use cases. The source code of this study is available at https://github.com/jorge-martinez-gil/codesim
引用
收藏
页码:21 / 37
页数:17
相关论文
共 50 条
  • [21] Measuring source code similarity using reference vectors
    Ohno, Asako
    Murao, Hajime
    [J]. ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 2, PROCEEDINGS, 2006, : 92 - +
  • [22] Measuring source code similarity using reference vectors
    Ohno, Asako
    Murao, Hajime
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2007, 3 (03): : 525 - 537
  • [23] An Effective Approach Using Dissimilarity Measures To Estimate Software Code Clone
    Patil, Ritesh V.
    Joshi, Shashank. D.
    Shinde, Sachin V.
    Khanna, V.
    [J]. 2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, SIGNALS, COMMUNICATION AND OPTIMIZATION (EESCO), 2015,
  • [24] Refactoring Code Clone Detection
    Othman, Zhala Sarkawt
    Kaya, Mehmet
    [J]. 2019 7TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS), 2019,
  • [25] An Unsupervised Approach for Combining Scores of Outlier Detection Techniques, Based on Similarity Measures
    Pasillas-Diaz, Jose Ramon
    Ratte, Sylvie
    [J]. ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2016, 329 : 61 - 77
  • [26] Flow Chart Generation-Based Source Code Similarity Detection Using Process Mining
    Zhang, Feng
    Li, Lulu
    Liu, Cong
    Zeng, Qingtian
    [J]. SCIENTIFIC PROGRAMMING, 2020, 2020
  • [27] Prioritizing Code Clone Detection Results for Clone Management
    Venkatasubramanyam, Radhika D.
    Gupta, Shrinath
    Singh, Himanshu Kumar
    [J]. 2013 7TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC), 2013, : 30 - 36
  • [28] CCFinder: A multilinguistic token-based code clone detection system for large scale source code
    Kamiya, T
    Kusumoto, S
    Inoue, K
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (07) : 654 - 670
  • [29] Structural Code Clone Detection Methodology Using Software Metrics
    Aktas, Mehmet S.
    Kapdan, Mustafa
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2016, 26 (02) : 307 - 332
  • [30] A NEW SIMILARITY MEASURE FOR IN-CLASS SOURCE CODE PLAGIARISM DETECTION
    Ohno, Asako
    Murao, Hajime
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (11B): : 4237 - 4247