An Approach for Assessing Similarity Metrics Used in Metric-based Clone Detection Techniques

被引:6
|
作者
Shawky, Doaa M. [1 ]
Ali, Ahmed F. [2 ]
机构
[1] Cairo Univ, Fac Engn, Dept Engn Math, Cairo, Egypt
[2] Helwan Univ, Fac Engn, Dept Biomed Engn, Helwan, Egypt
关键词
similarity metrics; clustering; clone detection;
D O I
10.1109/ICCSIT.2010.5563834
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Similarity is an important concept in information theory. A challenging question is how to measure the amount of shared information between two systems. A large number of metrics are proposed and used to measure similarity between two computer programs or two portions of the same program. In this paper, we present an approach for assessing which metrics are most useful for similarity prediction in the context of clone detection. The presented approach uses clustering to identify clone candidates. In the experiments conducted, we applied sequential clustering using all possible permutations of a subset of the metrics used in metric-based clone detection literature. Precision and recall are calculated in every experiment. Experimental results show that the order of the metrics used affects the results dramatically. This shows that the used metrics are of variable relevance.
引用
收藏
页码:580 / 584
页数:5
相关论文
共 50 条
  • [31] A metric-based approach to detect abstract data types and state encapsulations
    Girard, JF
    Koschke, R
    Scheid, G
    [J]. AUTOMATED SOFTWARE ENGINEERING, 12TH IEEE INTERNATIONAL CONFERENCE, PROCEEDINGS, 1997, : 82 - 89
  • [32] Using metric-based filtering to improve design pattern detection approaches
    Issaoui, Imene
    Bouassida, Nadia
    Ben-Abdallah, Hanene
    [J]. INNOVATIONS IN SYSTEMS AND SOFTWARE ENGINEERING, 2015, 11 (01) : 39 - 53
  • [33] Similarity Metric-Based Metalearning Network Combining Prior Metatraining Strategy for Intelligent Fault Detection Under Small Samples Prerequisite
    Chang, Yuanhong
    Chen, Jinglong
    He, Shuilong
    Pan, Tongyang
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [34] Basic statistics for distributional symbolic variables: a new metric-based approach
    Antonio Irpino
    Rosanna Verde
    [J]. Advances in Data Analysis and Classification, 2015, 9 : 143 - 175
  • [35] A Metric-Based Approach to Modelling a Virtual Machine for Smart Contract Execution
    Luca, Spataru Alexe
    Pungila, Ciprian
    [J]. 2020 22ND INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2020), 2020, : 302 - 309
  • [36] Comparing Heuristic and Machine Learning Approaches for Metric-Based Code Smell Detection
    Pecorelli, Fabiano
    Palomba, Fabio
    Di Nucci, Dario
    De Lucia, Andrea
    [J]. 2019 IEEE/ACM 27TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2019), 2019, : 93 - 104
  • [37] Program Code Understandability and Authenticating Code Predicting Systems: A Metric-Based Approach
    Jha, Pooja
    Patnaik, K. Sridhar
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL, NETWORKS, COMPUTING, AND SYSTEMS (ICSNCS 2016), VOL 2, 2016, 396 : 95 - 103
  • [38] An Approach to Counteracting the Common Cyber-attacks According to the Metric-Based Model
    Geramiparvar, Mohammad Sirwan
    Modiri, Nasser
    [J]. International Journal of Computer Science and Network Security, 2016, 16 (01): : 81 - 85
  • [39] A novel parallel distance metric-based approach for diversified ranking on large graphs
    Li, Jin
    Yang, Yun
    Wang, Xiaoling
    Zhao, Zhiming
    Li, Tong
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 88 : 79 - 91
  • [40] Wasserstein Metric-Based Location Spoofing Attack Detection in WiFi Positioning Systems
    Tian, Yinghua
    Zheng, Nae
    Chen, Xiang
    Gao, Liuyang
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2021, 2021