Investigating the Generalizability of Deep Learning-based Clone Detectors

被引:1
|
作者
Choi, Eunjong [1 ]
Fuke, Norihiro [2 ]
Fujiwara, Yuji [2 ]
Yoshida, Norihiro [3 ]
Inoue, Katsuro [4 ]
机构
[1] Kyoto Inst Technol, Kyoto, Japan
[2] Osaka Univ, Osaka, Japan
[3] Ritsumeikan Univ, Kyoto, Japan
[4] Nanzan Univ, Nagoya, Aichi, Japan
关键词
code clone; deep learning; generalizability; CODE;
D O I
10.1109/ICPC58990.2023.00032
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The generalizability of Deep Learning (DL) models is a significant challenge, as poor generalizability indicates that the model has overfitted to the training data and is not able to generalize to new data. Despite numerous DL-based clone detectors emerging in recent years, their generalizability has not been thoroughly assessed. This study investigates the generalizability of three DL-based clone detectors (CCLearner, ASTNN, and CodeBERT) by comparing their detection accuracy on different training and testing clone benchmarks. The results show that all three clone detectors do not generalize well to new data and there is a strong relationship between clone types and generalizability for CCLearner and ASTNN.
引用
收藏
页码:181 / 185
页数:5
相关论文
共 50 条
  • [31] A deep learning-based multimodal ensemble algorithm for lung cancer early detection with cross-ethnic generalizability
    Lee, Tae-Rim
    Ahn, Jin Mo
    Lee, Junnam
    Kim, Dasom
    Jeong, Byeong-Ho
    Oh, Dongryul
    Wang, Mengchi
    Salmans, Michael
    Carson, Andrew
    Leatham, Bryan
    Fathe, Kristin
    Lee, Byung In
    Ki, Chang-Seok
    Park, Young Sik
    Cho, Eun-Hae
    CANCER RESEARCH, 2024, 84 (06)
  • [32] Improving the Generalizability of Infantile Cataracts Detection via Deep Learning-Based Lens Partition Strategy and Multicenter Datasets
    Jiang, Jiewei
    Lei, Shutao
    Zhu, Mingmin
    Li, Ruiyang
    Yue, Jiayun
    Chen, Jingjing
    Li, Zhongwen
    Gong, Jiamin
    Lin, Duoru
    Wu, Xiaohang
    Lin, Zhuoling
    Lin, Haotian
    FRONTIERS IN MEDICINE, 2021, 8
  • [33] Deep Learning-Based COVID-19 Pneumonia Classification Using Chest CT Images: Model Generalizability
    Nguyen, Dan
    Kay, Fernando
    Tan, Jun
    Yan, Yulong
    Ng, Yee Seng
    Iyengar, Puneeth
    Peshock, Ron
    Jiang, Steve
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2021, 4
  • [34] Multi-population generalizability of a deep learning-based chest radiograph severity score for COVID-19
    Li, Matthew D.
    Arun, Nishanth T.
    Aggarwal, Mehak
    Gupta, Sharut
    Singh, Praveer
    Little, Brent P.
    Mendoza, Dexter P.
    Corradi, Gustavo C. A.
    Takahashi, Marcelo S.
    Ferraciolli, Suely F.
    Succi, Marc D.
    Lang, Min
    Bizzo, Bernardo C.
    Dayan, Ittai
    Kitamura, Felipe C.
    Kalpathy-Cramer, Jayashree
    MEDICINE, 2022, 101 (29) : E29587
  • [35] Generalizability Improvement of Deep Learning-Based Non-Intrusive Load Monitoring System Using Data Augmentation
    Rafiq, Hasan
    Shi, Xiaohan
    Zhang, Hengxu
    Li, Huimin
    Ochani, Manesh Kumar
    Shah, Aamer Abbas
    IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (04) : 3265 - 3277
  • [36] Investigating Staining Variance Effects on Deep Learning-Based Semantic Segmentation in Digital Pathology
    Marzouki, Amine
    Vagena, Zografoula
    Kurtz, Camille
    Lomenie, Nicolas
    DIGITAL AND COMPUTATIONAL PATHOLOGY, MEDICAL IMAGING 2024, 2024, 12933
  • [37] Deep learning-based EEG analysis: investigating P3 ERP components
    Borra, Davide
    Magosso, Elisa
    JOURNAL OF INTEGRATIVE NEUROSCIENCE, 2021, 20 (04) : 791 - 811
  • [38] Generic Deep Learning-Based Linear Detectors for MIMO Systems Over Correlated Noise Environments
    He, Ke
    Wang, Zizhi
    Huang, Wei
    Deng, Dan
    Xia, Junjuan
    Fan, Liseng
    IEEE ACCESS, 2020, 8 : 29922 - 29929
  • [39] A novel method for improving the robustness of deep learning-based malware detectors against adversarial attacks
    Shaukat, Kamran
    Luo, Suhuai
    Varadharajan, Vijay
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
  • [40] Certified Robustness of Static Deep Learning-based Malware Detectors against Patch and Append Attacks
    Gibert, Daniel
    Zizzo, Giulio
    Le, Quan
    PROCEEDINGS OF THE 16TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2023, 2023, : 173 - 184