An Empirical Study of Deep Transfer Learning-Based Program Repair for Kotlin Projects

被引:3
|
作者
Kim, Misoo [1 ]
Kim, Youngkyoung [2 ]
Jeong, Hohyeon [2 ]
Heo, Jinseok [2 ]
Kim, Sungoh [3 ]
Chung, Hyunhee [3 ]
Lee, Eunseok [4 ]
机构
[1] Sungkyunkwan Univ, Inst Software Convergence, Suwon, South Korea
[2] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon, South Korea
[3] Samsung Elect, SW Engn Grp, Mobile Experience, Suwon, South Korea
[4] Sungkyunkwan Univ, Coll Comp & Informat, Suwon, South Korea
基金
新加坡国家研究基金会;
关键词
Empirical study; Deep learning-based program repair; Transfer learning; Industrial Kotlin project; SonarQube defects; SONARQUBE;
D O I
10.1145/3540250.3558967
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep learning-based automated program repair (DL-APR) can automatically fix software bugs and has received significant attention in the industry because of its potential to significantly reduce software development and maintenance costs. The Samsung mobile experience (MX) team is currently switching from Java to Kotlin projects. This study reviews the application of DL-APR, which automatically fixes defects that arise during this switching process; however, the shortage of Kotlin defect-fixing datasets in Samsung MX team precludes us from fully utilizing the power of deep learning. Therefore, strategies are needed to effectively reuse the pretrained DL-APR model. This demand can be met using the Kotlin defect-fixing datasets constructed from industrial and open-source repositories, and transfer learning. This study aims to validate the performance of the pretrained DL-APR model in fixing defects in the Samsung Kotlin projects, then improve its performance by applying transfer learning. We show that transfer learning with open source and industrial Kotlin defect-fixing datasets can improve the defect-fixing performance of the existing DL-APR by 307%. Furthermore, we confirmed that the performance was improved by 532% compared with the baseline DL-APR model as a result of transferring the knowledge of an industrial (non-defect) bug-fixing dataset. We also discovered that the embedded vectors and overlapping code tokens of the code-change pairs are valuable features for selecting useful knowledge transfer instances by improving the performance of APR models by up to 696%. Our study demonstrates the possibility of applying transfer learning to practitioners who review the application of DL-APR to industrial software.
引用
收藏
页码:1441 / 1452
页数:12
相关论文
共 50 条
  • [1] DEAR: A Novel Deep Learning-based Approach for Automated Program Repair
    Li, Yi
    Wang, Shaohua
    Nguyen, Tien N.
    [J]. 2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022), 2022, : 511 - 523
  • [2] Impact of Defect Instances for Successful Deep Learning-based Automatic Program Repair
    Kim, Misoo
    Kim, Youngkyoung
    Heo, Jinseok
    Jeong, Hohyeon
    Kim, Sungoh
    Lee, Eunseok
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 419 - 423
  • [3] A Survey of Learning-based Automated Program Repair
    Zhang, Quanjun
    Fang, Chunrong
    Ma, Yuxiang
    Sun, Weisong
    Chen, Zhenyu
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (02)
  • [4] Improving Deep Learning-Based UWB LOS/NLOS Identification with Transfer Learning: An Empirical Approach
    Park, JiWoong
    Nam, SungChan
    Choi, HongBeom
    Ko, YoungEun
    Ko, Young-Bae
    [J]. ELECTRONICS, 2020, 9 (10) : 1 - 13
  • [5] Shifting Left for Machine Learning: An Empirical Study of Security Weaknesses in Supervised Learning-based Projects
    Bhuiyan, Farzana Ahamed
    Prowell, Stacy
    Shahriar, Hossain
    Wu, Fan
    Rahman, Akond
    [J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 798 - 808
  • [6] Deep learning and deep transfer learning-based OPM for FMF systems
    Amirabadi, M. A.
    Kahaei, M. H.
    Nezamalhosseini, S. A.
    [J]. PHYSICAL COMMUNICATION, 2023, 60
  • [7] An Empirical Study of IR-based Bug Localization for Deep Learning-based Software
    Kim, Misoo
    Kim, Youngkyoung
    Lee, Eunseok
    [J]. 2022 IEEE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST 2022), 2022, : 128 - 139
  • [8] Deep Learning-based Beverage Recognition for Unmanned Vending Machines: An Empirical Study
    Zhang, Haijun
    Li, Donghai
    Ji, Yuzhu
    Zhou, Haibin
    Wu, Weiwei
    [J]. 2019 IEEE 17TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2019, : 1464 - 1467
  • [9] An Extensive Study on Model Architecture and Program Representation in the Domain of Learning-based Automated Program Repair
    Horvath, Daniel
    Csuvik, Viktor
    Gyimothy, Tibor
    Vidacs, Laszlo
    [J]. 2023 IEEE/ACM INTERNATIONAL WORKSHOP ON AUTOMATED PROGRAM REPAIR, APR, 2023, : 31 - 38
  • [10] An Empirical Study of Deep Learning-Based SS7 Attack Detection
    Guo, Yuejun
    Ermis, Orhan
    Tang, Qiang
    Trang, Hoang
    De Oliveira, Alexandre
    [J]. INFORMATION, 2023, 14 (09)