An Empirical Study of Deep Transfer Learning-Based Program Repair for Kotlin Projects

被引：3

作者：

Kim, Misoo ^{[1
]}

Kim, Youngkyoung ^{[2
]}

Jeong, Hohyeon ^{[2
]}

Heo, Jinseok ^{[2
]}

Kim, Sungoh ^{[3
]}

Chung, Hyunhee ^{[3
]}

Lee, Eunseok ^{[4
]}

机构：

[1] Sungkyunkwan Univ, Inst Software Convergence, Suwon, South Korea

[2] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon, South Korea

[3] Samsung Elect, SW Engn Grp, Mobile Experience, Suwon, South Korea

[4] Sungkyunkwan Univ, Coll Comp & Informat, Suwon, South Korea

来源：

PROCEEDINGS OF THE 30TH ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2022 | 2022年

基金：

新加坡国家研究基金会;

关键词：

Empirical study; Deep learning-based program repair; Transfer learning; Industrial Kotlin project; SonarQube defects; SONARQUBE;

D O I：

10.1145/3540250.3558967

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Deep learning-based automated program repair (DL-APR) can automatically fix software bugs and has received significant attention in the industry because of its potential to significantly reduce software development and maintenance costs. The Samsung mobile experience (MX) team is currently switching from Java to Kotlin projects. This study reviews the application of DL-APR, which automatically fixes defects that arise during this switching process; however, the shortage of Kotlin defect-fixing datasets in Samsung MX team precludes us from fully utilizing the power of deep learning. Therefore, strategies are needed to effectively reuse the pretrained DL-APR model. This demand can be met using the Kotlin defect-fixing datasets constructed from industrial and open-source repositories, and transfer learning. This study aims to validate the performance of the pretrained DL-APR model in fixing defects in the Samsung Kotlin projects, then improve its performance by applying transfer learning. We show that transfer learning with open source and industrial Kotlin defect-fixing datasets can improve the defect-fixing performance of the existing DL-APR by 307%. Furthermore, we confirmed that the performance was improved by 532% compared with the baseline DL-APR model as a result of transferring the knowledge of an industrial (non-defect) bug-fixing dataset. We also discovered that the embedded vectors and overlapping code tokens of the code-change pairs are valuable features for selecting useful knowledge transfer instances by improving the performance of APR models by up to 696%. Our study demonstrates the possibility of applying transfer learning to practitioners who review the application of DL-APR to industrial software.

引用

页码：1441 / 1452

页数：12

共 50 条

[1] DEAR: A Novel Deep Learning-based Approach for Automated Program Repair
Li, Yi
Wang, Shaohua
Nguyen, Tien N.
[J]. 2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022), 2022, : 511 - 523
[2] Impact of Defect Instances for Successful Deep Learning-based Automatic Program Repair
Kim, Misoo
Kim, Youngkyoung
Heo, Jinseok
Jeong, Hohyeon
Kim, Sungoh
Lee, Eunseok
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 419 - 423
[3] A Survey of Learning-based Automated Program Repair
Zhang, Quanjun
Fang, Chunrong
Ma, Yuxiang
Sun, Weisong
Chen, Zhenyu
[J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (02)
[4] Improving Deep Learning-Based UWB LOS/NLOS Identification with Transfer Learning: An Empirical Approach
Park, JiWoong
Nam, SungChan
Choi, HongBeom
Ko, YoungEun
Ko, Young-Bae
[J]. ELECTRONICS, 2020, 9 (10) : 1 - 13
[5] Shifting Left for Machine Learning: An Empirical Study of Security Weaknesses in Supervised Learning-based Projects
Bhuiyan, Farzana Ahamed
Prowell, Stacy
Shahriar, Hossain
Wu, Fan
Rahman, Akond
[J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 798 - 808
[6] Deep learning and deep transfer learning-based OPM for FMF systems
Amirabadi, M. A.
Kahaei, M. H.
Nezamalhosseini, S. A.
[J]. PHYSICAL COMMUNICATION, 2023, 60
[7] An Empirical Study of IR-based Bug Localization for Deep Learning-based Software
Kim, Misoo
Kim, Youngkyoung
Lee, Eunseok
[J]. 2022 IEEE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST 2022), 2022, : 128 - 139
[8] Deep Learning-based Beverage Recognition for Unmanned Vending Machines: An Empirical Study
Zhang, Haijun
Li, Donghai
Ji, Yuzhu
Zhou, Haibin
Wu, Weiwei
[J]. 2019 IEEE 17TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2019, : 1464 - 1467
[9] An Extensive Study on Model Architecture and Program Representation in the Domain of Learning-based Automated Program Repair
Horvath, Daniel
Csuvik, Viktor
Gyimothy, Tibor
Vidacs, Laszlo
[J]. 2023 IEEE/ACM INTERNATIONAL WORKSHOP ON AUTOMATED PROGRAM REPAIR, APR, 2023, : 31 - 38
[10] An Empirical Study of Deep Learning-Based SS7 Attack Detection
Guo, Yuejun
Ermis, Orhan
Tang, Qiang
Trang, Hoang
De Oliveira, Alexandre
[J]. INFORMATION, 2023, 14 (09)

← 1 2 3 4 5 →