DeepPull: Deep Learning-Based Approach for Predicting Reopening, Decision, and Lifetime of Pull Requests on GitHub Open-Source Projects

被引:0
|
作者
Banyongrakkul, Peerachai [1 ]
Phoomvuthisarn, Suronapee [1 ]
机构
[1] Chulalongkorn Univ, Dept Stat, Bangkok, Thailand
来源
关键词
Pull request; GitHub; Deep learning; Classification;
D O I
10.1007/978-3-031-61753-9_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces DeepPull, a novel multi-output deep learningbased classification approach designed to predict pull request outcomes in GitHub's pull-based software development model. The primary goal of DeepPull is to provide decision-making support for integrators in open-source software projects, with a particular focus onmitigating the challenges associated with managing high request volumes. DeepPull anticipates the reopening, decision, and lifetime of pull requests at the time of submission. Our method effectively leverages diverse data sources, including tabular and textual data, and incorporates a combination of SMOTE and VAE techniques to handle imbalances in reopening predictions. The evaluation of DeepPull on six well-known programming languages, along with 83 open-source projects, demonstrates significant performance improvements over both a randomized baseline and the existing approach. The approach greatly enhances balanced accuracy by 6.25%, AUC by 7.19%, and TPR by 16.78% in reopening prediction, improves accuracy by 7.71%, precision by 0.56%, recall by 10.96%, and F-measure by 6.27% in decision prediction, and reduces MMAE by 5.73% in lifetime prediction compared to an existing approach.
引用
收藏
页码:100 / 123
页数:24
相关论文
共 50 条
  • [1] Predicting the lifetime of pull requests in open-source projects
    de Lima Junior, Manoel Limeira
    Soares, Daricelio
    Plastino, Alexandre
    Murta, Leonardo
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2021, 33 (06)
  • [2] Acceptance Factors of Pull Requests in Open-Source Projects
    Soares, Daricelio Moreira
    de Lima Junior, Manoel Limeira
    Murta, Leonardo
    [J]. 30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 1541 - 1546
  • [3] What are the Characteristics of Reopened Pull Requests? A Case Study on Open Source Projects in GitHub
    Jiang, Jing
    Mohamed, Abdillah
    Zhang, Li
    [J]. IEEE ACCESS, 2019, 7 : 102751 - 102761
  • [4] FastPathology: An Open-Source Platform for Deep Learning-Based Research and Decision Support in Digital Pathology
    Pedersen, Andre
    Valla, Marit
    Bofin, Anna M.
    De Frutos, Javier Perez
    Reinertsen, Ingerid
    Smistad, Erik
    [J]. IEEE ACCESS, 2021, 9 : 58216 - 58229
  • [5] HDL-ODPRs: A Hybrid Deep Learning Technique Based Optimal Duplication Detection for Pull-Requests in Open-Source Repositories
    Alotaibi, Saud S.
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [6] MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers
    Lan, Yihuai
    Wang, Lei
    Zhang, Qiyuan
    Lan, Yunshi
    Dai, Bing Tian
    Wang, Yan
    Zhang, Dongxiang
    Lim, Ee-Peng
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 13188 - 13190
  • [7] An open-source deep learning model for predicting effluent concentration in capacitive deionization
    Son, Moon
    Yoon, Nakyung
    Park, Sanghun
    Abbas, Ather
    Cho, Kyung Hwa
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 856
  • [8] DeepRec: An Open-source Toolkit for Deep Learning based Recommendation
    Zhang, Shuai
    Tay, Yi
    Yao, Lina
    Wu, Bin
    Sun, Aixin
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6581 - 6583
  • [9] Open-source deep learning-based air-void detection algorithm for concrete microscopic images
    Hilloulin, B.
    Bekrine, I
    Schmitt, E.
    Loukili, A.
    [J]. JOURNAL OF MICROSCOPY, 2022, 286 (02) : 179 - 184
  • [10] Beyond Accuracy: An Empirical Study on Unit Testing in Open-source Deep Learning Projects
    Wang, Han
    Yu, Sijia
    Chen, Chunyang
    Turhan, Burak
    Zhu, Xiaodong
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (04)