Experience report: investigating bug fixes in machine learning frameworks/libraries

被引:0
|
作者
Xiaobing Sun
Tianchi Zhou
Rongcun Wang
Yucong Duan
Lili Bo
Jianming Chang
机构
[1] Yangzhou University,School of Information Engineering
[2] Nanjing University,State Key Laboratory for Novel Software Technology
[3] Yangzhou University,Jiangsu Engineering Research Center of Knowledge Management and Intelligent Service
[4] China University of Mining and Technology,School of Computer Science and Technology
[5] Hainan University,School of Computer Science and Cyberspace Security
来源
关键词
bug fixing; machine learning project; empirical study; questionnaire survey;
D O I
暂无
中图分类号
学科分类号
摘要
Machine learning (ML) techniques and algorithms have been successfully and widely used in various areas including software engineering tasks. Like other software projects, bugs are also common in ML projects and libraries. In order to more deeply understand the features related to bug fixing in ML projects, we conduct an empirical study with 939 bugs from five ML projects by manually examining the bug categories, fixing patterns, fixing scale, fixing duration, and types of maintenance. The results show that (1) there are commonly seven types of bugs in ML programs; (2) twelve fixing patterns are typically used to fix the bugs in ML programs; (3) 68.80% of the patches belong to micro-scale-fix and small-scale-fix; (4) 66.77% of the bugs in ML programs can be fixed within one month; (5) 45.90% of the bug fixes belong to corrective activity from the perspective of software maintenance. Moreover, we perform a questionnaire survey and send them to developers or users of ML projects to validate the results in our empirical study. The results of our empirical study are basically consistent with the feedback from developers. The findings from the empirical study provide useful guidance and insights for developers and users to effectively detect and fix bugs in ML projects.
引用
收藏
相关论文
共 50 条
  • [1] Experience report: investigating bug fixes in machine learning frameworks/libraries
    Xiaobing SUN
    Tianchi ZHOU
    Rongcun WANG
    Yucong DUAN
    Lili BO
    Jianming CHANG
    Frontiers of Computer Science, 2021, (06) : 13 - 28
  • [2] Experience report: investigating bug fixes in machine learning frameworks/libraries
    Sun, Xiaobing
    Zhou, Tianchi
    Wang, Rongcun
    Duan, Yucong
    Bo, Lili
    Chang, Jianming
    FRONTIERS OF COMPUTER SCIENCE, 2021, 15 (06)
  • [3] Various Frameworks and Libraries of Machine Learning and Deep Learning: A Survey
    Zhaobin Wang
    Ke Liu
    Jian Li
    Ying Zhu
    Yaonan Zhang
    Archives of Computational Methods in Engineering, 2024, 31 : 1 - 24
  • [4] Various Frameworks and Libraries of Machine Learning and Deep Learning: A Survey
    Wang, Zhaobin
    Liu, Ke
    Li, Jian
    Zhu, Ying
    Zhang, Yaonan
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2024, 31 (01) : 1 - 24
  • [5] Investigating the reliability and interpretability of machine learning frameworks for chemical retrosynthesis
    Hastedt, Friedrich
    Bailey, Rowan M.
    Hellgardt, Klaus
    Yaliraki, Sophia N.
    Chanona, Ehecatl Antonio del Rio
    Zhang, Dongda
    DIGITAL DISCOVERY, 2024, 3 (06): : 1194 - 1212
  • [6] KATANA: Dual Slicing Based Context for Learning Bug Fixes
    Sintaha, Mifta
    Nashid, Noor
    Mesbah, Ali
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (04)
  • [7] Learning How to Mutate Source Code from Bug-Fixes
    Tufano, Michele
    Watson, Cody
    Bavota, Gabriele
    Di Penta, Massimiliano
    White, Martin
    Poshyvanyk, Denys
    2019 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2019), 2019, : 301 - 312
  • [8] Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: a survey
    Giang Nguyen
    Stefan Dlugolinsky
    Martin Bobák
    Viet Tran
    Álvaro López García
    Ignacio Heredia
    Peter Malík
    Ladislav Hluchý
    Artificial Intelligence Review, 2019, 52 : 77 - 124
  • [9] Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: a survey
    Nguyen, Giang
    Dlugolinsky, Stefan
    Bobak, Martin
    Viet Tran
    Lopez Garcia, Alvaro
    Heredia, Ignacio
    Malik, Peter
    Hluchy, Ladislav
    ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (01) : 77 - 124
  • [10] Generating Python']Python Mutants From Bug Fixes Using Neural Machine Translation
    Asik, Sergen
    Yayan, Ugur
    IEEE ACCESS, 2023, 11 : 85678 - 85693