A Full-fledged Commit Message Quality Checker Based on Machine Learning

被引:0
|
作者
Farago, David [1 ]
Faerber, Michael [2 ]
Petrov, Christian [3 ]
机构
[1] Innoopract GmbH & QPR Technol, Karlsruhe, Germany
[2] Karlsruhe Inst Technol, Karlsruhe, Germany
[3] Innoopract GmbH, Karlsruhe, Germany
关键词
commit message; maintenance; quality; text classification; machine learning;
D O I
10.1109/COMPSAC57700.2023.00108
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Commit messages (CMs) are an essential part of version control. By providing important context in regard to what has changed and why, they strongly support software maintenance and evolution. But writing good CMs is difficult and often neglected by developers. So far, there is no tool suitable for practice that automatically assesses how well a CM is written, including its meaning and context. Since this task is challenging, we ask the research question: how well can the CM quality, including semantics and context, be measured with machine learning methods? By considering all rules from the most popular CM quality guideline, creating datasets for those rules, and training and evaluating state-of-the-art machine learning models to check those rules, we can answer the research question with: sufficiently well for practice, with the lowest F1 score of 82.9%, for the most challenging task. We develop a full-fledged open-source framework that checks all these CM quality rules. It is useful for research, e.g., automatic CM generation, but most importantly for software practitioners to raise the quality of CMs and thus the maintainability and evolution speed of their software.
引用
收藏
页码:788 / 799
页数:12
相关论文
共 50 条
  • [1] A FULL-FLEDGED DEFENSE OF PRINCIPLE-BASED ETHICS AGAINST MORAL PARTICULARISM
    Jalilian, Mohammad Sobhan
    [J]. FILOZOFIA, 2022, 77 (08): : 608 - 625
  • [2] SmartVote: a full-fledged graph-based model for multi-valued truth discovery
    Fang, Xiu Susie
    Sheng, Quan Z.
    Wang, Xianzhi
    Chu, Dianhui
    Ngu, Anne H. H.
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (04): : 1855 - 1885
  • [3] SmartVote: a full-fledged graph-based model for multi-valued truth discovery
    Xiu Susie Fang
    Quan Z. Sheng
    Xianzhi Wang
    Dianhui Chu
    Anne H. H. Ngu
    [J]. World Wide Web, 2019, 22 : 1855 - 1885
  • [4] A full-fledged overhaul is needed for a risk and value-based regulation of medical devices in Europe
    Campillo-Artero, Carlos
    [J]. HEALTH POLICY, 2013, 113 (1-2) : 38 - 44
  • [5] Lateral Static Response of Piles Based on a Full-Fledged Two-Parameter Foundation Model
    Worku, Asrat
    Lulseged, Abey
    [J]. INTERNATIONAL JOURNAL OF GEOMECHANICS, 2023, 23 (02)
  • [6] Home-Based Care Reimagined: A Full-Fledged Health Care Delivery Ecosystem Without Walls
    Ritchie, Christine
    Leff, Bruce
    [J]. HEALTH AFFAIRS, 2022, 41 (05) : 689 - 695
  • [7] Implementation-Based Evaluation of a Full-Fledged Multihop TDMA-MAC for WiFi Mesh Networks
    Sevani, Vishal
    Raman, Bhaskaran
    Joshi, Piyush
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2014, 13 (02) : 392 - 406
  • [8] Assessing the impact of full-fledged location-based augmented reality games on tourism destination visits
    Lacka, Ewelina
    [J]. CURRENT ISSUES IN TOURISM, 2020, 23 (03) : 345 - 357
  • [9] Revisiting Learning-based Commit Message Generation
    Dong, Jinhao
    Lou, Yiling
    Hao, Dan
    Tan, Lin
    [J]. 2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 794 - 805
  • [10] From low-level programming to full-fledged industrial model-based development: the story of the Rubus Component Model
    Bucaioni, Alessio
    Ciccozzi, Federico
    Di Salle, Amleto
    Sjodin, Mikael
    [J]. SOFTWARE AND SYSTEMS MODELING, 2023, 22 (04): : 1085 - 1097