BERT-Based GitHub Issue Report Classification

被引：12

作者：

Siddiq, Mohammed Latif ^{[1
]}

Santos, Joanna C. S. ^{[1
]}

机构：

[1] Univ Notre Dame, Notre Dame, IN 46556 USA

来源：

2022 IEEE/ACM 1ST INTERNATIONAL WORKSHOP ON NATURAL LANGUAGE-BASED SOFTWARE ENGINEERING (NLBSE 2022) | 2022年

关键词：

issue type classification; multi-class classification; text processing; software maintenance; pre-trained model;

D O I：

10.1145/3528588.3528660

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Issue tracking is one of the integral parts of software development, especially for open source projects. GitHub, a commonly used software management tool, provides its own issue tracking system. Each issue can have various tags, which are manually assigned by the project's developers. However, manually labeling software reports is a time-consuming and error-prone task. In this paper, we describe a BERT-based classification technique to automatically label issues as questions, bugs, or enhancements. We evaluate our approach using a dataset containing over 800,000 labeled issues from real open source projects available on GitHub. Our approach classified reported issues with an average F1-score of 0.8571. Our technique outperforms a previous machine learning technique based on FastText.

引用

页码：33 / 36

页数：4

共 50 条

[1] GitHub Issue Classification Using BERT-Style Models
Bharadwaj, Shikhar
Kadam, Tushar
[J]. 2022 IEEE/ACM 1ST INTERNATIONAL WORKSHOP ON NATURAL LANGUAGE-BASED SOFTWARE ENGINEERING (NLBSE 2022), 2022, : 40 - 43
[2] BAE: BERT-based Adversarial Examples for Text Classification
Garg, Siddhant
Ramakrishnan, Goutham
[J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6174 - 6181
[3] Biomedical Abstract Sentence Classification by BERT-Based Reading Comprehension
Jiang C.-Y.
Fan Y.-C.
[J]. SN Computer Science, 4 (4)
[4] Improving BERT-Based Text Classification With Auxiliary Sentence and Domain Knowledge
Yu, Shanshan
Su, Jindian
Luo, Da
[J]. IEEE ACCESS, 2019, 7 : 176600 - 176612
[5] Assessing the use of attention weights to interpret BERT-based stance classification
Cordova Saenz, Carlos Abel
Becker, Karin
[J]. 2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021), 2021, : 194 - 201
[6] BERT-based Lexical Substitution
Zhou, Wangchunshu
Ge, Tao
Xu, Ke
Wei, Furu
Zhou, Ming
[J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3368 - 3373
[7] Improving Bert-Based Model for Medical Text Classification with an Optimization Algorithm
Gasmi, Karim
[J]. ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022, 2022, 1653 : 101 - 111
[8] BERT-based semi-supervised domain adaptation for disastrous classification
Jing Wang
Kexin Wang
[J]. Multimedia Systems, 2022, 28 : 2237 - 2246
[9] BERT-based semi-supervised domain adaptation for disastrous classification
Wang, Jing
Wang, Kexin
[J]. MULTIMEDIA SYSTEMS, 2022, 28 (06) : 2237 - 2246
[10] Short-Text Classification Detector: A Bert-Based Mental Approach
Hu, Yongjun
Ding, Jia
Dou, Zixin
Chang, Huiyou
[J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022

← 1 2 3 4 5 →