Automatic Bug Triage using Semi-Supervised Text Classification

被引:0
|
作者
Xuan, Jifeng [1 ]
Jiang, He [2 ,3 ]
Ren, Zhilei [1 ]
Yan, Jun [4 ]
Luo, Zhongxuan [1 ,2 ]
机构
[1] Dalian Univ Technol, Sch Math Sci, Dalian 116024, Peoples R China
[2] Dalian Univ Technol, Sch Software, Dalian 116621, Peoples R China
[3] Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Inst Software, Technol Ctr Software Engn, Beijing 100190, Peoples R China
关键词
automatic bug triage; expectation-maximization; semi-supervised text classification; weighted recommendation list;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose a semi-supervised text classification approach for bug triage to avoid the deficiency of labeled bug reports in existing supervised approaches. This new approach combines naive Bayes classifier and expectation-maximization to take advantage of both labeled and unlabeled bug reports. This approach trains a classifier with a fraction of labeled bug reports. Then the approach iteratively labels numerous unlabeled bug reports and trains a new classifier with labels of all the bug reports. We also employ a weighted recommendation list to boost the performance by imposing the weights of multiple developers in training the classifier. Experimental results on bug reports of Eclipse show that our new approach outperforms existing supervised approaches in terms of classification accuracy.
引用
收藏
页码:209 / 214
页数:6
相关论文
共 50 条
  • [31] Text Classification Method Based On Semi-Supervised Transfer Learning
    Yu, Xiaosheng
    Zhang, Hehuan
    Li, Jing
    [J]. 2021 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C 2021), 2021, : 388 - 394
  • [32] Improving Semi-Supervised Classification using Clustering
    Arora, J.
    Tushir, M.
    Kashyap, R.
    [J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2020, 7 (25) : 1 - 9
  • [33] Using semi-supervised learning for question classification
    Tri, Nguyen Thanh
    Le, Nguyen Minh
    Shimazu, Akira
    [J]. COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 31 - +
  • [34] Improving automatic query classification via semi-supervised learning
    Beitzel, SM
    Jensen, EC
    Frieder, O
    Lewis, DD
    Chowdhury, A
    Kolcz, A
    [J]. Fifth IEEE International Conference on Data Mining, Proceedings, 2005, : 42 - 49
  • [35] Semi-supervised classification using multiple clusterings
    Yu G.X.
    Feng L.
    Yao G.J.
    Wang J.
    [J]. Wang, J. (kingjun@swu.edu.cn), 1600, Izdatel'stvo Nauka (26): : 681 - 687
  • [36] Semi-supervised Text Classification from Unlabeled Documents Using Class Associated Words
    Han Hong-qi
    Zhu Dong-hua
    Wang Xue-feng
    [J]. CIE: 2009 INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2009, : 1255 - 1260
  • [37] Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings
    Johnson, Rie
    Zhang, Tong
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [38] Automatic Classification of White Blood Cells Using a Semi-Supervised Convolutional Neural Network
    Song, Huihui
    Wang, Zheng
    [J]. IEEE ACCESS, 2024, 12 : 44972 - 44983
  • [39] An Improved Semi-supervised Variational Autoencoder with Gate Mechanism for Text Classification
    Ye, Haiming
    Zhang, Weiwen
    Nie, Mengna
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (10)
  • [40] Prototype-Guided Pseudo Labeling for Semi-Supervised Text Classification
    Yang, Weiyi
    Zhang, Richong
    Chen, Junfan
    Wang, Lihong
    Kim, Jaein
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 16369 - 16382