Automatic Bug Triage using Semi-Supervised Text Classification

被引:0
|
作者
Xuan, Jifeng [1 ]
Jiang, He [2 ,3 ]
Ren, Zhilei [1 ]
Yan, Jun [4 ]
Luo, Zhongxuan [1 ,2 ]
机构
[1] Dalian Univ Technol, Sch Math Sci, Dalian 116024, Peoples R China
[2] Dalian Univ Technol, Sch Software, Dalian 116621, Peoples R China
[3] Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Inst Software, Technol Ctr Software Engn, Beijing 100190, Peoples R China
关键词
automatic bug triage; expectation-maximization; semi-supervised text classification; weighted recommendation list;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose a semi-supervised text classification approach for bug triage to avoid the deficiency of labeled bug reports in existing supervised approaches. This new approach combines naive Bayes classifier and expectation-maximization to take advantage of both labeled and unlabeled bug reports. This approach trains a classifier with a fraction of labeled bug reports. Then the approach iteratively labels numerous unlabeled bug reports and trains a new classifier with labels of all the bug reports. We also employ a weighted recommendation list to boost the performance by imposing the weights of multiple developers in training the classifier. Experimental results on bug reports of Eclipse show that our new approach outperforms existing supervised approaches in terms of classification accuracy.
引用
收藏
页码:209 / 214
页数:6
相关论文
共 50 条
  • [1] Text Classification Using Semi-Supervised Clustering
    Zhang, Wen
    Yoshida, Taketoshi
    Tang, Xijin
    [J]. 2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 197 - 200
  • [2] Semi-supervised text classification using partitioned EM
    Cong, G
    Lee, WS
    Wu, HR
    Liu, B
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2004, 2973 : 482 - 493
  • [3] Semi-supervised Text Classification Using RBF Networks
    Jiang, Eric P.
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS VIII, PROCEEDINGS, 2009, 5772 : 95 - 106
  • [4] An Exploration of Semi-supervised Text Classification
    Lien, Henrik
    Biermann, Daniel
    Palumbo, Fabrizio
    Goodwin, Morten
    [J]. ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EAAAI/EANN 2022, 2022, 1600 : 477 - 488
  • [5] Semi-supervised collaborative text classification
    Jin, Rong
    Wu, Ming
    Sukthankar, Rahul
    [J]. MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 600 - +
  • [6] TESC: An approach to TExt classification using Semi-supervised Clustering
    Zhang, Wen
    Tang, Xijin
    Yoshida, Taketoshi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 75 : 152 - 160
  • [7] Semi-supervised text classification using positive and unlabeled data
    Yu, Shuang
    Zhou, Xueyuan
    Li, Chunping
    [J]. ADVANCES IN INTELLIGENT IT: ACTIVE MEDIA TECHNOLOGY 2006, 2006, 138 : 249 - 254
  • [8] Text Classification using Semi-supervised Approach for Multi Domain
    Deshmukh, Jyoti S.
    Tripathy, Amiya Kumar
    [J]. 2017 INTERNATIONAL CONFERENCE ON NASCENT TECHNOLOGIES IN ENGINEERING (ICNTE-2017), 2017,
  • [9] A review of semi-supervised learning for text classification
    José Marcio Duarte
    Lilian Berton
    [J]. Artificial Intelligence Review, 2023, 56 : 9401 - 9469
  • [10] Variational Autoencoder for Semi-Supervised Text Classification
    Xu, Weidi
    Sun, Haoze
    Deng, Chao
    Tan, Ying
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3358 - 3364