Automatic Unsupervised Bug Report Categorization

被引:15
|
作者
Limsettho, Nachai [1 ]
Hata, Hideaki [1 ]
Monden, Akito [1 ]
Matsumoto, Kenichi [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300101, Japan
关键词
automated bug report categorization; topic modeling; clustering; cluster labeling;
D O I
10.1109/IWESEP.2014.8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Background: Information in bug reports is implicit and therefore difficult to comprehend. To extract its meaning, some processes are required. Categorizing bug reports is a technique that can help in this regard. It can be used to help in the bug reports management or to understand the underlying structure of the desired project. However, most researches in this area are focusing on a supervised learning approach that still requires a lot of human afford to prepare a training data. Aims: Our aim is to develop an automated framework than can categorize bug reports, according to their hidden characteristics and structures, without the needed for training data. Method: We solve this problem using clustering, unsupervised learning approach. It can automatically group bug reports together based on their textual similarity. We also propose a novel method to label each group with meaningful and representative names. Results: Experiment results show that our framework can achieve performance comparable to the supervised learning approaches. We also show that our labeling process can label each cluster with representative names according to its characteristic. Conclusion: Our framework could be used as an automated categorization system that can be applied without prior knowledge or as an automated labeling suggestion system.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 50 条
  • [1] Unsupervised Bug Report Categorization Using Clustering and Labeling Algorithm
    Limsettho, Nachai
    Hata, Hideaki
    Monden, Akito
    Matsumoto, Kenichi
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2016, 26 (07) : 1027 - 1053
  • [2] An Optimization Technique for Unsupervised Automatic Extractive Bug Report Summarization
    Kukkar, Ashima
    Mohana, Rajni
    [J]. INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 2, 2019, 56 : 1 - 11
  • [3] On the Effectiveness of Labeled Latent Dirichlet Allocation in Automatic Bug-Report Categorization
    Zibran, Minhaz F.
    [J]. 2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C), 2016, : 713 - 715
  • [4] Unsupervised Deep Bug Report Summarization
    Li, Xiaochen
    Jiang, He
    Liu, Dong
    Ren, Zhilei
    Li, Ge
    [J]. 2018 IEEE/ACM 26TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2018), 2018, : 144 - 155
  • [5] An Unsupervised Random Forest Clustering Technique for Automatic Traffic Scenario Categorization
    Kruber, Friedrich
    Wurst, Jonas
    Botsch, Michael
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 2811 - 2818
  • [6] System for Automatic Evaluation of Bug Report Quality
    Mirziianov, Ruslan
    Kiamov, Amir
    Ehlakov, Eduard, V
    [J]. PROCEEDINGS OF THE 2021 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (ELCONRUS), 2021, : 2158 - 2160
  • [7] A Study of Applying Unsupervised Learning Methods for Document Clustering and Automatic Categorization of Software
    Chen, Kai-Wen
    Huang, Chin-Yu
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM21), 2021, : 1626 - 1630
  • [8] Unsupervised image categorization
    Heidemann, G
    [J]. IMAGE AND VISION COMPUTING, 2005, 23 (10) : 861 - 876
  • [9] The Automatic Classification of Fault Trigger Based Bug Report
    Du, Xiaoting
    Zheng, Zheng
    Xiao, Guanping
    Yin, Beibei
    [J]. 2017 IEEE 28TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2017), 2017, : 259 - 265
  • [10] CaPBug-A Framework for Automatic Bug Categorization and Prioritization Using NLP and Machine Learning Algorithms
    Ahmed, Hafiza Anisa
    Bawany, Narmeen Zakaria
    Shamsi, Jawwad Ahmed
    [J]. IEEE ACCESS, 2021, 9 : 50496 - 50512