A Machine Learning Based Ensemble Method for Automatic Multiclass Classification of Decisions

被引:6
|
作者
Fu, Liming [1 ]
Liang, Peng [1 ]
Li, Xueying [1 ]
Yang, Chen [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] IBO Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
基金
国家重点研发计划;
关键词
Decision; Automatic Classification; Ensemble Classifier; Software Development; Hibernate;
D O I
10.1145/3463274.3463325
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Stakeholders make various types of decisions with respect to requirements, design, management, and so on during the software development life cycle. Nevertheless, these decisions are typically not well documented and classified due to limited human resources, time, and budget. To this end, automatic approaches provide a promising way. In this paper, we aimed at automatically classifying decisions into five types to help stakeholders better document and understand decisions. First, we collected a dataset from the Hibernate developer mailing list. We then experimented and evaluated 270 configurations regarding feature selection, feature extraction techniques, and machine learning classifiers to seek the best configuration for classifying decisions. Especially, we applied an ensemble learning method and constructed ensemble classifiers to compare the performance between ensemble classifiers and base classifiers. Our experiment results show that (1) feature selection can decently improve the classification results; (2) ensemble classifiers can outperform base classifiers provided that ensemble classifiers are well constructed; (3) BoW + 50% features selected by feature selection with an ensemble classifier that combines Naive Bayes (NB), Logistic Regression (LR), and Support Vector Machine (SVM) achieves the best classification result (with a weighted precision of 0.750, a weighted recall of 0.739, and a weighted F1-score of 0.727) among all the configurations. Our work can benefit various types of stakeholders in software development through providing an automatic approach for effectively classifying decisions into specific types that are relevant to their interests.
引用
收藏
页码:40 / 49
页数:10
相关论文
共 50 条
  • [21] Probability based voting extreme learning machine for multiclass XML documents classification
    Zhao, Xiangguo
    Bi, Xin
    Qiao, Baiyou
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2014, 17 (05): : 1217 - 1231
  • [22] Study on a confidence machine learning method based on ensemble learning
    Jiang, Fang Chun
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (04): : 3357 - 3368
  • [23] Study on a confidence machine learning method based on ensemble learning
    Fang Chun Jiang
    Cluster Computing, 2017, 20 : 3357 - 3368
  • [24] Feasibility of Active Machine Learning for Multiclass Compound Classification
    Lang, Tobias
    Flachsenberg, Florian
    von Luxburg, Ulrike
    Rarey, Matthias
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (01) : 12 - 20
  • [25] Multiclass Classification Machine Learning Identification of Common Poisonings
    Nogee, Daniel
    Haimovich, Adrian
    Hart, Katherine
    Tomassoni, Anthony
    CLINICAL TOXICOLOGY, 2020, 58 (11) : 1083 - 1084
  • [26] Preoperative multiclass classification of thymic mass lesions based on radiomics and machine learning
    Zhu, Yan
    Wang, Li
    Ruan, Aichao
    Peng, Zhiyu
    Zhang, Zhenzhong
    CANCER IMAGING, 2025, 25 (01)
  • [27] Probability based voting extreme learning machine for multiclass XML documents classification
    Xiangguo Zhao
    Xin Bi
    Baiyou Qiao
    World Wide Web, 2014, 17 : 1217 - 1231
  • [28] Ensemble based reactivated regularization extreme learning machine for classification
    Zhang, Boyang
    Ma, Zhao
    Liu, Yingyi
    Yuan, Haiwen
    Sun, Lingjie
    NEUROCOMPUTING, 2018, 275 : 255 - 266
  • [29] An Ensemble Based Machine Learning Classification for Automated Glaucoma Detection
    Pawar, Digvijay J.
    Kanse, Yuvraj K.
    Patil, Suhas S.
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2024, 13
  • [30] Lung Nodule Image Classification Based on Ensemble Machine Learning
    Mao Keming
    Deng Zhuofu
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2016, 6 (07) : 1679 - 1685