A Machine Learning Based Ensemble Method for Automatic Multiclass Classification of Decisions

被引:6
|
作者
Fu, Liming [1 ]
Liang, Peng [1 ]
Li, Xueying [1 ]
Yang, Chen [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] IBO Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
基金
国家重点研发计划;
关键词
Decision; Automatic Classification; Ensemble Classifier; Software Development; Hibernate;
D O I
10.1145/3463274.3463325
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Stakeholders make various types of decisions with respect to requirements, design, management, and so on during the software development life cycle. Nevertheless, these decisions are typically not well documented and classified due to limited human resources, time, and budget. To this end, automatic approaches provide a promising way. In this paper, we aimed at automatically classifying decisions into five types to help stakeholders better document and understand decisions. First, we collected a dataset from the Hibernate developer mailing list. We then experimented and evaluated 270 configurations regarding feature selection, feature extraction techniques, and machine learning classifiers to seek the best configuration for classifying decisions. Especially, we applied an ensemble learning method and constructed ensemble classifiers to compare the performance between ensemble classifiers and base classifiers. Our experiment results show that (1) feature selection can decently improve the classification results; (2) ensemble classifiers can outperform base classifiers provided that ensemble classifiers are well constructed; (3) BoW + 50% features selected by feature selection with an ensemble classifier that combines Naive Bayes (NB), Logistic Regression (LR), and Support Vector Machine (SVM) achieves the best classification result (with a weighted precision of 0.750, a weighted recall of 0.739, and a weighted F1-score of 0.727) among all the configurations. Our work can benefit various types of stakeholders in software development through providing an automatic approach for effectively classifying decisions into specific types that are relevant to their interests.
引用
收藏
页码:40 / 49
页数:10
相关论文
共 50 条
  • [1] Deep Learning with Ensemble Classification Method for Sensor Sampling Decisions
    Taleb, Sirine
    Al Sallab, Ahmad
    Hajj, Hazem
    Dawy, Zaher
    Khanna, Rahul
    Keshavamurthy, Anil
    2016 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2016, : 114 - 119
  • [2] Automatic Modulation Classification Based on Decentralized Learning and Ensemble Learning
    Fu, Xue
    Gui, Guan
    Wang, Yu
    Gacanin, Haris
    Adachi, Fumiyuki
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (07) : 7942 - 7946
  • [3] Automatic Detection of Epilepsy and Seizure Using Multiclass Sparse Extreme Learning Machine Classification
    Wang, Yuanfa
    Li, Zunchao
    Feng, Lichen
    Zheng, Chuang
    Zhang, Wenhao
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2017, 2017
  • [4] Strength of ensemble learning in multiclass classification of rockburst intensity
    Zhang, Junfei
    Wang, Yuhang
    Sun, Yuantian
    Li, Guichen
    INTERNATIONAL JOURNAL FOR NUMERICAL AND ANALYTICAL METHODS IN GEOMECHANICS, 2020, 44 (13) : 1833 - 1853
  • [5] Ensemble of kernel extreme learning machine based random forest classifiers for automatic heartbeat classification
    Yang, Ping
    Wang, Dan
    Zhao, Wen-Bing
    Fu, Li-Hua
    Du, Jin-Lian
    Su, Hang
    Biomedical Signal Processing and Control, 2021, 63
  • [6] Ensemble of kernel extreme learning machine based random forest classifiers for automatic heartbeat classification
    Yang, Ping
    Wang, Dan
    Zhao, Wen-Bing
    Fu, Li-Hua
    Du, Jin-Lian
    Su, Hang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 63
  • [7] Automatic Classification for Vulnerability Based on Machine Learning
    Shuai, Bo
    Li, Haifeng
    Li, Mengjun
    Zhang, Quan
    Tang, Chaojing
    2013 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2013, : 312 - 318
  • [8] A machine learning software tool for multiclass classification
    Wang, Shangzhou
    Lu, Haohui
    Khan, Arif
    Hajati, Farshid
    Khushi, Matloob
    Uddin, Shahadat
    SOFTWARE IMPACTS, 2022, 13
  • [9] Extreme Learning Machine for Regression and Multiclass Classification
    Huang, Guang-Bin
    Zhou, Hongming
    Ding, Xiaojian
    Zhang, Rui
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (02): : 513 - 529
  • [10] Explainable Ensemble Machine Learning Method for Credit Risk Classification
    Ben Ghozzi, Sirine
    Ben HajKacem, Mohamed Aymen
    Essoussi, Nadia
    2024 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS, INISTA, 2024,