A Machine Learning Based Ensemble Method for Automatic Multiclass Classification of Decisions

Cited by: 6
Authors
Fu, Liming [1]
Liang, Peng [1]
Li, Xueying [1]
Yang, Chen [2]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] IBO Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
Decision; Automatic Classification; Ensemble Classifier; Software Development; Hibernate;
DOI
10.1145/3463274.3463325
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
Stakeholders make various types of decisions about requirements, design, management, and so on during the software development life cycle. However, these decisions are typically not well documented or classified because of limited human resources, time, and budget. Automatic approaches offer a promising way to address this problem. In this paper, we aimed to automatically classify decisions into five types to help stakeholders better document and understand them. First, we collected a dataset from the Hibernate developer mailing list. We then evaluated 270 configurations of feature selection, feature extraction techniques, and machine learning classifiers to find the best configuration for classifying decisions. In particular, we applied an ensemble learning method and constructed ensemble classifiers to compare their performance with that of base classifiers. Our experimental results show that (1) feature selection can improve the classification results; (2) ensemble classifiers can outperform base classifiers, provided that the ensemble classifiers are well constructed; and (3) Bag-of-Words (BoW) with 50% of the features retained by feature selection, combined with an ensemble classifier built from Naive Bayes (NB), Logistic Regression (LR), and Support Vector Machine (SVM), achieves the best classification result among all the configurations (a weighted precision of 0.750, a weighted recall of 0.739, and a weighted F1-score of 0.727). Our work can benefit various types of stakeholders in software development by providing an automatic approach for effectively classifying decisions into specific types that are relevant to their interests.
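The abstract reports an ensemble of three base classifiers scored with support-weighted precision, recall, and F1. The following is a minimal pure-Python sketch of those two ideas, not the authors' implementation. It assumes hard (majority) voting, which the abstract does not specify. The labels `A`, `B`, `C`, the per-classifier predictions, and the function names `majority_vote` and `weighted_scores` are all invented for illustration.

```python
from collections import Counter


def majority_vote(predictions_per_classifier):
    """Hard voting (an assumption; the abstract does not name the scheme).

    For each sample, return the label chosen by most base classifiers.
    On a tie, the label that appears first among the votes wins.
    """
    voted = []
    for votes in zip(*predictions_per_classifier):
        voted.append(Counter(votes).most_common(1)[0][0])
    return voted


def weighted_scores(y_true, y_pred):
    """Support-weighted precision, recall, and F1.

    Each per-class score is weighted by that class's share of the
    true labels, then the weighted scores are summed.
    """
    support = Counter(y_true)
    total = len(y_true)
    precision = recall = f1 = 0.0
    for label, n in support.items():
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        predicted = sum(1 for p in y_pred if p == label)
        p_c = tp / predicted if predicted else 0.0
        r_c = tp / n
        f_c = 2 * p_c * r_c / (p_c + r_c) if (p_c + r_c) else 0.0
        weight = n / total
        precision += weight * p_c
        recall += weight * r_c
        f1 += weight * f_c
    return precision, recall, f1


# Toy data (hypothetical): true decision types and the predictions
# of three stand-in base classifiers for eight samples.
y_true = ["A", "A", "A", "A", "B", "B", "C", "C"]
nb_pred = ["A", "A", "A", "B", "B", "B", "C", "A"]
lr_pred = ["A", "A", "B", "B", "B", "A", "C", "C"]
svm_pred = ["A", "A", "A", "A", "B", "B", "A", "A"]

ensemble_pred = majority_vote([nb_pred, lr_pred, svm_pred])
precision, recall, f1 = weighted_scores(y_true, ensemble_pred)
print(ensemble_pred)
print(round(precision, 3), round(recall, 3), round(f1, 3))
```

On this toy data the weighted F1-score falls below both the weighted precision and the weighted recall, the same pattern as the reported 0.750, 0.739, and 0.727. This can happen because weighted F1 averages the per-class F1 values rather than taking the harmonic mean of the two weighted totals.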
Pages: 40-49
Number of pages: 10