Ensemble learning method for classification: Integrating data envelopment analysis with machine learning

被引:0
|
作者
An, Qingxian [1 ,2 ]
Huang, Siwei [1 ]
Han, Yuxuan [1 ]
Zhu, You [3 ,4 ]
机构
[1] Cent South Univ, Sch Business, Changsha 410083, Peoples R China
[2] Hefei Univ Technol, Sch Econ, Hefei 230601, Peoples R China
[3] Hunan Univ, Business Sch, Changsha 410082, Peoples R China
[4] Hunan Prov Key Lab Philosophy & Social Sci Ind Dig, Changsha 410082, Peoples R China
基金
中国国家自然科学基金;
关键词
Ensemble learning; Data envelopment analysis; Classifier; Large dataset; STATISTICAL COMPARISONS; CLASSIFIERS; EFFICIENCY; DEA;
D O I
10.1016/j.cor.2024.106739
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In classification tasks with large sample sets, the use of a single classifier carries the risk of overfitting. To overcome this issue, an ensemble of classifier models has often been shown to outperform the use of a single "best" model. Given the rich variety of classifier models available, the selection of the high-efficiency classifiers for a given task dataset remains an urgent challenge. However, most of the previous classifier selection methods only focus on the measurement of classification output performance without considering the computational cost. This paper proposes a new ensemble learning method to improve the classification quality for big datasets by using data envelopment analysis. It contains the following two stages: classifier selection and classifier combination. In the first stage, the commonly used classifiers are evaluated on the basis of their performance on resource consumption and classification output performance using the range directional model (RDM); then, the most efficient classifiers are selected. In the second stage, the classifier confusion matrix is evaluated using the data envelopment analysis (DEA) cross-efficiency model. Then, the weight for the classifier combination is determined to ensure that classifiers with higher performance have greater weights based on the cross-efficiency values. Experimental results demonstrate the superiority of the cross-efficiency model over the BCC model and the benchmark voting method in model ensemble. Furthermore, our method has been shown to save more computational resources and yields better results than existing methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Classification with Extreme Learning Machine and Ensemble Algorithms Over Randomly Partitioned Data
    Catak, Ferhat Ozgur
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 228 - 231
  • [22] Ensemble dropout extreme learning machine via fuzzy integral for data classification
    Zhai, Junhai
    Zang, Liguang
    Zhou, Zhaoyi
    NEUROCOMPUTING, 2018, 275 : 1043 - 1052
  • [23] Ensemble online sequential extreme learning machine for large data set classification
    Zhai, Junhai
    Wang, Jinggeng
    Wang, Xizhao
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 2250 - 2255
  • [24] Dissimilarity based ensemble of extreme learning machine for gene expression data classification
    Lu, Hui-juan
    An, Chun-lin
    Zheng, En-hui
    Lu, Yi
    NEUROCOMPUTING, 2014, 128 : 22 - 30
  • [25] Enhancing Question Pairs Identification with Ensemble Learning: Integrating Machine Learning and Deep Learning Models
    Tarek, Salsabil
    Noaman, Hatem M.
    Kayed, Mohammed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 981 - 992
  • [26] Boosted Stacking Ensemble Machine Learning Method for Wafer Map Pattern Classification
    Choi, Jeonghoon
    Suh, Dongjun
    Otto, Marc-Oliver
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02): : 2945 - 2966
  • [27] Study on a confidence machine learning method based on ensemble learning
    Jiang, Fang Chun
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (04): : 3357 - 3368
  • [28] Study on a confidence machine learning method based on ensemble learning
    Fang Chun Jiang
    Cluster Computing, 2017, 20 : 3357 - 3368
  • [29] A Novel Ensemble Bagging Classification Method for Breast Cancer Classification Using Machine Learning Techniques
    Ponnaganti, Naga Deepti
    Anitha, Raju
    TRAITEMENT DU SIGNAL, 2022, 39 (01) : 229 - 237
  • [30] Medical and Health Data Classification Method Based on Machine Learning
    Zeng, Yu
    Cheng, Fuchao
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021