Ensemble learning method for classification: Integrating data envelopment analysis with machine learning

被引：0

作者：

An, Qingxian ^{[1
,2
]}

Huang, Siwei ^{[1
]}

Han, Yuxuan ^{[1
]}

Zhu, You ^{[3
,4
]}

机构：

[1] Cent South Univ, Sch Business, Changsha 410083, Peoples R China

[2] Hefei Univ Technol, Sch Econ, Hefei 230601, Peoples R China

[3] Hunan Univ, Business Sch, Changsha 410082, Peoples R China

[4] Hunan Prov Key Lab Philosophy & Social Sci Ind Dig, Changsha 410082, Peoples R China

来源：

COMPUTERS & OPERATIONS RESEARCH | 2024年 / 169卷

基金：

中国国家自然科学基金;

关键词：

Ensemble learning; Data envelopment analysis; Classifier; Large dataset; STATISTICAL COMPARISONS; CLASSIFIERS; EFFICIENCY; DEA;

D O I：

10.1016/j.cor.2024.106739

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In classification tasks with large sample sets, the use of a single classifier carries the risk of overfitting. To overcome this issue, an ensemble of classifier models has often been shown to outperform the use of a single "best" model. Given the rich variety of classifier models available, the selection of the high-efficiency classifiers for a given task dataset remains an urgent challenge. However, most of the previous classifier selection methods only focus on the measurement of classification output performance without considering the computational cost. This paper proposes a new ensemble learning method to improve the classification quality for big datasets by using data envelopment analysis. It contains the following two stages: classifier selection and classifier combination. In the first stage, the commonly used classifiers are evaluated on the basis of their performance on resource consumption and classification output performance using the range directional model (RDM); then, the most efficient classifiers are selected. In the second stage, the classifier confusion matrix is evaluated using the data envelopment analysis (DEA) cross-efficiency model. Then, the weight for the classifier combination is determined to ensure that classifiers with higher performance have greater weights based on the cross-efficiency values. Experimental results demonstrate the superiority of the cross-efficiency model over the BCC model and the benchmark voting method in model ensemble. Furthermore, our method has been shown to save more computational resources and yields better results than existing methods.

引用

页数：17

共 50 条

[41] Classification of Stroke Victims through Supervised Machine Learning Algorithms and Ensemble Learning
Hensley, Dalton
Elgazzar, Heba
2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 58 - 64
[42] Ensemble of deep learning and machine learning approach for classification of handwritten Hindi numerals
Rajpal D.
Garg A.R.
Journal of Engineering and Applied Science, 2023, 70 (01):
[43] Ensemble weighted extreme learning machine for imbalanced data classification based on differential evolution
Zhang, Yong
Liu, Bo
Cai, Jing
Zhang, Suhua
NEURAL COMPUTING & APPLICATIONS, 2017, 28 : S259 - S267
[44] Ensemble weighted extreme learning machine for imbalanced data classification based on differential evolution
Yong Zhang
Bo Liu
Jing Cai
Suhua Zhang
Neural Computing and Applications, 2017, 28 : 259 - 267
[45] On Machine Learning Classification of Otoneurological Data
Juhola, Martti
EHEALTH BEYOND THE HORIZON - GET IT THERE, 2008, 136 : 211 - 216
[46] Ensemble learning of deep learning and traditional machine learning approaches for skin lesion segmentation and classification
Khan, Adil H.
Iskandar, Dayang NurFatimah Awang
Al-Asad, Jawad F.
Mewada, Hiren
Sherazi, Muhammad Abid
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (13):
[47] Integrating Statistical and Machine Learning Approaches for Neural Classification
Sarmashghi, Mehrad
Jadhav, Shantanu P.
Eden, Uri T.
IEEE ACCESS, 2022, 10 : 119106 - 119118
[48] Galaxy classification: A machine learning analysis of GAMA catalogue data
Nolte, Aleke
Wang, Lingyu
Bilicki, Maciej
Holwerda, Benne
Biehl, Michael
NEUROCOMPUTING, 2019, 342 : 172 - 190
[49] An Experimental Analysis of Machine Learning Classification Algorithms on Biomedical Data
Das, Himansu
Naik, Bighnaraj
Behera, H. S.
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, DEVICES AND COMPUTING, 2020, 602 : 525 - 539
[50] A machine learning approach for vocal fold segmentation and disorder classification based on ensemble method
Nobel, S. M. Nuruzzaman
Swapno, S. M. Masfequier Rahman
Islam, Md. Rajibul
Safran, Mejdl
Alfarhood, Sultan
Mridha, M. F.
SCIENTIFIC REPORTS, 2024, 14 (01):

← 1 2 3 4 5 →