A Genetic Algorithm-based sequential instance selection framework for ensemble learning

被引:7
|
作者
Xu, Che [1 ]
Zhang, Shuwen [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Econ & Management, Nanjing 210094, Jiangsu, Peoples R China
[2] iFLYTEK, Hefei, Anhui, Peoples R China
关键词
Ensemble learning; Genetic algorithm; Evolutionary instance selection; Diversity learning; Classification; CLASSIFIER ENSEMBLES; ROTATION FOREST; DIVERSITY; MACHINES; IMPROVE; MODEL;
D O I
10.1016/j.eswa.2023.121269
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The accumulation of large amounts of historical data has led to the wide application of ensemble learning over the past few decades, but the balance between the individual accuracy of base classifiers (BCs) and the diversity among these BCs is rarely considered in the construction of ensemble models. Since such a balance is crucial to the success of ensemble models, this paper proposes a Genetic Algorithm-based sequential instance selection framework to address this research gap. The novelties of the proposed framework include: transforming the balance between the individual accuracy of BCs and the diversity among BCs into a general combinatorial optimization model and designing a Genetic Algorithm-based evolutionary instance selection method to solve this model. The proposed framework not only overcomes the inherent limitations of the Genetic Algorithm in some high-dimensional tasks but also provides an explicit and automatic way to balance the accuracy and diversity by searching appropriate training data subsets for different component BCs. Based on obtained training data subsets, the component BCs of the ensemble model are generated sequentially, and their predictions are further combined with the weighted majority voting rule. Using 30 real datasets collected from various practical applications, such as medicine, business, and industry, the effectiveness of the proposed framework in constructing powerful ensemble models is examined and compared with six benchmark ensemble learning methods. In addition, the capability of the proposed framework to improve convergence performances is also examined by the comparison with the traditional Genetic Algorithm.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Increasing Speed of Genetic Algorithm-based Instance Selection
    Kordos, Miroslaw
    Klos-Witkowska, Aleksandra
    [J]. PROCEEDINGS OF THE 2017 9TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS (IDAACS), VOL 2, 2017, : 590 - 595
  • [2] A Genetic Algorithm-Based Ensemble Learning Framework for Drug Combination Prediction
    Wu, Lianlian
    Ye, Xiaona
    Zhang, Yixin
    Gao, Jie
    Lin, Zhikai
    Sui, Binsheng
    Wen, Yuqi
    Wu, Qingqiang
    Liu, Kunhong
    He, Song
    Bo, Xiaochen
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (12) : 3941 - 3954
  • [3] Fuzzy clustering decomposition of genetic algorithm-based instance selection for regression problems
    Kordos, Miroslaw
    Blachnik, Marcin
    Scherer, Rafal
    [J]. INFORMATION SCIENCES, 2022, 587 : 23 - 40
  • [4] A genetic algorithm-based framework for wavelength selection on sample categorization
    Anzanello, Michel J.
    Yamashita, Gabrielli
    Marcelo, Marcelo
    Fogliatto, Flavio S.
    Ortiz, Rafael S.
    Mariotti, Kristiane
    Ferrao, Marco F.
    [J]. DRUG TESTING AND ANALYSIS, 2017, 9 (08) : 1172 - 1181
  • [5] Deep learning and genetic algorithm-based ensemble model for feature selection and classification of breast ultrasound images
    Dar, Mohsin Furkh
    Ganivada, Avatharam
    [J]. Image and Vision Computing, 2024, 146
  • [6] Deep learning and genetic algorithm-based ensemble model for feature selection and classification of breast ultrasound images
    Dar, Mohsin Furkh
    Ganivada, Avatharam
    [J]. IMAGE AND VISION COMPUTING, 2024, 146
  • [7] Nested genetic algorithm-based classifier selection and placement in multi-level ensemble framework for effective disease diagnosis
    Arukonda, Srinivas
    Cheruku, Ramalingaswamy
    [J]. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2023,
  • [8] A genetic algorithm-based, hybrid machine learning approach to model selection
    Bies, RR
    Muldoon, MF
    Pollock, BG
    Manuck, S
    Smith, G
    Sale, ME
    [J]. JOURNAL OF PHARMACOKINETICS AND PHARMACODYNAMICS, 2006, 33 (02) : 195 - 221
  • [9] A Genetic Algorithm-Based, Hybrid Machine Learning Approach to Model Selection
    Robert R. Bies
    Matthew F. Muldoon
    Bruce G. Pollock
    Steven Manuck
    Gwenn Smith
    Mark E. Sale
    [J]. Journal of Pharmacokinetics and Pharmacodynamics, 2006, 33 : 195 - 221
  • [10] Democratic instance selection: A linear complexity instance selection algorithm based on classifier ensemble concepts
    Garcia-Osorio, Cesar
    de Haro-Garcia, Aida
    Garcia-Pedrajas, Nicolas
    [J]. ARTIFICIAL INTELLIGENCE, 2010, 174 (5-6) : 410 - 441