SR-Forest: A Genetic Programming-Based Heterogeneous Ensemble Learning Method

被引:1
|
作者
Zhang, Hengzhe [1 ]
Zhou, Aimin [2 ,3 ]
Chen, Qi [1 ]
Xue, Bing [1 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6140, New Zealand
[2] East China Normal Univ, Shanghai Inst AI Educ, Shanghai 200062, Peoples R China
[3] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China
关键词
Data models; Ensemble learning; Predictive models; Computational modeling; Task analysis; Boosting; Random forests; Evolutionary feature construction; evolutionary forest (EF); genetic programming (GP); random forest (RF); MULTIPLE-FEATURE CONSTRUCTION; SYMBOLIC REGRESSION; FEATURE-SELECTION; ALGORITHM;
D O I
10.1109/TEVC.2023.3243172
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble learning methods have been widely used in machine learning in recent years due to their high-predictive performance. With the development of genetic programming-based symbolic regression (GPSR) methods, many papers begin to choose a popular ensemble learning method, random forests (RFs), as the baseline competitor. Instead of considering them as competitors, an alternative idea might be to consider symbolic regression (SR) as an enhancement technique for RF. GPSR methods which fit a smooth function are complementary to the piecewise nature of decision trees (DTs), as the smooth variation is common in regression problems. In this article, we propose to form an ensemble model with SR-based DTs to address this issue. Furthermore, we design a guided mutation operator to speed up the search on high-dimensional problems, a multifidelity evaluation strategy to reduce the computational cost, and an ensemble selection mechanism to improve predictive performance. Finally, experimental results on a regression benchmark with 120 datasets show that the proposed ensemble model outperforms 25 existing SR and ensemble learning methods. Moreover, the proposed method can provide notable insights on an XGBoost hyperparameter performance prediction task, which is an important application area of ensemble learning methods.
引用
收藏
页码:1484 / 1498
页数:15
相关论文
共 50 条
  • [1] Genetic Programming-based Evolutionary Feature Construction for Heterogeneous Ensemble Learning [Hot of the Press]
    Zhang, Hengzhe
    Zhou, Aimin
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    [J]. PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023, : 49 - 50
  • [2] A Multiobjective Genetic Programming-Based Ensemble for Simultaneous Feature Selection and Classification
    Nag, Kaustuv
    Pal, Nikhil R.
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (02) : 499 - 510
  • [3] Label Distribution Learning Method Based on Deep Forest and Heterogeneous Ensemble
    Wang, Yi-Fei
    Zhu, Ji-Hua
    Liu, Xin-Yuan
    Zhou, Yi-Yang
    [J]. Ruan Jian Xue Bao/Journal of Software, 2024, 35 (07): : 3410 - 3427
  • [4] Genetic programming-based feature learning for question answering
    Khodadi, Iman
    Abadeh, Mohammad Saniee
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2016, 52 (02) : 340 - 357
  • [5] MAP-Elites for Genetic Programming-Based Ensemble Learning: An Interactive Approach [AI-eXplained]
    Zhang, Hengzhe
    Chen, Qi
    Xue, Bing
    Banzhaf, Wolfgang
    Zhang, Mengjie
    [J]. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2023, 18 (04) : 62 - 63
  • [6] On the Detection of Community Smells Using Genetic Programming-based Ensemble Classifier Chain
    Almarimi, Nuri
    Ouni, Ali
    Chouchen, Moataz
    Saidani, Islem
    Mkaouer, Mohamed Wiem
    [J]. 2020 ACM/IEEE 15TH INTERNATIONAL CONFERENCE ON GLOBAL SOFTWARE ENGINEERING, ICGSE, 2020, : 43 - 54
  • [7] Genetic Programming-Based Feature Learning for Facial Expression Classification
    Bi, Ying
    Xue, Bing
    Zhang, Mengjie
    [J]. 2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [8] A Genetic Programming-Based Imputation Method for Classification with Missing Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    [J]. GENETIC PROGRAMMING, EUROGP 2016, 2016, 9594 : 149 - 163
  • [9] Genetic programming-based controller design
    Sekaj, I.
    Perkacz, J.
    [J]. 2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, : 1339 - 1343
  • [10] A genetic programming-based classifier system
    Ahluwalia, M
    Bull, L
    [J]. GECCO-99: PROCEEDINGS OF THE GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 1999, : 11 - 18