Genetic Programming with Embedded Feature Construction for High-Dimensional Symbolic Regression

被引:7
|
作者
Chen, Qi [1 ]
Zhang, Mengjie [1 ]
Xue, Bing [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand
关键词
Genetic programming; Symbolic regression; Feature construction; Generalisation; VARIABLE SELECTION; CLASSIFIERS;
D O I
10.1007/978-3-319-49049-6_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature construction is an effective way to eliminate the limitation of poor data representation in many tasks such as high-dimensional symbolic regression. Genetic Programming (GP) is a good choice for feature construction for its natural ability to explore the feature space to detect and combine important features. However, there is very little contribution devoted to enhance the generalisation performance of GP for high-dimensional symbolic regression by feature construction. This work aims to develop a new feature construction method namely genetic programming with embedded feature construction (GPEFC) for high-dimensional symbolic regression. GPEFC keeps track of new small informative building blocks on best fitness gain individuals and constructs new features using these building blocks. The new constructed features augment the Terminal Set of GP dynamically. A series of experiments were conducted to investigate the learning ability and generalisation performance of GPEFC. The results show that GPEFC can evolve more compact models in an efficient way, has better learning ability and better generalisation performance than standard GP.
引用
收藏
页码:87 / 102
页数:16
相关论文
共 50 条
  • [1] Feature Selection to Improve Generalization of Genetic Programming for High-Dimensional Symbolic Regression
    Chen, Qi
    Zhang, Mengjie
    Xue, Bing
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2017, 21 (05) : 792 - 806
  • [2] Improving Generalisation of Genetic Programming for High-Dimensional Symbolic Regression with Feature Selection
    Chen, Qi
    Xue, Bing
    Niu, Ben
    Zhang, Mengjie
    [J]. 2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 3793 - 3800
  • [3] Genetic Programming for Feature Selection Based on Feature Removal Impact in High-Dimensional Symbolic Regression
    Al-Helali, Baligh
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03): : 2269 - 2282
  • [4] Genetic programming for multiple-feature construction on high-dimensional classification
    Binh Tran
    Xue, Bing
    Zhang, Mengjie
    [J]. PATTERN RECOGNITION, 2019, 93 : 404 - 417
  • [5] Genetic programming for feature construction and selection in classification on high-dimensional data
    Binh Tran
    Bing Xue
    Mengjie Zhang
    [J]. Memetic Computing, 2016, 8 : 3 - 15
  • [6] Genetic programming for feature construction and selection in classification on high-dimensional data
    Binh Tran
    Xue, Bing
    Zhang, Mengjie
    [J]. MEMETIC COMPUTING, 2016, 8 (01) : 3 - 15
  • [7] Genetic Programming for Imputation Predictor Selection and Ranking in Symbolic Regression with High-Dimensional Incomplete Data
    Al-Helali, Baligh
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    [J]. AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 523 - 535
  • [8] A Comparative Analysis of Dimensionality Reduction Methods for Genetic Programming to Solve High-Dimensional Symbolic Regression Problems
    Zhong, Lianjie
    Zhong, Jinghui
    Lu, Chengyu
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 476 - 483
  • [9] LATENT VARIABLE SYMBOLIC REGRESSION FOR HIGH-DIMENSIONAL INPUTS
    McConaghy, Trent
    [J]. GENETIC PROGRAMMING THEORY AND PRACTICE VII, 2010, : 103 - 118
  • [10] Multi Hive Artificial Bee Colony Programming for high dimensional symbolic regression with feature selection
    Arslan, Sibel
    Ozturk, Celal
    [J]. APPLIED SOFT COMPUTING, 2019, 78 : 515 - 527