Genetic Programming with Embedded Feature Construction for High-Dimensional Symbolic Regression

被引:7
|
作者
Chen, Qi [1 ]
Zhang, Mengjie [1 ]
Xue, Bing [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand
关键词
Genetic programming; Symbolic regression; Feature construction; Generalisation; VARIABLE SELECTION; CLASSIFIERS;
D O I
10.1007/978-3-319-49049-6_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature construction is an effective way to eliminate the limitation of poor data representation in many tasks such as high-dimensional symbolic regression. Genetic Programming (GP) is a good choice for feature construction for its natural ability to explore the feature space to detect and combine important features. However, there is very little contribution devoted to enhance the generalisation performance of GP for high-dimensional symbolic regression by feature construction. This work aims to develop a new feature construction method namely genetic programming with embedded feature construction (GPEFC) for high-dimensional symbolic regression. GPEFC keeps track of new small informative building blocks on best fitness gain individuals and constructs new features using these building blocks. The new constructed features augment the Terminal Set of GP dynamically. A series of experiments were conducted to investigate the learning ability and generalisation performance of GPEFC. The results show that GPEFC can evolve more compact models in an efficient way, has better learning ability and better generalisation performance than standard GP.
引用
收藏
页码:87 / 102
页数:16
相关论文
共 50 条
  • [41] On inference in high-dimensional regression
    Battey, Heather S.
    Reid, Nancy
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2023, 85 (01) : 149 - 175
  • [42] Multiple Feature Construction in Classification on High-Dimensional Data Using GP
    Binh Tran
    Zhang, Mengjie
    Xue, Bing
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [43] Racing Control Variable Genetic Programming for Symbolic Regression
    Jiang, Nan
    Xue, Yexiang
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12901 - 12909
  • [44] Genetic Programming for Instance Transfer Learning in Symbolic Regression
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (01) : 25 - 38
  • [45] Sensitivity analysis of genetic programming: A case of symbolic regression
    Chen, SH
    Kuo, TW
    Shieh, YP
    [J]. PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2002, : 1119 - 1122
  • [46] Bingo: A Customizable Framework for Symbolic Regression with Genetic Programming
    Randall, David L.
    Townsend, Tyler S.
    Hochhalter, Jacob D.
    Bomarito, Geoffrey F.
    [J]. PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 2282 - 2288
  • [47] Symbolic Regression via Control Variable Genetic Programming
    Jiang, Nan
    Xue, Yexiang
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 178 - 195
  • [48] Population diversity and inheritance in genetic programming for symbolic regression
    Burlacu, Bogdan
    Yang, Kaifeng
    Affenzeller, Michael
    [J]. NATURAL COMPUTING, 2024, 23 (03) : 531 - 566
  • [49] Investigation of Linear Genetic Programming Techniques for Symbolic Regression
    Dal Piccol Sotto, Leo Francoso
    de Melo, Vinicius Veloso
    [J]. 2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 146 - 151
  • [50] Cartesian Genetic Programming with Module Mutation for Symbolic Regression
    Kushida, Jun-ichi
    Hara, Akira
    Takahama, Tetsuyuki
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 159 - 164