Genetic Programming-based Construction of Features for Machine Learning and Knowledge Discovery Tasks

被引:108
|
作者
Krzysztof Krawiec
机构
[1] Poznań University of Technology,Institute of Computing Science
关键词
genetic programming; machine learning; change of representation; feature construction; feature selection;
D O I
10.1023/A:1020984725014
中图分类号
学科分类号
摘要
In this paper we use genetic programming for changing the representation of the input data for machine learners. In particular, the topic of interest here is feature construction in the learning-from-examples paradigm, where new features are built based on the original set of attributes. The paper first introduces the general framework for GP-based feature construction. Then, an extended approach is proposed where the useful components of representation (features) are preserved during an evolutionary run, as opposed to the standard approach where valuable features are often lost during search. Finally, we present and discuss the results of an extensive computational experiment carried out on several reference data sets. The outcomes show that classifiers induced using the representation enriched by the GP-constructed features provide better accuracy of classification on the test set. In particular, the extended approach proposed in the paper proved to be able to outperform the standard approach on some benchmark problems on a statistically significant level.
引用
收藏
页码:329 / 343
页数:14
相关论文
共 50 条
  • [1] Genetic Programming-Based Machine Degradation Modeling Methodology
    Yan, Tongtong
    Wang, Dong
    [J]. IEEE Open Journal of Instrumentation and Measurement, 2022, 1
  • [2] Integration of Programming-based Tasks into Mathematical Problem-based Learning
    Cui, Zhihao
    Ng, Oi-Lam
    Jong, Morris S. Y.
    [J]. 29TH INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION (ICCE 2021), VOL I, 2021, : 185 - 187
  • [3] Genetic Programming-based Evolutionary Feature Construction for Heterogeneous Ensemble Learning [Hot of the Press]
    Zhang, Hengzhe
    Zhou, Aimin
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    [J]. PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023, : 49 - 50
  • [4] FuGePSD: Fuzzy Genetic Programming-based algorithm for Subgroup Discovery
    Carmona, C. J.
    Gonzalez, P.
    del Jesus, M. J.
    [J]. PROCEEDINGS OF THE 2015 CONFERENCE OF THE INTERNATIONAL FUZZY SYSTEMS ASSOCIATION AND THE EUROPEAN SOCIETY FOR FUZZY LOGIC AND TECHNOLOGY, 2015, 89 : 447 - 454
  • [5] Genetic programming-based feature learning for question answering
    Khodadi, Iman
    Abadeh, Mohammad Saniee
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2016, 52 (02) : 340 - 357
  • [6] Quality of Service Timeseries Forecasting for Web Services: A Machine Learning, Genetic Programming-Based Approach
    Yang Syu
    Yong-Yi Fanjiang
    Jong-Yih Kuo
    Jui-Lung Su
    [J]. 2016 ANNUAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (CISS), 2016,
  • [7] Genetic programming-based discovery of ranking functions for effective Web search
    Fan, WG
    Gordon, MD
    Pathak, P
    [J]. JOURNAL OF MANAGEMENT INFORMATION SYSTEMS, 2005, 21 (04) : 37 - 56
  • [8] Genetic Programming-Based Feature Learning for Facial Expression Classification
    Bi, Ying
    Xue, Bing
    Zhang, Mengjie
    [J]. 2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [9] The effects of fitness functions on genetic programming-based ranking discovery for web search
    Fan, WG
    Fox, EA
    Pathak, P
    Wu, H
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (07): : 628 - 636
  • [10] Genetic programming-based controller design
    Sekaj, I.
    Perkacz, J.
    [J]. 2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, : 1339 - 1343