On genetic programming representations and fitness functions for interpretable dimensionality reduction

被引:4
|
作者
Uriot, Thomas [1 ]
Virgolin, Marco [2 ]
Alderliesten, Tanja [1 ]
Bosman, Peter A. N. [2 ]
机构
[1] Leiden Univ, Med Ctr, Leiden, Netherlands
[2] Ctr Wiskunde & Informat, Amsterdam, Netherlands
基金
荷兰研究理事会;
关键词
Dimensionality reduction; genetic programming; interpretability; unsupervised learning;
D O I
10.1145/3512290.3528849
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Dimensionality reduction (DR) is an important technique for data exploration and knowledge discovery. However, most of the main DR methods are either linear (e.g., PCA), do not provide an explicit mapping between the original data and its lower-dimensional representation (e.g., MDS, t-SNE, isomap), or produce mappings that cannot be easily interpreted (e.g., kernel PCA, neural-based autoencoder). Recently, genetic programming (GP) has been used to evolve interpretable DR mappings in the form of symbolic expressions. There exists a number of ways in which GP can be used to this end and no study exists that performs a comparison. In this paper, we fill this gap by comparing existing GP methods as well as devising new ones. We evaluate our methods on several benchmark datasets based on predictive accuracy and on how well the original features can be reconstructed using the lower-dimensional representation only. Finally, we qualitatively assess the resulting expressions and their complexity. We find that various GP methods can be competitive with state-of-the-art DR algorithms and that they have the potential to produce interpretable DR mappings.
引用
收藏
页码:458 / 466
页数:9
相关论文
共 50 条
  • [21] Graph representations in genetic programming
    Léo Françoso Dal Piccol Sotto
    Paul Kaufmann
    Timothy Atkinson
    Roman Kalkreuth
    Márcio Porto Basgalupp
    Genetic Programming and Evolvable Machines, 2021, 22 : 607 - 636
  • [22] Graph representations in genetic programming
    Sotto, Leo Francoso Dal Piccol
    Kaufmann, Paul
    Atkinson, Timothy
    Kalkreuth, Roman
    Basgalupp, Marcio Porto
    GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2021, 22 (04) : 607 - 636
  • [23] Interpretable policies for reinforcement learning by genetic programming
    Hein, Daniel
    Udluft, Steffen
    Runkler, Thomas A.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 76 : 158 - 169
  • [24] A TECHNIQUE FOR THE REDUCTION OF DIMENSIONALITY IN DYNAMIC-PROGRAMMING
    BELLMAN, R
    ROOSTA, M
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1982, 88 (02) : 543 - 546
  • [25] Genetic Programming for Modeling Vibratory Finishing Process: Role of Experimental Designs and Fitness Functions
    Garg, Akhil
    Tai, Kang
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, PT II (SEMCCO 2013), 2013, 8298 : 23 - 31
  • [26] The effects of fitness functions on genetic programming-based ranking discovery for web search
    Fan, WG
    Fox, EA
    Pathak, P
    Wu, H
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (07): : 628 - 636
  • [27] Unbalanced breast cancer data classification using novel fitness functions in genetic programming
    Devarriya, Divyaansh
    Gulati, Cairo
    Mansharamani, Vidhi
    Sakalle, Aditi
    Bhardwaj, Arpit
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
  • [28] Using Gaussian distribution to construct fitness functions in genetic programming for multiclass object classification
    Zhang, Mengjie
    Smart, Will
    PATTERN RECOGNITION LETTERS, 2006, 27 (11) : 1266 - 1274
  • [29] New Fitness Functions in Genetic Programming for Classification with High-dimensional Unbalanced Data
    Pei, Wenbin
    Xue, Bing
    Shang, Lin
    Zhang, Mengjie
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 2779 - 2786
  • [30] Interpretable linear dimensionality reduction based on bias-variance analysis
    Bonetti, Paolo
    Metelli, Alberto Maria
    Restelli, Marcello
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (04) : 1713 - 1781