On genetic programming representations and fitness functions for interpretable dimensionality reduction

Cited by: 3
Authors
Uriot, Thomas [1]
Virgolin, Marco [2]
Alderliesten, Tanja [1]
Bosman, Peter A. N. [2]
Affiliations
[1] Leiden Univ, Med Ctr, Leiden, Netherlands
[2] Ctr Wiskunde & Informat, Amsterdam, Netherlands
Funding
Dutch Research Council
Keywords
Dimensionality reduction; genetic programming; interpretability; unsupervised learning
DOI
10.1145/3512290.3528849
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Dimensionality reduction (DR) is an important technique for data exploration and knowledge discovery. However, most of the main DR methods are either linear (e.g., PCA), do not provide an explicit mapping between the original data and its lower-dimensional representation (e.g., MDS, t-SNE, Isomap), or produce mappings that cannot be easily interpreted (e.g., kernel PCA, neural autoencoders). Recently, genetic programming (GP) has been used to evolve interpretable DR mappings in the form of symbolic expressions. GP can be used to this end in a number of ways, but no study has yet compared them. In this paper, we fill this gap by comparing existing GP methods and devising new ones. We evaluate our methods on several benchmark datasets, based on predictive accuracy and on how well the original features can be reconstructed from the lower-dimensional representation alone. Finally, we qualitatively assess the resulting expressions and their complexity. We find that various GP methods can be competitive with state-of-the-art DR algorithms and that they have the potential to produce interpretable DR mappings.
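To make the reconstruction criterion mentioned in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It treats a DR mapping as an explicit symbolic expression (here a hypothetical, hand-written `example_mapping` standing in for an evolved one) and scores it by how well each original feature can be recovered from the one-dimensional representation. The decoder is assumed to be a per-feature ordinary least-squares line, chosen only for simplicity; the names `fit_line` and `reconstruction_mse` are illustrative.

```python
def example_mapping(x):
    """Hypothetical symbolic DR mapping from 3 features to 1 dimension."""
    return x[0] + 0.5 * x[1] * x[2]


def fit_line(z, y):
    """Ordinary least squares for y ~ a * z + b with a single predictor z."""
    n = len(z)
    mean_z = sum(z) / n
    mean_y = sum(y) / n
    var_z = sum((zi - mean_z) ** 2 for zi in z)
    if var_z == 0.0:
        # A constant representation carries no information: predict the mean.
        return 0.0, mean_y
    cov = sum((zi - mean_z) * (yi - mean_y) for zi, yi in zip(z, y))
    a = cov / var_z
    return a, mean_y - a * mean_z


def reconstruction_mse(data, mapping):
    """Mean squared error of reconstructing every original feature from
    the 1-D representation, using one linear decoder per feature
    (an assumption of this sketch)."""
    z = [mapping(x) for x in data]
    n_features = len(data[0])
    total = 0.0
    for j in range(n_features):
        y = [x[j] for x in data]
        a, b = fit_line(z, y)
        total += sum((a * zi + b - yi) ** 2 for zi, yi in zip(z, y)) / len(y)
    return total / n_features


if __name__ == "__main__":
    # Small deterministic toy dataset with 3 features per sample.
    samples = [(float(t), float(t % 3), float(t % 2)) for t in range(12)]
    print(reconstruction_mse(samples, example_mapping))
```

A lower score means the low-dimensional representation retains more of the information needed to recover the original features; a mapping that outputs a constant scores the average feature variance.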
Pages: 458-466
Number of pages: 9