On genetic programming representations and fitness functions for interpretable dimensionality reduction

被引:3
|
作者
Uriot, Thomas [1 ]
Virgolin, Marco [2 ]
Alderliesten, Tanja [1 ]
Bosman, Peter A. N. [2 ]
机构
[1] Leiden Univ, Med Ctr, Leiden, Netherlands
[2] Ctr Wiskunde & Informat, Amsterdam, Netherlands
基金
荷兰研究理事会;
关键词
Dimensionality reduction; genetic programming; interpretability; unsupervised learning;
D O I
10.1145/3512290.3528849
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Dimensionality reduction (DR) is an important technique for data exploration and knowledge discovery. However, most of the main DR methods are either linear (e.g., PCA), do not provide an explicit mapping between the original data and its lower-dimensional representation (e.g., MDS, t-SNE, isomap), or produce mappings that cannot be easily interpreted (e.g., kernel PCA, neural-based autoencoder). Recently, genetic programming (GP) has been used to evolve interpretable DR mappings in the form of symbolic expressions. There exists a number of ways in which GP can be used to this end and no study exists that performs a comparison. In this paper, we fill this gap by comparing existing GP methods as well as devising new ones. We evaluate our methods on several benchmark datasets based on predictive accuracy and on how well the original features can be reconstructed using the lower-dimensional representation only. Finally, we qualitatively assess the resulting expressions and their complexity. We find that various GP methods can be competitive with state-of-the-art DR algorithms and that they have the potential to produce interpretable DR mappings.
引用
收藏
页码:458 / 466
页数:9
相关论文
共 50 条
  • [41] Variants of genetic programming for species distribution modelling - fitness sharing, partial functions, population evaluation
    McKay, RI
    [J]. ECOLOGICAL MODELLING, 2001, 146 (1-3) : 231 - 241
  • [42] Interpretable dimensionality reduction of single cell transcriptome data with deep generative models
    Jiarui Ding
    Anne Condon
    Sohrab P. Shah
    [J]. Nature Communications, 9
  • [43] IBMG: Interpretable Behavioral model generator for nonlinear analog circuits via canonical form functions and genetic programming
    McConaghy, T
    Gielen, G
    [J]. 2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 5170 - 5173
  • [44] Interpreting the dimensions of neural feature representations revealed by dimensionality reduction
    Goddard, Erin
    Klein, Colin
    Solomon, Samuel G.
    Hogendoorn, Hinze
    Carlson, Thomas A.
    [J]. NEUROIMAGE, 2018, 180 : 41 - 67
  • [45] Contrastive analysis for scatterplot-based representations of dimensionality reduction
    Marcilio-Jr, Wilson E.
    Eler, Danilo M.
    Garcia, Rogerio E.
    [J]. COMPUTERS & GRAPHICS-UK, 2021, 101 (101): : 46 - 58
  • [46] Coevolving functions in genetic programming
    Ahluwalia, M
    Bull, L
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2001, 47 (07) : 573 - 585
  • [47] An investigation of dynamic fitness measures for genetic programming
    Ragalo, Anisa
    Pillay, Nelishia
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 92 : 52 - 72
  • [48] Fitness approximation for bot evolution in genetic programming
    Esparcia-Alcazar, Anna I.
    Moravec, Jaroslav
    [J]. SOFT COMPUTING, 2013, 17 (08) : 1479 - 1487
  • [49] Evolving dynamic fitness measures for genetic programming
    Ragalo, Anisa
    Pillay, Nelishia
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 109 : 162 - 187
  • [50] Dimensionality reduction via genetic value clustering
    Topchy, A
    Punch, W
    [J]. GENETIC AND EVOLUTIONARY COMPUTATION - GECCO 2003, PT II, PROCEEDINGS, 2003, 2724 : 1431 - 1443