TopoMap: A 0-dimensional Homology Preserving Projection of High-Dimensional Data

被引:11
|
作者
Doraiswamy, Harish [1 ]
Tierny, Julien [2 ,3 ]
Silva, Paulo J. S. [4 ]
Nonato, Luis Gustavo [5 ]
Silva, Claudio [1 ]
机构
[1] NYU, New York, NY 10012 USA
[2] CNRS, Paris, France
[3] Sorbonne Univ, Paris, France
[4] Univ Estadual Campinas, Campinas, Brazil
[5] Univ Sao Paulo, Sao Carlos, Brazil
基金
欧洲研究理事会; 巴西圣保罗研究基金会;
关键词
Three-dimensional displays; Data visualization; Two dimensional displays; Data analysis; Layout; Visualization; Topology; Topological data analysis; computational topology; high-dimensional data; projection; VISUAL EXPLORATION; POINT CLOUDS; REDUCTION; TOPOLOGY; PERSISTENCE; LANDSCAPES; EXTRACTION; STABILITY;
D O I
10.1109/TVCG.2020.3030441
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multidimensional Projection is a fundamental tool for high-dimensional data analytics and visualization. With very few exceptions, projection techniques are designed to map data from a high-dimensional space to a visual space so as to preserve some dissimilarity (similarity) measure, such as the Euclidean distance for example. In fact, although adopting distinct mathematical formulations designed to favor different aspects of the data, most multidimensional projection methods strive to preserve dissimilarity measures that encapsulate geometric properties such as distances or the proximity relation between data objects. However, geometric relations are not the only interesting property to be preserved in a projection. For instance, the analysis of particular structures such as clusters and outliers could be more reliably performed if the mapping process gives some guarantee as to topological invariants such as connected components and loops. This paper introduces TopoMap, a novel projection technique which provides topological guarantees during the mapping process. In particular, the proposed method performs the mapping from a high-dimensional space to a visual space, while preserving the 0-dimensional persistence diagram of the Rips filtration of the high-dimensional data, ensuring that the filtrations generate the same connected components when applied to the original as well as projected data. The presented case studies show that the topological guarantee provided by TopoMap not only brings confidence to the visual analytic process but also can be used to assist in the assessment of other projection methods.
引用
收藏
页码:561 / 571
页数:11
相关论文
共 50 条
  • [31] ON THE CANONICAL MODULE OF A 0-DIMENSIONAL SCHEME
    KREUZER, M
    [J]. CANADIAN JOURNAL OF MATHEMATICS-JOURNAL CANADIEN DE MATHEMATIQUES, 1994, 46 (02): : 357 - 379
  • [32] LIPSCHITZ CLASSES ON 0-DIMENSIONAL GROUPS
    WALKER, PL
    [J]. PROCEEDINGS OF THE CAMBRIDGE PHILOSOPHICAL SOCIETY-MATHEMATICAL AND PHYSICAL SCIENCES, 1967, 63 : 923 - &
  • [33] Gorenstein liaison of 0-dimensional schemes
    M. Casanellas
    [J]. manuscripta mathematica, 2003, 111 : 265 - 275
  • [34] Tensorized Projection for High-Dimensional Binary Embedding
    Hong, Weixiang
    Meng, Jingjing
    Yuan, Junsong
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 69 - 76
  • [35] On Criticality in High-Dimensional Data
    Saremi, Saeed
    Sejnowski, Terrence J.
    [J]. NEURAL COMPUTATION, 2014, 26 (07) : 1329 - 1339
  • [36] High-dimensional data clustering
    Bouveyron, C.
    Girard, S.
    Schmid, C.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 502 - 519
  • [38] High-Dimensional Data Bootstrap
    Chernozhukov, Victor
    Chetverikov, Denis
    Kato, Kengo
    Koike, Yuta
    [J]. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2023, 10 : 427 - 449
  • [39] High-dimensional data visualization
    Tang, Lin
    [J]. NATURE METHODS, 2020, 17 (02) : 129 - 129
  • [40] High-dimensional data visualization
    Lin Tang
    [J]. Nature Methods, 2020, 17 : 129 - 129