t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections

被引:83
|
作者
Chatzimparmpas, Angelos [1 ]
Martins, Rafael M. [1 ]
Kerren, Andreas [1 ]
机构
[1] Linnaeus Univ, Dept Comp Sci & Media Technol, S-35195 Vaxjo, Sweden
关键词
Tools; Visualization; Data visualization; Task analysis; Correlation; Principal component analysis; Dimensionality reduction; Interpretable t-SNE; dimensionality reduction; high-dimensional data; explainable machine learning; visualization; HIGH-DIMENSIONAL DATA; VISUAL ANALYSIS; REDUCTION; QUALITY; AXES;
D O I
10.1109/TVCG.2020.2986996
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualization of multidimensional data has proven to be a popular approach, with successful applications in a wide range of domains. Despite their usefulness, t-SNE projections can be hard to interpret or even misleading, which hurts the trustworthiness of the results. Understanding the details of t-SNE itself and the reasons behind specific patterns in its output may be a daunting task, especially for non-experts in dimensionality reduction. In this article, we present t-viSNE, an interactive tool for the visual exploration of t-SNE projections that enables analysts to inspect different aspects of their accuracy and meaning, such as the effects of hyper-parameters, distance and neighborhood preservation, densities and costs of specific neighborhoods, and the correlations between dimensions and visual patterns. We propose a coherent, accessible, and well-integrated collection of different views for the visualization of t-SNE projections. The applicability and usability of t-viSNE are demonstrated through hypothetical usage scenarios with real data sets. Finally, we present the results of a user study where the tool's effectiveness was evaluated. By bringing to light information that would normally be lost after running t-SNE, we hope to support analysts in using t-SNE and making its results better understandable.
引用
收藏
页码:2696 / 2714
页数:19
相关论文
共 50 条
  • [41] Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE
    Melit Devassy, Binu
    George, Sony
    Nussbaum, Peter
    [J]. JOURNAL OF IMAGING, 2020, 6 (05)
  • [42] Deep learning and t-SNE projection for plankton images clusterization
    Homsi Goulart, Antonio Jose
    Morimitsu, Alexandre
    Jacomassi, Renan
    Hirata, Nina
    Lopes, Rubens
    [J]. OCEANS 2021: SAN DIEGO - PORTO, 2021,
  • [43] t-SNE Visualization of Large-Scale Neural Recordings
    Dimitriadis, George
    Neto, Joana P.
    Kampff, Adam R.
    [J]. NEURAL COMPUTATION, 2018, 30 (07) : 1750 - 1774
  • [44] A t-SNE Based Classification Approach to Compositional Microbiome Data
    Xu, Xueli
    Xie, Zhongming
    Yang, Zhenyu
    Li, Dongfang
    Xu, Ximing
    [J]. FRONTIERS IN GENETICS, 2020, 11
  • [45] Parallel t-SNE Applied to Data Visualization in Smart Cities
    Da Silva Lopes, Maximiliano Araujo
    Doria Neto, Adriao D.
    De Medeiros Martins, Allan
    [J]. IEEE ACCESS, 2020, 8 : 11482 - 11490
  • [46] Confidence estimation for t-SNE embeddings using random forest
    Busra Ozgode Yigin
    Gorkem Saygili
    [J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 3981 - 3992
  • [47] Accelerating t-SNE using Tree-Based Algorithms
    van der Maaten, Laurens
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 3221 - 3245
  • [48] Data Segmentation via t-SNE, DBSCAN, and Random Forest
    DeLise, Timothy
    [J]. INTELLIGENT COMPUTING, VOL 2, 2021, 284 : 139 - 151
  • [49] A fault identification method of rotating machinery based on t-SNE
    Gu, Yuhai
    He, Linfeng
    Deng, Yali
    [J]. Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2016, 37 : 152 - 156
  • [50] Using Global t-SNE to Preserve Intercluster Data Structure
    Zhou, Yuansheng
    Sharpee, Tatyana O.
    [J]. NEURAL COMPUTATION, 2022, 34 (08) : 1637 - 1651