t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections

被引：83

作者：

Chatzimparmpas, Angelos ^{[1
]}

Martins, Rafael M. ^{[1
]}

Kerren, Andreas ^{[1
]}

机构：

[1] Linnaeus Univ, Dept Comp Sci & Media Technol, S-35195 Vaxjo, Sweden

来源：

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS | 2020年 / 26卷 / 08期

关键词：

Tools; Visualization; Data visualization; Task analysis; Correlation; Principal component analysis; Dimensionality reduction; Interpretable t-SNE; dimensionality reduction; high-dimensional data; explainable machine learning; visualization; HIGH-DIMENSIONAL DATA; VISUAL ANALYSIS; REDUCTION; QUALITY; AXES;

D O I：

10.1109/TVCG.2020.2986996

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualization of multidimensional data has proven to be a popular approach, with successful applications in a wide range of domains. Despite their usefulness, t-SNE projections can be hard to interpret or even misleading, which hurts the trustworthiness of the results. Understanding the details of t-SNE itself and the reasons behind specific patterns in its output may be a daunting task, especially for non-experts in dimensionality reduction. In this article, we present t-viSNE, an interactive tool for the visual exploration of t-SNE projections that enables analysts to inspect different aspects of their accuracy and meaning, such as the effects of hyper-parameters, distance and neighborhood preservation, densities and costs of specific neighborhoods, and the correlations between dimensions and visual patterns. We propose a coherent, accessible, and well-integrated collection of different views for the visualization of t-SNE projections. The applicability and usability of t-viSNE are demonstrated through hypothetical usage scenarios with real data sets. Finally, we present the results of a user study where the tool's effectiveness was evaluated. By bringing to light information that would normally be lost after running t-SNE, we hope to support analysts in using t-SNE and making its results better understandable.

引用

页码：2696 / 2714

页数：19

共 50 条

[41] Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE
Melit Devassy, Binu
George, Sony
Nussbaum, Peter
[J]. JOURNAL OF IMAGING, 2020, 6 (05)
[42] Deep learning and t-SNE projection for plankton images clusterization
Homsi Goulart, Antonio Jose
Morimitsu, Alexandre
Jacomassi, Renan
Hirata, Nina
Lopes, Rubens
[J]. OCEANS 2021: SAN DIEGO - PORTO, 2021,
[43] t-SNE Visualization of Large-Scale Neural Recordings
Dimitriadis, George
Neto, Joana P.
Kampff, Adam R.
[J]. NEURAL COMPUTATION, 2018, 30 (07) : 1750 - 1774
[44] A t-SNE Based Classification Approach to Compositional Microbiome Data
Xu, Xueli
Xie, Zhongming
Yang, Zhenyu
Li, Dongfang
Xu, Ximing
[J]. FRONTIERS IN GENETICS, 2020, 11
[45] Parallel t-SNE Applied to Data Visualization in Smart Cities
Da Silva Lopes, Maximiliano Araujo
Doria Neto, Adriao D.
De Medeiros Martins, Allan
[J]. IEEE ACCESS, 2020, 8 : 11482 - 11490
[46] Confidence estimation for t-SNE embeddings using random forest
Busra Ozgode Yigin
Gorkem Saygili
[J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 3981 - 3992
[47] Accelerating t-SNE using Tree-Based Algorithms
van der Maaten, Laurens
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 3221 - 3245
[48] Data Segmentation via t-SNE, DBSCAN, and Random Forest
DeLise, Timothy
[J]. INTELLIGENT COMPUTING, VOL 2, 2021, 284 : 139 - 151
[49] A fault identification method of rotating machinery based on t-SNE
Gu, Yuhai
He, Linfeng
Deng, Yali
[J]. Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2016, 37 : 152 - 156
[50] Using Global t-SNE to Preserve Intercluster Data Structure
Zhou, Yuansheng
Sharpee, Tatyana O.
[J]. NEURAL COMPUTATION, 2022, 34 (08) : 1637 - 1651

← 1 2 3 4 5 →