Visual cluster separation using high-dimensional sharpened dimensionality reduction

被引:1
|
作者
Kim, Youngjoo [1 ]
Telea, Alexandru C. [2 ]
Trager, Scott C. [3 ]
Roerdink, Jos B. T. M. [1 ]
机构
[1] Univ Groningen, Bernoulli Inst Math Comp Sci & Artificial Intelli, Groningen, Netherlands
[2] Univ Utrecht, Dept Informat & Comp Sci, Utrecht, Netherlands
[3] Univ Groningen, Kapteyn Astron Inst, Groningen, Netherlands
关键词
High-dimensional data visualization; dimensionality reduction; clustering; astronomy; MEAN SHIFT; PROJECTION METHODS; DISTANCE;
D O I
10.1177/14738716221086589
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Applying dimensionality reduction (DR) to large, high-dimensional data sets can be challenging when distinguishing the underlying high-dimensional data clusters in a 2D projection for exploratory analysis. We address this problem by first sharpening the clusters in the original high-dimensional data prior to the DR step using Local Gradient Clustering (LGC). We then project the sharpened data from the high-dimensional space to 2D by a user-selected DR method. The sharpening step aids this method to preserve cluster separation in the resulting 2D projection. With our method, end-users can label each distinct cluster to further analyze an otherwise unlabeled data set. Our "High-Dimensional Sharpened DR" (HD-SDR) method, tested on both synthetic and real-world data sets, is favorable to DR methods with poor cluster separation and yields a better visual cluster separation than these DR methods with no sharpening. Our method achieves good quality (measured by quality metrics) and scales computationally well with large high-dimensional data. To illustrate its concrete applications, we further apply HD-SDR on a recent astronomical catalog.
引用
收藏
页码:246 / 269
页数:24
相关论文
共 50 条
  • [41] Comparing and Exploring High-Dimensional Data with Dimensionality Reduction Algorithms and Matrix Visualizations
    Cutura, Rene
    Aupetit, Michael
    Fekete, Jean-Daniel
    Sedlmair, Michael
    [J]. PROCEEDINGS OF THE WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES AVI 2020, 2020,
  • [42] Distance-preserving projection of high-dimensional data for nonlinear dimensionality reduction
    Yang, L
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (09) : 1243 - 1246
  • [43] Recent Dimensionality Reduction Techniques for High-Dimensional COVID-19 Data
    Dallas, Ioannis L.
    Vrahatis, Aristidis G.
    Tasoulis, Sotiris K.
    Plagianakos, Vassilis P.
    [J]. COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, CIBB 2021, 2022, 13483 : 227 - 241
  • [44] Semi-supervised dimensionality reduction for analyzing high-dimensional data with constraints
    Yan, Su
    Bouaziz, Sofien
    Lee, Dongwon
    Barlow, Jesse
    [J]. NEUROCOMPUTING, 2012, 76 (01) : 114 - 124
  • [45] Practical bound for dimensionality in high-dimensional entanglement
    Romero, Jacquiline
    Padgett, Miles
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (17) : 6122 - 6123
  • [46] Interactive Visual Cluster Analysis by Contrastive Dimensionality Reduction
    Xia, Jiazhi
    Huang, Linquan
    Lin, Weixing
    Zhao, Xin
    Wu, Jing
    Chen, Yang
    Zhao, Ying
    Chen, Wei
    [J]. IEEE Transactions on Visualization and Computer Graphics, 2023, 29 (01) : 734 - 744
  • [47] Gaussian processes with built-in dimensionality reduction: Applications to high-dimensional uncertainty propagation
    Tripathy, Rohit
    Bilionis, Ilias
    Gonzalez, Marcial
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 321 : 191 - 223
  • [48] Self-taught dimensionality reduction on the high-dimensional small-sized data
    Zhu, Xiaofeng
    Huang, Zi
    Yang, Yang
    Shen, Heng Tao
    Xu, Changsheng
    Luo, Jiebo
    [J]. PATTERN RECOGNITION, 2013, 46 (01) : 215 - 229
  • [49] The analysis on dimensionality reduction mathematical model based on feedback constraint for High-dimensional information
    Peng, Wu
    [J]. ADVANCES IN MECHATRONICS, AUTOMATION AND APPLIED INFORMATION TECHNOLOGIES, PTS 1 AND 2, 2014, 846-847 : 1056 - 1059
  • [50] Robust locally nonlinear embedding (RLNE) for dimensionality reduction of high-dimensional data with noise
    Xu, Yichen
    Li, Eric
    [J]. NEUROCOMPUTING, 2024, 596