Geometric MDS Performance for Large Data Dimensionality Reduction and Visualization

被引:2
|
作者
Dzemyda, Gintautas [1 ]
Sabaliauskas, Martynas [1 ]
Medvedev, Viktor [1 ]
机构
[1] Vilnius Univ, Inst Data Sci & Digital Technol, Vilnius, Lithuania
关键词
dimensionality reduction; multidimensional scaling; Geometric MDS; large-scale data; multi-core implementation; SMACOF; !text type='Python']Python[!/text] codes; INTRINSIC DIMENSIONALITY; VISUAL ANALYSIS; MULTIVARIATE; PROJECTION;
D O I
10.15388/22-INFOR491
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multidimensional scaling (MDS) is a widely used technique for mapping data from a high-dimensional to a lower-dimensional space and for visualizing data. Recently, a new method, known as Geometric MDS, has been developed to minimize the MDS stress function by an iterative procedure, where coordinates of a particular point of the projected space are moved to the new position defined analytically. Such a change in position is easily interpreted geometrically. Moreover, the coordinates of points of the projected space may be recalculated simultaneously, i.e. in parallel, independently of each other. This paper has several objectives. Two implementations of Geometric MDS are suggested and analysed experimentally. The parallel implementation of Geometric MDS is developed for multithreaded multi-core processors. The sequential implementation is optimized for computational speed, enabling it to solve large data problems. It is compared with the SMACOF version of MDS. Python codes for both Geometric MDS and SMACOF are presented to highlight the differences between the two implementations. The comparison was carried out on several aspects: the comparative performance of Geometric MDS and SMACOF depending on the projection dimension, data size and computation time. Geometric MDS usually finds lower stress when the dimensionality of the projected space is smaller.
引用
收藏
页码:299 / 320
页数:22
相关论文
共 50 条
  • [1] Dimensionality Reduction for Data Visualization
    Kaski, Samuel
    Peltonen, Jaakko
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2011, 28 (02) : 100 - 104
  • [2] Data visualization by nonlinear dimensionality reduction
    Gisbrecht, Andrej
    Hammer, Barbara
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 5 (02) : 51 - 73
  • [3] Nonlinear Dimensionality Reduction and Data Visualization:A Review
    Hujun Yin School of Electrical and Electronic Engineering
    [J]. Machine Intelligence Research, 2007, (03) : 294 - 303
  • [4] Nonlinear dimensionality reduction and data visualization: A review
    Yin H.
    [J]. International Journal of Automation and Computing, 2007, 4 (3) : 294 - 303
  • [5] Improving Dimensionality Reduction Projections for Data Visualization
    Rafieian, Bardia
    Hermosilla, Pedro
    Vazquez, Pere-Pau
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (17):
  • [6] Diffusion Maps for dimensionality reduction and visualization of meteorological data
    Fernandez, Angela
    Gonzalez, Ana M.
    Diaz, Julia
    Dorronsoro, Jose R.
    [J]. NEUROCOMPUTING, 2015, 163 : 25 - 37
  • [7] Dimensionality Reduction and Visualization of Network Intrusion Detection Data
    Zong, Wei
    Chow, Yang-Wai
    Susilo, Willy
    [J]. INFORMATION SECURITY AND PRIVACY, ACISP 2019, 2019, 11547 : 441 - 455
  • [8] Scatterplot selection for dimensionality reduction in multidimensional data visualization
    Okada, Kaya
    Itoh, Takayuki
    [J]. JOURNAL OF VISUALIZATION, 2024,
  • [9] Dimensionality reduction for visualization of normal and pathological speech data
    Goddard, J.
    Schlotthauer, G.
    Torres, M. E.
    Rufiner, H. L.
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2009, 4 (03) : 194 - 201
  • [10] Asymmetry index for data and its verification in dimensionality reduction and data visualization
    Olszewski, Dominik
    [J]. INFORMATION SCIENCES, 2025, 689