Multi-Core Implementation of Geometric Multidimensional Scaling for Large-Scale Data

被引:1
|
作者
Dzemyda, Gintautas [1 ]
Medvedev, Viktor [1 ]
Sabaliauskas, Martynas [1 ]
机构
[1] Vilnius Univ, Inst Data Sci & Digital Technol, Vilnius, Lithuania
关键词
Dimensionality reduction; Multidimensional scaling; Geometric MDS; Parallel computations; Large-scale data; Multi-core implementation;
D O I
10.1007/978-3-031-04819-7_8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A well-known and widely used technique for mapping data from high-dimensional space to lower-dimensional space is multidimensional scaling (MDS). Although MDS, as a dimensionality reduction method used for data visualization, demonstrates great versatility, it is computationally demanding, especially when the data set is not fixed and its size is constantly growing. Traditional MDS approaches are limited when analyzing very large data sets, as they require very long computation times and large amounts of memory. A way to minimize MDS stress, which can be used to reduce the dimensionality of large-scale data, has been developed using the ideas of Geometric MDS, where all points in a low-dimensional space change their coordinates simultaneously and independently during a single iteration of stress minimization. It is shown in this paper that Geometric MDS allows the implementation of parallel computing for the dimensionality reduction process of large-scale data using multithreaded multi-core processors. We explore how the computational time consumption of data dimensionality reduction and multidimensional data visualization depends on the number of processor cores or processor threads used.
引用
下载
收藏
页码:74 / 82
页数:9
相关论文
共 50 条
  • [1] Large-Scale Modal Analysis on Multi-Core Architectures
    Suresh, Krishnan
    Yadav, Praveen
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE 2012, VOL 2, PTS A AND B, 2012, : 785 - 791
  • [2] Securing a Large-Scale Data Center Using a Multi-Core Enclave Model
    Al-Qahtani, Mohammed Saad
    Farooq, Hafiz Muhammad
    UKSIM-AMSS 11TH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2017), 2017, : 221 - 226
  • [3] Analyzing large-scale DNA Sequences on Multi-core Architectures
    Memeti, Suejb
    Pllana, Sabri
    2015 IEEE 18TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2015, : 208 - 215
  • [4] Multi-core accelerated CRDT for large-scale and dynamic collaboration
    Cai, Weiwei
    He, Fazhi
    Lv, Xiao
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (08): : 10799 - 10828
  • [5] Multi-core accelerated CRDT for large-scale and dynamic collaboration
    Weiwei Cai
    Fazhi He
    Xiao Lv
    The Journal of Supercomputing, 2022, 78 : 10799 - 10828
  • [6] Interactive Rendering of Large-Scale Volumes on Multi-Core CPUs
    Wang, Feng
    Wald, Ingo
    Johnson, Chris R.
    2019 IEEE 9TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2019, : 27 - 36
  • [7] A multi-core computing approach for large-scale multi-label classification
    Rodriguez, Juan Manuel
    Godoy, Daniela
    Mateos, Cristian
    Zunino, Alejandro
    INTELLIGENT DATA ANALYSIS, 2017, 21 (02) : 329 - 352
  • [8] Large-scale fast Fourier transform on a heterogeneous multi-core system
    Li, Yan
    Diamond, Jeffrey R.
    Wang, Xu
    Lin, Haibo
    Yang, Yudong
    Han, Zhenxing
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2012, 26 (02): : 148 - 158
  • [9] AOmpLib: An Aspect Library for Large-Scale Multi-Core Parallel Programming
    Medeiros, Bruno
    Sobral, Joao L.
    2013 42ND ANNUAL INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2013, : 270 - 279
  • [10] Quadtree-Based Lightweight Data Compression for Large-Scale Geospatial Rasters on Multi-Core CPUs
    Zhang, Jianting
    You, Simin
    Gruenwald, Le
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 478 - 484