Analysis of parallel computational models for clustering

Cited by: 3
Authors
Plaza, Malgorzata [1]
Deniziak, Stanislaw [1]
Plaza, Miroslaw [1]
Belka, Radoslaw [1]
Pieta, Pawel [1]
Affiliations
[1] Kielce Univ Technol, Fac Elect Engn Automat Control & Comp Sci, Al Tysiaclecia PP 7, PL-25314 Kielce, Poland
Keywords
big data; clustering; cluster analysis; data mining; machine learning; parallel algorithms; ALGORITHM;
DOI
10.1117/12.2500795
Chinese Library Classification
O43 [Optics];
Discipline classification code
070207 ; 0803 ;
Abstract
Clustering is one of the main tasks of data mining: it discovers groups of similar objects and is also used to detect outliers. Processing huge datasets requires scalable computational models and distributed computing environments, so efficient parallel clustering methods are needed. Parallel data analytics usually relies on the MapReduce processing model. However, the growing computing power of heterogeneous platforms based on graphics processors and FPGA accelerators may make the CUDA and OpenCL models an interesting alternative to MapReduce. This paper presents a comparative analysis of the effectiveness of the MapReduce and CUDA/OpenCL processing models for clustering. We compare different clustering methods in terms of how well they can be parallelized under both models of computation. The conclusions indicate directions for further work in this area.
Pages: 11