Performance of Parallel K-Means Algorithms in Java']Java

被引:6
|
作者
Nigro, Libero [1 ]
机构
[1] Univ Calabria, Engn Dept Informat Modelling Elect & Syst Sci DIM, I-87036 Arcavacata Di Rende, Italy
关键词
parallel algorithms; multi-core machines; K-means clustering; !text type='Java']Java[!/text; functional parallel streams; actors; message-passing; lightweight parallel programming;
D O I
10.3390/a15040117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
K-means is a well-known clustering algorithm often used for its simplicity and potential efficiency. Its properties and limitations have been investigated by many works reported in the literature. K-means, though, suffers from computational problems when dealing with large datasets with many dimensions and great number of clusters. Therefore, many authors have proposed and experimented different techniques for the parallel execution of K-means. This paper describes a novel approach to parallel K-means which, today, is based on commodity multicore machines with shared memory. Two reference implementations in Java are developed and their performances are compared. The first one is structured according to a map/reduce schema that leverages the built-in multi-threaded concurrency automatically provided by Java to parallel streams. The second one, allocated on the available cores, exploits the parallel programming model of the Theatre actor system, which is control-based, totally lock-free, and purposely relies on threads as coarse-grain "programming-in-the-large" units. The experimental results confirm that some good execution performance can be achieved through the implicit and intuitive use of Java concurrency in parallel streams. However, better execution performance can be guaranteed by the modular Theatre implementation which proves more adequate for an exploitation of the computational resources.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] COMPARATIVE STUDY OF k-MEANS AND MEAN SHIFT CLUSTERING ALGORITHMS FOR WASTE DATA IN WEST JAVA']JAVA PROVINCE
    Setyawan, Rony
    Pamuji, Geraldi Catur
    [J]. JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2024, 19 (03): : 869 - 879
  • [2] Performance Evaluation of Java']Java/PCJ Implementation of Parallel Algorithms on the Cloud
    Nowicki, Marek
    Gorski, Lukasz
    Bala, Piotr
    [J]. EURO-PAR 2020: PARALLEL PROCESSING WORKSHOPS, 2021, 12480 : 213 - 224
  • [3] Performance Analysis of Parallel K-Means with Optimization Algorithms for Clustering on Spark
    Santhi, V.
    Jose, Rini
    [J]. DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY (ICDCIT 2018), 2018, 10722 : 158 - 162
  • [4] A WEB COMPUTING ENVIRONMENT FOR PARALLEL ALGORITHMS IN JAVA']JAVA
    Bonorden, Olaf
    Gehweiler, Joachim
    Heide, Friedhelm Meyer Auf Der
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2006, 7 (02): : 1 - 14
  • [5] A web computing environment for parallel algorithms in Java']Java
    Bonorden, Olaf
    Gehweiler, Joachim
    Heide, Friedhelm Meyer auf der
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2006, 3911 : 801 - 808
  • [6] Performance evaluation of Java']Java/PCJ implementation of parallel algorithms on the cloud (extended version)
    Nowicki, Marek
    Gorski, Tukasz
    Bala, Piotr
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (15):
  • [7] Comparing Java']Java agents and MPI for writing parallel algorithms
    Kochmar, J
    Nowoczynski, P
    Scott, JR
    Stone, N
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, PROCEEDINGS, 1999, : 2108 - 2114
  • [8] Building a distributed K-Means model for Weka using remote method invocation (RMI) feature of Java']Java
    Sudarsan, V.
    Sugumar, R.
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (14):
  • [9] Performance of Parallel K-Means Based on Theatre
    Cicirelli, Franco
    Nigro, Libero
    Pupo, Francesco
    [J]. PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 4, 2023, 465 : 241 - 249
  • [10] Performance Analysis of K-Means Seeding Algorithms
    Ortiz-Bejar, Jose
    Tellez, Eric S.
    Graff, Mario
    Ortiz-Bejar, Jesus
    Jacobo, Jaime Cerda
    Zamora-Mendez, Alejandro
    [J]. 2019 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC 2019), 2019,