Performance analysis for clustering algorithms

Cited by: 2
Authors
Xue, Yu [1, 2, 3]
Zhao, Binping [1]
Ma, Tinghuai [1]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Jiangsu Engn Ctr Network Monitoring, Nanjing 210044, Jiangsu, Peoples R China
[3] Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Jiangsu, Peoples R China
Keywords
optimal clustering; K-means; fuzzy C-means algorithm; hybrid differential evolution algorithm; performance analysis; high dimension;
DOI
10.1504/IJCSM.2016.080089
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
There are many algorithms for optimal clustering. The main clustering algorithms include K-means, fuzzy c-means (FCM) and evolutionary algorithms. The main purpose of this paper was to study the performance and characteristics of these three types of algorithms. One criterion (a clustering validity index), namely TRW, was used to optimise the classification, and eight real-world datasets of increasing dimension (glass, wine, ionosphere, biodegradation, connectionist bench, hill-valley, musk and madelon) were used. We carried out a performance analysis and concluded that K-means and FCM easily fall into a local minimum, whereas the hybrid algorithm was found to be more reliable and more efficient, especially on difficult high-dimensional tasks.
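The local-minimum behaviour of K-means described in the abstract can be illustrated with a minimal sketch. It is not the authors' code. It assumes that TRW denotes the trace of the within-cluster scatter matrix (equivalently, the sum of squared distances from each point to its cluster centre), uses synthetic data rather than the eight datasets named above, and the function names `trace_within`, `kmeans` and `make_blobs` are hypothetical.

```python
import numpy as np


def trace_within(X, labels, centers):
    """Assumed TRW: trace of the within-cluster scatter matrix,
    i.e. the sum of squared distances to the assigned centre."""
    diffs = X - centers[labels]
    return float(np.sum(diffs ** 2))


def assign(X, centers):
    """Label each point with the index of its nearest centre."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)


def kmeans(X, k, seed, n_iter=100):
    """Plain Lloyd iteration started from k randomly chosen data points."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        labels = assign(X, centers)
        new_centers = centers.copy()
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old centre if a cluster empties
                new_centers[j] = members.mean(axis=0)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    labels = assign(X, centers)
    return labels, centers, trace_within(X, labels, centers)


def make_blobs(seed=0):
    """Synthetic 2-D data: four Gaussian blobs (illustration only)."""
    rng = np.random.default_rng(seed)
    means = np.array([[0, 0], [5, 0], [0, 5], [5, 5]], dtype=float)
    return np.vstack([m + 0.3 * rng.standard_normal((50, 2)) for m in means])


X = make_blobs()
# Each seed gives a different initialisation; the final criterion value
# may differ between runs when a run stops in a poorer local minimum.
scores = [kmeans(X, 4, seed=s)[2] for s in range(20)]
print("best:", min(scores), "worst:", max(scores))
```

Comparing the best and worst values over several seeds is a simple way to see how strongly a single K-means run depends on its starting point. Global search methods such as the hybrid differential evolution algorithm named in the keywords are intended to reduce that dependence.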
Pages: 485-493
Number of pages: 9
Related papers
50 in total
  • [41] Performance Comparison of Two Algorithms for Arbitrary Shapes Clustering
    Khader, Mariam
    Al-Naymat, Ghazi
    2019 INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2019, : 20 - 26
  • [42] Comparison of the performance of center-based clustering algorithms
    Zhang, B
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2003, 2637 : 63 - 74
  • [43] Performance characterization of clustering algorithms for colour image segmentation
    Ilea, D. E.
    Whelan, P. F.
    Ghita, O.
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON OPTIMIZATION OF ELECTRICAL AND ELECTRONIC EQUIPMENT, VOL IV, 2006, : 137 - 142
  • [44] On the performance of high dimensional data clustering and classification algorithms
    Ericson, Kathleen
    Pallickara, Shrideep
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2013, 29 (04): : 1024 - 1034
  • [45] A Comparative Analysis of Distributed Clustering Algorithms: A Survey
    Singh, Deepika
    Gosain, Anjana
    2013 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI), 2013, : 165 - 169
  • [46] Malware Analysis Using Classification and Clustering Algorithms
    Balaji, K. M.
    Subbulakshmi, T.
    INTERNATIONAL JOURNAL OF E-COLLABORATION, 2022, 18 (01)
  • [47] Ultrafast clustering algorithms for metagenomic sequence analysis
    Li, Weizhong
    Fu, Limin
    Niu, Beifang
    Wu, Sitao
    Wooley, John
    BRIEFINGS IN BIOINFORMATICS, 2012, 13 (06) : 656 - 668
  • [48] Analysis of Hard Clustering Algorithms Applicable to Regionalization
    Christina, J.
    Komathy, K.
    2013 IEEE CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICT 2013), 2013, : 606 - 610
  • [49] Comparison of Clustering Algorithms for Revenue and Cost Analysis
    Boyko, Nataliya
    Hetman, Solomiya
    Kots, Iryna
    COLINS 2021: COMPUTATIONAL LINGUISTICS AND INTELLIGENT SYSTEMS, VOL I, 2021, 2870
  • [50] Multi Angle Analysis of The Existing Clustering Algorithms
    Ping, Jinzhen
    Wang, Qian
    Yu, Lili
    Wu, XueFang
    PROCEEDINGS OF THE 2015 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND AUTOMATION ENGINEERING, 2016, 42 : 404 - 407