PERFORMANCE ANALYSIS OF ENTROPY METHODS ON K MEANS IN CLUSTERING PROCESS

被引:3
|
作者
Lubis, Mhd Dicky Syahputra [1 ]
Mawengkang, Herman [2 ]
Suwilo, Saib [2 ]
机构
[1] Univ Sumatera Utara, Dept Comp Sci, Medan 20155, Indonesia
[2] Univ Sumatera Utara, Dept Math Sci, Medan 20155, Indonesia
关键词
K Means; Entropy; Clustering; Data Mining; Weight;
D O I
10.1088/1742-6596/930/1/012028
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
K Means is a non-hierarchical data clustering method that attempts to partition existing data into one or more clusters / groups. This method partitions the data into clusters / groups so that data that have the same characteristics are grouped into the same cluster and data that have different characteristics are grouped into other groups. The purpose of this data clustering is to minimize the objective function set in the clustering process, which generally attempts to minimize variation within a cluster and maximize the variation between clusters. However, the main disadvantage of this method is that the number k is often not known before. Furthermore, a randomly chosen starting point may cause two points to approach the distance to be determined as two centroids. Therefore, for the determination of the starting point in K Means used entropy method where this method is a method that can be used to determine a weight and take a decision from a set of alternatives. Entropy is able to investigate the harmony in discrimination among a multitude of data sets. Using Entropy criteria with the highest value variations will get the highest weight. Given this entropy method can help K Means work process in determining the starting point which is usually determined at random. Thus the process of clustering on K Means can be more quickly known by helping the entropy method where the iteration process is faster than the K Means Standard process. Where the postoperative patient dataset of the UCI Repository Machine Learning used and using only 12 data as an example of its calculations is obtained by entropy method only with 2 times iteration can get the desired end result.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] A Novel k′-Means Algorithm for Clustering Analysis
    Fang, Chonglun
    Ma, Jinwen
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 2142 - +
  • [22] Stability analysis in K-means clustering
    Steinley, Douglas
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2008, 61 : 255 - 273
  • [23] Clustering Performance of an Evolutionary K-Means Algorithm
    Nigro, Libero
    Cicirelli, Franco
    Pupo, Francesco
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 9, ICICT 2024, 2025, 1054 : 359 - 369
  • [24] Statistically Improving K-means Clustering Performance
    Ihsanoglu, Abdullah
    Zaval, Mounes
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [25] Hierarchical Clustering and K-means Analysis of HPC Application Kernels Performance Characteristics
    Grodowitz, M. L.
    Sreepathi, Sarat
    2015 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2015,
  • [26] Parallel K-means clustering algorithm based on information entropy and GPU
    Chen, Xiao-Hui
    Zhang, Gong-Xuan
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2014, 35 : 62 - 67
  • [27] Image segmentation based on rough entropy and K-means clustering algorithm
    Xu, Yi
    Li, Long-Shu
    Li, Xue-Jun
    Huadong Ligong Daxue Xuebao /Journal of East China University of Science and Technology, 2007, 33 (02): : 255 - 258
  • [28] An Improved k-means Algorithm for Clustering Using Entropy Weighting Measures
    Li, Taoying
    Chen, Yan
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 149 - 153
  • [29] Performance Analysis of Fuzzy C-Means Clustering Methods for MRI Image Segmentation
    Choudhry, Mahipal Singh
    Kapoor, Rajiv
    TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 : 749 - 758
  • [30] Evaluating Performance of WFA K-means and Modified Follow the Leader Methods for Clustering Load Curves
    Mahmoudi-Kohan, N.
    Moghaddam, M. P.
    Bidaki, S. M.
    2009 IEEE/PES POWER SYSTEMS CONFERENCE AND EXPOSITION, VOLS 1-3, 2009, : 1858 - +