PERFORMANCE ANALYSIS OF ENTROPY METHODS ON K MEANS IN CLUSTERING PROCESS

被引:3
|
作者
Lubis, Mhd Dicky Syahputra [1 ]
Mawengkang, Herman [2 ]
Suwilo, Saib [2 ]
机构
[1] Univ Sumatera Utara, Dept Comp Sci, Medan 20155, Indonesia
[2] Univ Sumatera Utara, Dept Math Sci, Medan 20155, Indonesia
关键词
K Means; Entropy; Clustering; Data Mining; Weight;
D O I
10.1088/1742-6596/930/1/012028
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
K Means is a non-hierarchical data clustering method that attempts to partition existing data into one or more clusters / groups. This method partitions the data into clusters / groups so that data that have the same characteristics are grouped into the same cluster and data that have different characteristics are grouped into other groups. The purpose of this data clustering is to minimize the objective function set in the clustering process, which generally attempts to minimize variation within a cluster and maximize the variation between clusters. However, the main disadvantage of this method is that the number k is often not known before. Furthermore, a randomly chosen starting point may cause two points to approach the distance to be determined as two centroids. Therefore, for the determination of the starting point in K Means used entropy method where this method is a method that can be used to determine a weight and take a decision from a set of alternatives. Entropy is able to investigate the harmony in discrimination among a multitude of data sets. Using Entropy criteria with the highest value variations will get the highest weight. Given this entropy method can help K Means work process in determining the starting point which is usually determined at random. Thus the process of clustering on K Means can be more quickly known by helping the entropy method where the iteration process is faster than the K Means Standard process. Where the postoperative patient dataset of the UCI Repository Machine Learning used and using only 12 data as an example of its calculations is obtained by entropy method only with 2 times iteration can get the desired end result.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Blood Bank Clustering: Improving Performance of Clustering using Entropy Weighted K-Means
    Srinivas, M. Satya
    Lakshmi, P. Vijaya
    Kumar, V. Kalyan Durga Shyam
    Balaji, V. Siva Sai
    2021 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, SMART AND GREEN TECHNOLOGIES (ICISSGT 2021), 2021, : 37 - 41
  • [2] Rough Entropy Based k-Means Clustering
    Malyszko, Dariusz
    Stepaniuk, Jaroslaw
    ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2009, 5908 : 406 - 413
  • [3] Entropy Weighted Power k-Means Clustering
    Chakraborty, Saptarshi
    Paul, Debolina
    Das, Swagatam
    Xu, Jason
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 691 - 700
  • [4] K-means clustering using entropy minimization
    Okafor, A
    Pardalos, PM
    THEORY AND ALGORITHMS FOR COOPERATIVE SYSTEMS, 2004, 4 : 339 - 351
  • [5] Entropy Based Soft K-means Clustering
    Bai, Xue
    Luo, Siwei
    Zhao, Yibiao
    2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 107 - 110
  • [6] Minimum entropy, k-means, spectral clustering
    Lee, Y
    Choi, S
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 117 - 122
  • [7] Performance Analysis of K Means Clustering Algorithms for mMTC Systems
    Kim, Haesik
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 30 - 35
  • [8] K-means clustering algorithm using the entropy
    Palubinskas, G
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING IV, 1998, 3500 : 63 - 71
  • [9] PERFORMANCE ANALYSIS OF COMBINED METHODS OF GENETIC ALGORITHM AND K-MEANS CLUSTERING IN DETERMINING THE VALUE OF CENTROID
    Putra, Adya Zizwan
    Zarlis, Muhammad
    Nababan, Erna Budhiarti
    INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICONICT), 2017, 930
  • [10] ASSESSING THE PERFORMANCE OF K-MEANS AND DBSCAN CLUSTERING METHODS IN TUBERCULOSIS MAPPING
    Faidah, Defi yusti
    Destin, Dianda
    Anggina, Fazila azra
    Caesar, Muhammad imamul
    COMMUNICATIONS IN MATHEMATICAL BIOLOGY AND NEUROSCIENCE, 2025,