PERFORMANCE ANALYSIS OF ENTROPY METHODS ON K MEANS IN CLUSTERING PROCESS

Cited by: 3

Authors:

Lubis, Mhd Dicky Syahputra [1]

Mawengkang, Herman [2]

Suwilo, Saib [2]

Affiliations:

[1] Univ Sumatera Utara, Dept Comp Sci, Medan 20155, Indonesia

[2] Univ Sumatera Utara, Dept Math Sci, Medan 20155, Indonesia

Source:

INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICONICT) | 2017 / Vol. 930

Keywords:

K Means; Entropy; Clustering; Data Mining; Weight;

DOI:

10.1088/1742-6596/930/1/012028

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

K Means is a non-hierarchical data clustering method that partitions data into one or more clusters, so that data with the same characteristics are grouped into the same cluster and data with different characteristics are grouped into other clusters. The purpose of the clustering is to minimize an objective function, which generally minimizes the variation within a cluster and maximizes the variation between clusters. The main disadvantage of the method is that the number of clusters k is often not known in advance. Furthermore, randomly chosen starting points may result in two points that lie close to each other being selected as two centroids. The entropy method is therefore used to determine the starting points in K Means. It is a method for determining weights and making a decision from a set of alternatives, and it can measure the degree of discrimination among a set of data. Under the entropy method, the criterion with the highest variation in values receives the highest weight. In this way the entropy method helps K Means determine its starting points, which are usually chosen at random. As a result, K Means with entropy requires fewer iterations than standard K Means. On the Postoperative Patient dataset from the UCI Machine Learning Repository, using only 12 records as a worked example, the entropy method reaches the desired final result in only 2 iterations.
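The abstract does not spell out how the entropy weights are turned into starting points, so the following is only a minimal sketch under stated assumptions. `entropy_weights` follows the commonly used entropy weight method (column proportions, normalized Shannon entropy, weights proportional to one minus entropy). The seeding rule in `entropy_seeds` (rank records by weighted score and take k evenly spaced ranks) is a hypothetical choice made for illustration and is not confirmed as the authors' procedure. The function names and the toy data are likewise assumptions, not taken from the paper.

```python
import numpy as np

def entropy_weights(X):
    """Entropy weight method: columns whose values vary more get larger weights.

    Assumes non-negative data, positive column sums, and at least one
    non-constant column (otherwise the final division is undefined).
    """
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    P = X / X.sum(axis=0)                      # column-wise proportions p_ij
    # Treat 0 * log(0) as 0 by leaving log at 0 wherever p_ij is 0.
    logP = np.log(P, where=P > 0, out=np.zeros_like(P))
    e = -(P * logP).sum(axis=0) / np.log(n)    # normalized entropy per column
    d = 1.0 - e                                # degree of diversification
    return d / d.sum()

def entropy_seeds(X, k):
    """Hypothetical seeding rule (an assumption, not the paper's stated method)."""
    X = np.asarray(X, dtype=float)
    scores = X @ entropy_weights(X)            # weighted score per record
    order = np.argsort(scores, kind="stable")  # records ranked by score
    picks = np.linspace(0, len(X) - 1, k).round().astype(int)
    return X[order[picks]]                     # k records at evenly spaced ranks

def kmeans(X, centroids, max_iter=100):
    """Plain Lloyd iterations; returns labels, centroids, and assignment passes used."""
    X = np.asarray(X, dtype=float)
    centroids = np.array(centroids, dtype=float)
    labels = None
    for iteration in range(1, max_iter + 1):
        # Euclidean distance from every record to every centroid.
        dist = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        new_labels = dist.argmin(axis=1)
        if labels is not None and np.array_equal(new_labels, labels):
            return labels, centroids, iteration   # assignments stopped changing
        labels = new_labels
        for j in range(len(centroids)):
            members = X[labels == j]
            if len(members):                      # keep old centroid if cluster is empty
                centroids[j] = members.mean(axis=0)
    return labels, centroids, max_iter

# Toy data invented for illustration (not the UCI postoperative patient data).
toy = np.array([[1, 1], [1, 2], [2, 1], [8, 9], [9, 8], [9, 9]], dtype=float)
labels, centers, passes = kmeans(toy, entropy_seeds(toy, k=2))
print(labels, passes)
```

To compare against standard K Means in the spirit of the abstract, the same `kmeans` function can be called with randomly drawn records as `centroids` and the returned pass counts compared across runs.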


Pages: 6

50 records in total

[1] Blood Bank Clustering: Improving Performance of Clustering using Entropy Weighted K-Means
Srinivas, M. Satya
Lakshmi, P. Vijaya
Kumar, V. Kalyan Durga Shyam
Balaji, V. Siva Sai
2021 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, SMART AND GREEN TECHNOLOGIES (ICISSGT 2021), 2021, : 37 - 41
[2] Rough Entropy Based k-Means Clustering
Malyszko, Dariusz
Stepaniuk, Jaroslaw
ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2009, 5908 : 406 - 413
[3] Entropy Weighted Power k-Means Clustering
Chakraborty, Saptarshi
Paul, Debolina
Das, Swagatam
Xu, Jason
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 691 - 700
[4] K-means clustering using entropy minimization
Okafor, A
Pardalos, PM
THEORY AND ALGORITHMS FOR COOPERATIVE SYSTEMS, 2004, 4 : 339 - 351
[5] Entropy Based Soft K-means Clustering
Bai, Xue
Luo, Siwei
Zhao, Yibiao
2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 107 - 110
[6] Minimum entropy, k-means, spectral clustering
Lee, Y
Choi, S
2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 117 - 122
[7] Performance Analysis of K Means Clustering Algorithms for mMTC Systems
Kim, Haesik
11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 30 - 35
[8] K-means clustering algorithm using the entropy
Palubinskas, G
IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING IV, 1998, 3500 : 63 - 71
[9] PERFORMANCE ANALYSIS OF COMBINED METHODS OF GENETIC ALGORITHM AND K-MEANS CLUSTERING IN DETERMINING THE VALUE OF CENTROID
Putra, Adya Zizwan
Zarlis, Muhammad
Nababan, Erna Budhiarti
INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICONICT), 2017, 930
[10] ASSESSING THE PERFORMANCE OF K-MEANS AND DBSCAN CLUSTERING METHODS IN TUBERCULOSIS MAPPING
Faidah, Defi yusti
Destin, Dianda
Anggina, Fazila azra
Caesar, Muhammad imamul
COMMUNICATIONS IN MATHEMATICAL BIOLOGY AND NEUROSCIENCE, 2025,
