A Survey on K-Means Clustering for Analyzing Variation in Data

被引：5

作者：

Patil, Pratik ^{[1
]}

Karthikeyan, A. ^{[2
]}

机构：

[1] VIT, M Tech Embedded Syst, Vellore, Tamil Nadu, India

[2] VIT, Sch Elect Engn, Vellore, Tamil Nadu, India

来源：

INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES, ICICCT 2019 | 2020年 / 89卷

关键词：

K-means; Clustering; Machine learning; Dataset; Variation; Analysis; Data mining; Iterations; Parameters; Eucledian; Structure;

D O I：

10.1007/978-981-15-0146-3_29

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Most of the times data for certain task seems to be varying due constant changes made to method of data collection as well as due to inclusion of new parameters related to the task. This may result in false conclusion derived from data generated and might lead to failure in task or degradation in the standard of activity related to that task which is being monitored from that data. Clustering is basically the grouping of similar kind of data wherein each cluster consist of data with some similarities. Whereas most of the data is unstructured or semi-structured, and that's where unsupervised K-means Clustering method plays role to convert the data into structured one's for clustering. This paper consist of K-means clustering method which is being used to keep an eye on such variations which are occurring in data generated for a task when certain changes are incorporated in technique to track this data.

引用

页码：317 / 323

页数：7

共 50 条

[41] An efficient approximation to the K-means clustering for massive data
Capo, Marco
Perez, Aritz
Lozano, Jose A.
KNOWLEDGE-BASED SYSTEMS, 2017, 117 : 56 - 69
[42] Initializing k-means Clustering by Bootstrap and Data Depth
Aurora Torrente
Juan Romo
Journal of Classification, 2021, 38 : 232 - 256
[43] Parallel batch k-means for Big data clustering
Alguliyev, Rasim M.
Aliguliyev, Ramiz M.
Sukhostat, Lyudmila, V
COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 152
[44] On K-means Data Clustering Algorithm with Genetic Algorithm
Kapil, Shruti
Chawla, Meenu
Ansari, Mohd Dilshad
2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 202 - 206
[45] Modified K-means Algorithm for Big Data Clustering
Sengupta, Debapriya
Roy, Sayantan Singha
Ghosh, Sarbani
Dasgupta, Ranjan
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, : 1443 - 1448
[46] Private Distributed K-Means Clustering on Interval Data
Huang, Dingquan
Yao, Xin
An, Senquan
Ren, Shengbing
2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
[47] Weighted kernel K-means for clustering spatial data
Faculty of Computer Science and Information Systems, University Technology Malaysia, Skudai 81310 Johor, Malaysia
WSEAS Trans. Syst., 2006, 6 (1301-1308):
[48] Selection of K in K-means clustering
Pham, DT
Dimov, SS
Nguyen, CD
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2005, 219 (01) : 103 - 119
[49] Geodesic K-means Clustering
Asgharbeygi, Nima
Maleki, Arian
19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3450 - 3453
[50] Stability of k-means clustering
Ben-David, Shai
Pal, Ddvid
Simon, Hans Ulrich
LEARNING THEORY, PROCEEDINGS, 2007, 4539 : 20 - +

← 1 2 3 4 5 →