A Survey on K-Means Clustering for Analyzing Variation in Data

被引:5
|
作者
Patil, Pratik [1 ]
Karthikeyan, A. [2 ]
机构
[1] VIT, M Tech Embedded Syst, Vellore, Tamil Nadu, India
[2] VIT, Sch Elect Engn, Vellore, Tamil Nadu, India
关键词
K-means; Clustering; Machine learning; Dataset; Variation; Analysis; Data mining; Iterations; Parameters; Eucledian; Structure;
D O I
10.1007/978-981-15-0146-3_29
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the times data for certain task seems to be varying due constant changes made to method of data collection as well as due to inclusion of new parameters related to the task. This may result in false conclusion derived from data generated and might lead to failure in task or degradation in the standard of activity related to that task which is being monitored from that data. Clustering is basically the grouping of similar kind of data wherein each cluster consist of data with some similarities. Whereas most of the data is unstructured or semi-structured, and that's where unsupervised K-means Clustering method plays role to convert the data into structured one's for clustering. This paper consist of K-means clustering method which is being used to keep an eye on such variations which are occurring in data generated for a task when certain changes are incorporated in technique to track this data.
引用
收藏
页码:317 / 323
页数:7
相关论文
共 50 条
  • [41] An efficient approximation to the K-means clustering for massive data
    Capo, Marco
    Perez, Aritz
    Lozano, Jose A.
    KNOWLEDGE-BASED SYSTEMS, 2017, 117 : 56 - 69
  • [42] Initializing k-means Clustering by Bootstrap and Data Depth
    Aurora Torrente
    Juan Romo
    Journal of Classification, 2021, 38 : 232 - 256
  • [43] Parallel batch k-means for Big data clustering
    Alguliyev, Rasim M.
    Aliguliyev, Ramiz M.
    Sukhostat, Lyudmila, V
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 152
  • [44] On K-means Data Clustering Algorithm with Genetic Algorithm
    Kapil, Shruti
    Chawla, Meenu
    Ansari, Mohd Dilshad
    2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 202 - 206
  • [45] Modified K-means Algorithm for Big Data Clustering
    Sengupta, Debapriya
    Roy, Sayantan Singha
    Ghosh, Sarbani
    Dasgupta, Ranjan
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, : 1443 - 1448
  • [46] Private Distributed K-Means Clustering on Interval Data
    Huang, Dingquan
    Yao, Xin
    An, Senquan
    Ren, Shengbing
    2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
  • [47] Weighted kernel K-means for clustering spatial data
    Faculty of Computer Science and Information Systems, University Technology Malaysia, Skudai 81310 Johor, Malaysia
    WSEAS Trans. Syst., 2006, 6 (1301-1308):
  • [48] Selection of K in K-means clustering
    Pham, DT
    Dimov, SS
    Nguyen, CD
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2005, 219 (01) : 103 - 119
  • [49] Geodesic K-means Clustering
    Asgharbeygi, Nima
    Maleki, Arian
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3450 - 3453
  • [50] Stability of k-means clustering
    Ben-David, Shai
    Pal, Ddvid
    Simon, Hans Ulrich
    LEARNING THEORY, PROCEEDINGS, 2007, 4539 : 20 - +