A Survey on K-Means Clustering for Analyzing Variation in Data

被引:5
|
作者
Patil, Pratik [1 ]
Karthikeyan, A. [2 ]
机构
[1] VIT, M Tech Embedded Syst, Vellore, Tamil Nadu, India
[2] VIT, Sch Elect Engn, Vellore, Tamil Nadu, India
关键词
K-means; Clustering; Machine learning; Dataset; Variation; Analysis; Data mining; Iterations; Parameters; Eucledian; Structure;
D O I
10.1007/978-981-15-0146-3_29
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the times data for certain task seems to be varying due constant changes made to method of data collection as well as due to inclusion of new parameters related to the task. This may result in false conclusion derived from data generated and might lead to failure in task or degradation in the standard of activity related to that task which is being monitored from that data. Clustering is basically the grouping of similar kind of data wherein each cluster consist of data with some similarities. Whereas most of the data is unstructured or semi-structured, and that's where unsupervised K-means Clustering method plays role to convert the data into structured one's for clustering. This paper consist of K-means clustering method which is being used to keep an eye on such variations which are occurring in data generated for a task when certain changes are incorporated in technique to track this data.
引用
收藏
页码:317 / 323
页数:7
相关论文
共 50 条
  • [21] A Framework Based on K-Means Clustering and Topic Modeling for Analyzing Unstructured Manufacturing Capability Data
    Sabbagh, Ramin
    Ameri, Farhad
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2020, 20 (01)
  • [22] Analyzing the Evolution of Rare Events via Social Media Data and k-means Clustering Algorithm
    Lu, Xiaoyu Sean
    Zhou, MengChu
    2016 IEEE 13TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING, AND CONTROL (ICNSC), 2016,
  • [23] K-Means Cloning: Adaptive Spherical K-Means Clustering
    Hedar, Abdel-Rahman
    Ibrahim, Abdel-Monem M.
    Abdel-Hakim, Alaa E.
    Sewisy, Adel A.
    ALGORITHMS, 2018, 11 (10):
  • [24] An efficient K-means clustering algorithm for tall data
    Capo, Marco
    Perez, Aritz
    Lozano, Jose A.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (03) : 776 - 811
  • [25] An efficient K-means clustering algorithm for tall data
    Marco Capó
    Aritz Pérez
    Jose A. Lozano
    Data Mining and Knowledge Discovery, 2020, 34 : 776 - 811
  • [26] An extension of the K-means algorithm to clustering skewed data
    Volodymyr Melnykov
    Xuwen Zhu
    Computational Statistics, 2019, 34 : 373 - 394
  • [27] Clustering the Patent Data Using K-Means Approach
    Anuranjana
    Mittas, Nisha
    Mehrotra, Deepti
    SOFTWARE ENGINEERING (CSI 2015), 2019, 731 : 639 - 645
  • [28] Optimized data fusion for K-means Laplacian clustering
    Yu, Shi
    Liu, Xinhai
    Tranchevent, Leon-Charles
    Glanzel, Wolfgang
    Suykens, Johan A. K.
    De Moor, Bart
    Moreau, Yves
    BIOINFORMATICS, 2011, 27 (01) : 118 - 126
  • [29] Parallelization of K-Means Clustering Algorithm for Data Mining
    Jiang, Hao
    Yu, Liyan
    4TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2017), 2017, 12
  • [30] K-means Clustering with Feature Selection for Stream Data
    Wang, Xiao-dong
    Chen, Rung-Ching
    Yan, Fei
    Hendry
    2018 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2018), 2018, : 453 - 456