Variance Based Moving K-Means Algorithm

Cited by: 0
Authors
Vijay, Vibin [1]
Raghunath, V. P. [1]
Singh, Amarjot [2]
Omkar, S. N. [3]
Affiliations
[1] Natl Inst Technol Warangal, Dept Elect & Commun Engn, Warangal, Andhra Pradesh, India
[2] Simon Fraser Univ, Sch Engn Sci, Burnaby, BC, Canada
[3] Indian Inst Sci, Dept Aerosp Engn, Bangalore, Karnataka, India
Keywords
Data clustering; Intra-cluster variance; Dead centers; Image processing; Gene expression
DOI
10.1109/IACC.2017.164
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Clustering is a useful exploratory data analysis method with wide applicability across many fields. However, data clustering relies heavily on the initialization of cluster centers, and a poor initialization can result in large intra-cluster variance and dead centers, leading to sub-optimal solutions. This paper proposes a novel variance-based version of the conventional Moving K-Means (MKM) algorithm, called Variance Based Moving K-Means (VMKM), that can partition data into optimal homogeneous clusters irrespective of cluster initialization. The algorithm uses a novel distance metric and a unique data element selection criterion to transfer the selected elements between clusters, achieving low intra-cluster variance and thereby avoiding dead centers. Quantitative and qualitative comparisons with various clustering techniques are performed on four datasets drawn from image processing, bioinformatics, remote sensing, and the stock market, respectively. An extensive analysis highlights the superior performance of the proposed method over the other techniques.
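The two failure modes named in the abstract, large intra-cluster variance and dead centers, can be illustrated with a minimal sketch. This is not the VMKM algorithm, whose distance metric and selection criterion are not given in this record; it only shows how the two quantities could be measured after an ordinary nearest-center assignment. The function names (`assign`, `intra_cluster_variance`, `dead_centers`) and the sample data are hypothetical, and "dead center" is assumed here to mean a center with no assigned points.

```python
def assign(points, centers):
    """Label each point with the index of its nearest center (squared Euclidean)."""
    labels = []
    for p in points:
        dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centers]
        labels.append(dists.index(min(dists)))
    return labels


def intra_cluster_variance(points, labels, k):
    """Mean squared distance of members to their cluster mean; None if the cluster is empty."""
    variances = []
    for j in range(k):
        members = [p for p, lab in zip(points, labels) if lab == j]
        if not members:
            variances.append(None)
            continue
        dim = len(members[0])
        mean = [sum(p[d] for p in members) / len(members) for d in range(dim)]
        total = sum(sum((p[d] - mean[d]) ** 2 for d in range(dim)) for p in members)
        variances.append(total / len(members))
    return variances


def dead_centers(labels, k):
    """Indices of centers that received no points (assumed meaning of 'dead center')."""
    used = set(labels)
    return [j for j in range(k) if j not in used]


# Hypothetical data: the third center is initialized far from every point.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
centers = [(0.0, 1.0), (10.0, 1.0), (100.0, 100.0)]

labels = assign(points, centers)
variances = intra_cluster_variance(points, labels, len(centers))
dead = dead_centers(labels, len(centers))
print(labels, variances, dead)
```

In this toy case the badly initialized third center attracts no points, which is the initialization sensitivity the abstract describes for conventional clustering.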
Pages: 841-847
Number of pages: 7