Entropy Weighted Power k-Means Clustering

被引:0
|
作者
Chakraborty, Saptarshi [1 ]
Paul, Debolina [1 ]
Das, Swagatam [2 ]
Xu, Jason [3 ]
机构
[1] Indian Stat Inst, Kolkata, India
[2] Indian Stat Inst, Elect & Commun Sci Unit, Kolkata, India
[3] Duke Univ, Dept Stat Sci, Durham, NC 27706 USA
关键词
STRONG CONSISTENCY; FEATURE-SELECTION; OPTIMIZATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite its well-known shortcomings, k-means remains one of the most widely used approaches to data clustering. Current research continues to tackle its flaws while attempting to preserve its simplicity. Recently, the power k-means algorithm was proposed to avoid poor local minima by annealing through a family of smoother surfaces. However, the approach lacks statistical guarantees and fails in high dimensions when many features are irrelevant. This paper addresses these issues by introducing entropy regularization to learn feature relevance while annealing. We prove consistency of the proposed approach and derive a scalable majorization-minimization algorithm that enjoys closed-form updates and convergence guarantees. In particular, our method retains the same computational complexity of k-means and power k-means, but yields significant improvements over both. Its merits are thoroughly assessed on a suite of real and synthetic data experiments.
引用
收藏
页码:691 / 700
页数:10
相关论文
共 50 条
  • [1] Blood Bank Clustering: Improving Performance of Clustering using Entropy Weighted K-Means
    Srinivas, M. Satya
    Lakshmi, P. Vijaya
    Kumar, V. Kalyan Durga Shyam
    Balaji, V. Siva Sai
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, SMART AND GREEN TECHNOLOGIES (ICISSGT 2021), 2021, : 37 - 41
  • [2] The LINEX Weighted k-Means Clustering
    Ahmadzadehgoli, Narges
    Mohammadpour, Adel
    Behzadi, Mohammad Hassan
    [J]. JOURNAL OF STATISTICAL THEORY AND APPLICATIONS, 2019, 18 (02): : 147 - 154
  • [3] The LINEX Weighted k-Means Clustering
    Narges Ahmadzadehgoli
    Adel Mohammadpour
    Mohammad Hassan Behzadi
    [J]. Journal of Statistical Theory and Applications, 2019, 18 : 147 - 154
  • [4] Power k-Means Clustering
    Xu, Jason
    Lange, Kenneth
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [5] Constrained Clustering with Minkowski Weighted K-Means
    de Amorim, Renato Cordeiro
    [J]. 13TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI 2012), 2012, : 13 - 17
  • [6] Rough Entropy Based k-Means Clustering
    Malyszko, Dariusz
    Stepaniuk, Jaroslaw
    [J]. ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2009, 5908 : 406 - 413
  • [7] Weighted adjacent matrix for K-means clustering
    Zhou, Jukai
    Liu, Tong
    Zhu, Jingting
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (23) : 33415 - 33434
  • [8] Weighted adjacent matrix for K-means clustering
    Jukai Zhou
    Tong Liu
    Jingting Zhu
    [J]. Multimedia Tools and Applications, 2019, 78 : 33415 - 33434
  • [9] K-means clustering using entropy minimization
    Okafor, A
    Pardalos, PM
    [J]. THEORY AND ALGORITHMS FOR COOPERATIVE SYSTEMS, 2004, 4 : 339 - 351
  • [10] Entropy Based Soft K-means Clustering
    Bai, Xue
    Luo, Siwei
    Zhao, Yibiao
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 107 - 110