A Binary Optimization Approach for Constrained K-Means Clustering

被引:6
|
作者
Le, Huu M. [1 ]
Eriksson, Anders [1 ]
Thanh-Toan Do [2 ]
Milford, Michael [1 ]
机构
[1] Queensland Univ Technol, Brisbane, Qld, Australia
[2] Univ Liverpool, Liverpool, Merseyside, England
来源
基金
澳大利亚研究理事会;
关键词
PRODUCT QUANTIZATION;
D O I
10.1007/978-3-030-20870-7_24
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
K-Means clustering still plays an important role in many computer vision problems. While the conventional Lloyd method, which alternates between centroid update and cluster assignment, is primarily used in practice, it may converge to solutions with empty clusters. Furthermore, some applications may require the clusters to satisfy a specific set of constraints, e.g., cluster sizes, must-link/cannot-link. Several methods have been introduced to solve constrained K-Means clustering. Due to the non-convex nature of K-Means, however, existing approaches may result in sub-optimal solutions that poorly approximate the true clusters. In this work, we provide a new perspective to tackle this problem by considering constrained K-Means as a special instance of Binary Optimization. We then propose a novel optimization scheme to search for feasible solutions in the binary domain. This approach allows us to solve constrained K-Means clustering in such a way that multiple types of constraints can be simultaneously enforced. Experimental results on synthetic and real datasets show that our method provides better clustering accuracy with faster run time compared to several existing techniques.
引用
收藏
页码:383 / 398
页数:16
相关论文
共 50 条
  • [1] Constrained Clustering with Minkowski Weighted K-Means
    de Amorim, Renato Cordeiro
    [J]. 13TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI 2012), 2012, : 13 - 17
  • [2] Towards effective document clustering:: A constrained K-means based approach
    Hu, Guobiao
    Zhou, Shuigeng
    Guan, Jihong
    Hu, Xiaohua
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (04) : 1397 - 1409
  • [3] Manifold optimization for k-means clustering
    Carson, Timothy
    Mixon, Dustin G.
    Villar, Soledad
    Ward, Rachel
    [J]. 2017 INTERNATIONAL CONFERENCE ON SAMPLING THEORY AND APPLICATIONS (SAMPTA), 2017, : 73 - 77
  • [4] Clustering Using Boosted Constrained k-Means Algorithm
    Okabe, Masayuki
    Yamada, Seiji
    [J]. FRONTIERS IN ROBOTICS AND AI, 2018, 5
  • [5] Differentially Private K-Means Clustering and a Hybrid Approach to Private Optimization
    Su, Dong
    Cao, Jianneng
    Li, Ninghui
    Bertino, Elisa
    Lyu, Min
    Jin, Hongxia
    [J]. ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2017, 20 (04)
  • [6] Self-learning K-means clustering: a global optimization approach
    Volkovich, Z.
    Toledano-Kitai, D.
    Weber, G. -W.
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2013, 56 (02) : 219 - 232
  • [7] Self-learning K-means clustering: a global optimization approach
    Z. Volkovich
    D. Toledano-Kitai
    G.-W. Weber
    [J]. Journal of Global Optimization, 2013, 56 : 219 - 232
  • [8] In-DBMS K-means Clustering for Binary Streams
    Garcia-Alvarado, Carlos
    Ordonez, Carlos
    [J]. Proceedings - 2022 IEEE International Conference on Big Data, Big Data 2022, 2022, : 1987 - 1996
  • [9] A k-means approach to clustering disease progressions
    Duc Thanh Anh Luong
    Chandola, Varun
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2017, : 268 - 274
  • [10] Hierarchical initialization approach for K-Means clustering
    Lu, J. F.
    Tang, J. B.
    Tang, Z. M.
    Yang, J. Y.
    [J]. PATTERN RECOGNITION LETTERS, 2008, 29 (06) : 787 - 795