Quantization/Clustering: when and why does k-means work?

被引:0
|
作者
Levrard, Clement [1 ]
机构
[1] Univ Paris Diderot, LPMA, 8 Pl Aure Lie Nemours, F-75013 Paris, France
来源
JOURNAL OF THE SFDS | 2018年 / 159卷 / 01期
关键词
k-means; clustering; quantization; separation rate; distortion;
D O I
暂无
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Though mostly used as a clustering algorithm, k-means is originally designed as a quantization algorithm. Namely, it aims at providing a compression of a probability distribution with k points. Building upon Levrard (2015); Tang and Monteleoni (2016a), we try to investigate how and when these two approaches are compatible. Namely, we show that provided the sample distribution satisfies a margin like condition (in the sense of Mammen and Tsybakov, 1999 for supervised learning), both the associated empirical risk minimizer and the output of Lloyd's algorithm provide almost optimal classification in certain cases (in the sense of Azizyan et al., 2013). Besides, we also show that they achieved fast and optimal convergence rates in terms of sample size and compression risk.
引用
收藏
页码:1 / 26
页数:26
相关论文
共 50 条
  • [1] In Search of a New Initialization of K-Means Clustering for Color Quantization
    Frackiewicz, Mariusz
    Palus, Henryk
    EIGHTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2015), 2015, 9875
  • [2] Accelerated k-means clustering algorithm for colour image quantization
    Hu, Y-C
    Su, B-H
    IMAGING SCIENCE JOURNAL, 2008, 56 (01): : 29 - 40
  • [3] Vector quantization using k-means clustering neural network
    Im, Sio-Kei
    Chan, Ka-Hou
    ELECTRONICS LETTERS, 2023, 59 (07)
  • [4] The Complexity of k-Means Clustering when Little is Known
    Ganian, Robert
    Hamm, Thekla
    Korchemna, Viktoriia
    Okrasa, Karolina
    Simonov, Kirill
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [5] QUANTIZATION AND THE METHOD OF K-MEANS
    POLLARD, D
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1982, 28 (02) : 199 - 205
  • [6] Fast Color Quantization by K-Means Clustering Combined with Image Sampling
    Frackiewicz, Mariusz
    Mandrella, Aron
    Palus, Henryk
    SYMMETRY-BASEL, 2019, 11 (08):
  • [7] Color quantization using an accelerated Jancey k-means clustering algorithm
    Bounds, Harrison
    Celebi, M. Emre
    Maxwell, Jordan
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (05)
  • [8] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
  • [9] Fuzzy K-Means Incremental Clustering Based on K-Center and Vector Quantization
    Li, Taoying
    Chen, Yan
    JOURNAL OF COMPUTERS, 2010, 5 (11) : 1670 - 1677
  • [10] K-Means Cloning: Adaptive Spherical K-Means Clustering
    Hedar, Abdel-Rahman
    Ibrahim, Abdel-Monem M.
    Abdel-Hakim, Alaa E.
    Sewisy, Adel A.
    ALGORITHMS, 2018, 11 (10):