A Convergent Differentially Private k-Means Clustering Algorithm

被引:13
|
作者
Lu, Zhigang [1 ]
Shen, Hong [1 ,2 ]
机构
[1] Univ Adelaide, Sch Comp Sci, Adelaide, SA, Australia
[2] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
基金
澳大利亚研究理事会; 国家重点研发计划;
关键词
Differential privacy; Adversarial machine learning; k-means clustering;
D O I
10.1007/978-3-030-16148-4_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Preserving differential privacy (DP) for the iterative clustering algorithms has been extensively studied in the interactive and the non-interactive settings. However, existing interactive differentially private clustering algorithms suffer from a non-convergence problem, i.e., these algorithms may not terminate without a predefined number of iterations. This problem severely impacts the clustering quality and the efficiency of the algorithm. To resolve this problem, we propose a novel iterative approach in the interactive settings which controls the orientation of the centroids movement over the iterations to ensure the convergence by injecting DP noise in a selected area. We prove that, in the expected case, our approach converges to the same centroids as Lloyd's algorithm in at most twice the iterations of Lloyd's algorithm. We perform experimental evaluations on real-world datasets to show that our algorithm outperforms the state-of-the-art of the interactive differentially private clustering algorithms with a guaranteed convergence and better clustering quality to meet the same DP requirement.
引用
收藏
页码:612 / 624
页数:13
相关论文
共 50 条
  • [41] Research on Improved K-means Clustering Algorithm
    Zhang, Yinsheng
    Shan, Huilin
    Li, Jiaqiang
    Zhou, Jie
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 1977 - 1980
  • [42] The Global Kernel k-Means Clustering Algorithm
    Tzortzis, Grigorios
    Likas, Aristidis
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1977 - 1984
  • [43] The Improvement and Application of a K-Means Clustering Algorithm
    Tao, Li Jun
    Hong, Liu Yin
    Yan, Hao
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2016), 2016, : 93 - 96
  • [44] Clustering with Spectral Norm and the k-means Algorithm
    Kumar, Amit
    Kannan, Ravindran
    2010 IEEE 51ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, 2010, : 299 - 308
  • [45] An Improved Kernel K-means Clustering Algorithm
    Liu, Yang
    Yin, Hong Peng
    Chai, Yi
    PROCEEDINGS OF 2016 CHINESE INTELLIGENT SYSTEMS CONFERENCE, VOL I, 2016, 404 : 275 - 280
  • [46] A Clustering Method Based on K-Means Algorithm
    Li, Youguo
    Wu, Haiyan
    INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 1104 - 1109
  • [47] Efficient enhanced k-means clustering algorithm
    Fahim A.M.
    Salem A.M.
    Torkey F.A.
    Ramadan M.A.
    Journal of Zhejiang University-SCIENCE A, 2006, 7 (10): : 1626 - 1633
  • [48] A Modified K-means Algorithm for Sequence Clustering
    Hsu, Jia-Lien
    Yang, Hong-Xiang
    HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 1, PROCEEDINGS, 2009, : 287 - 292
  • [49] Clustering Performance of an Evolutionary K-Means Algorithm
    Nigro, Libero
    Cicirelli, Franco
    Pupo, Francesco
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 9, ICICT 2024, 2025, 1054 : 359 - 369
  • [50] A Novel ELM K-Means Algorithm for Clustering
    Alshamiri, Abobakr Khalil
    Surampudi, Bapi Raju
    Singh, Alok
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, SEMCCO 2014, 2015, 8947 : 212 - 222