Differentially Private K-Means Publishing with Distributed Dimensions

被引:0
|
作者
Zhu, Boyu [1 ]
Zhang, Yuan [1 ]
Chen, Tingting [2 ]
Zhong, Sheng [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Comp Sci & Technol Dept, Nanjing, Peoples R China
[2] Calif State Polytech Univ Pomona, Dept Comp Sci, Coll Sci, Pomona, CA USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
D O I
10.1109/CSCWD61410.2024.10580021
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we address the critical concerns related to dataset privacy in the context of k-means clustering publishing within a distributed dimension setting. By leveraging differential privacy mechanisms, we propose a novel framework that integrates a differentially private classifier, constructed through voting based on raw clustering results, and an enhanced generative adversarial network (GAN) simulating the classifier's behavior in inferring class labels for a public dataset. Our approach generates synthetic clustering results that mimic real outcomes in classification tasks, ensuring differential privacy and minimizing noise. Our contributions include a comprehensive exploration of privacy issues, the introduction of a novel privacy-preserving k-means clustering framework, and theoretical analyses demonstrating sensitivity and differential privacy guarantees. Evaluation on the MNIST dataset demonstrates the effectiveness of the framework, achieving 82.22% accuracy with a (10.48, 10-9)-differential-privacy guarantee, compared to 83.45% accuracy without privacy-preserving.
引用
收藏
页码:3263 / 3268
页数:6
相关论文
共 50 条
  • [1] DIFFERENTIALLY PRIVATE COMPRESSIVE K-MEANS
    Schellekens, V.
    Chatalic, A.
    Houssiau, F.
    de Montjoye, Y. -A.
    Jacques, L.
    Gribonval, R.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7933 - 7937
  • [2] Differentially Private K-Means Clustering
    Su, Dong
    Cao, Jianneng
    Li, Ninghui
    Bertino, Elisa
    Jin, Hongxia
    CODASPY'16: PROCEEDINGS OF THE SIXTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, 2016, : 26 - 37
  • [3] K-Means Clustering with Distributed Dimensions
    Ding, Hu
    Liu, Yu
    Huang, Lingxiao
    Li, Jian
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [4] Optimal Differentially Private Algorithms for k-Means Clustering
    Huang, Zhiyi
    Liu, Jinyan
    PODS'18: PROCEEDINGS OF THE 37TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2018, : 395 - 408
  • [5] Differentially Private k-Means with Constant Multiplicative Error
    Kaplan, Haim
    Stemmer, Uri
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [6] A Convergent Differentially Private k-Means Clustering Algorithm
    Lu, Zhigang
    Shen, Hong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I, 2019, 11439 : 612 - 624
  • [7] Differentially Private k-Means Clustering With Convergence Guarantee
    Lu, Zhigang
    Shen, Hong
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (04) : 1541 - 1552
  • [8] Differentially Private K-Means Clustering and a Hybrid Approach to Private Optimization
    Su, Dong
    Cao, Jianneng
    Li, Ninghui
    Bertino, Elisa
    Lyu, Min
    Jin, Hongxia
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2017, 20 (04)
  • [9] A Differentially Private K-means Clustering Scheme for Smart Grid
    Guo, Shuai
    Wen, Mi
    Liang, Xiaohui
    International Journal of Network Security, 2021, 23 (01) : 126 - 134
  • [10] Private Distributed K-Means Clustering on Interval Data
    Huang, Dingquan
    Yao, Xin
    An, Senquan
    Ren, Shengbing
    2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,