Distributed K-Means clustering guaranteeing local differential privacy

被引:37
|
作者
Xia, Chang [1 ]
Hua, Jingyu [1 ]
Tong, Wei [1 ]
Zhong, Sheng [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
基金
国家重点研发计划;
关键词
Differential privacy; Randomized response; Machine learning; Distributed clustering; K-Means;
D O I
10.1016/j.cose.2019.101699
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In many cases, a service provider might require to aggregate data from end-users to perform mining tasks such as K-means clustering. Nevertheless, since such data often contain sensitive information. In this paper, we propose the first locally differentially private K-means mechanism under this distributed scenario. Differing from standard differentially private clustering mechanisms, the proposed mechanism doesn't need any trusted third party to collect and preprocess users data. Our mechanism first perturbs users data locally to satisfy local differential privacy (LDP). Then it revises the traditional K-means algorithm to allow the service provider to obtain high-quality clustering results by collaborating with users based on the highly perturbed data. We prove that our mechanism can enable high utility clustering while guaranteeing local differential privacy for each user. We also propose an extended mechanism to improve our basic model in terms of privacy and utility. In this mechanism, we perturb both users' sensitive data and the intermediate results of users' clusters in each iteration. Moreover, we consider a more general setting where the users may have different privacy requirements. Extensive experiments are conducted on two real-world datasets, and the results show that our proposal can well preserve the quality of clustering results. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Distributed Sparse Subspace Clustering by K-Means Subspace Fusion
    Huang, Liang-Chi
    Hong, Y. -W. Peter
    Wu, Jwo-Yuh
    2024 IEEE 13RD SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP, SAM 2024, 2024,
  • [42] RETRACTED: CVDP k-means clustering algorithm for differential privacy based on coefficient of variation (Retracted Article)
    Kong, Yuting
    Qian, Yurong
    Tan, Fuxiang
    Bai, Lu
    Shao, Jinxin
    Ma, Tinghuai
    Tereshchenko, Sergei Nikolayevich
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 6027 - 6045
  • [43] An Optimal Distributed K-Means Clustering Algorithm Based on CloudStack
    Mao, Yingchi
    Xu, Ziyang
    Li, Xiaofang
    Ping, Ping
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 3149 - 3156
  • [44] An Optimal Distributed K-Means Clustering Algorithm Based on CloudStack
    Mao, Yingchi
    Xu, Ziyang
    Ping, Ping
    Wang, Longbao
    2015 NINTH INTERNATIONAL CONFERENCE ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY FCST 2015, 2015, : 386 - 391
  • [45] A Lightweight Mutual Privacy Preserving k-Means Clustering in Industrial IoT
    Hu, Chunqiang
    Liu, Jianshuo
    Xia, Hui
    Deng, Shaojiang
    Yu, Jiguo
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 2138 - 2152
  • [46] Privacy-Preserving K-Means Clustering Upon Negative Databases
    Hu, Xiaoyi
    Lu, Liping
    Zhao, Dongdong
    Xiang, Jianwen
    Liu, Xing
    Zhou, Haiying
    Xiong, Shengwu
    Tian, Jing
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 191 - 204
  • [47] Mutual Privacy Preserving k-Means Clustering in Social Participatory Sensing
    Xing, Kai
    Hu, Chunqiang
    Yu, Jiguo
    Cheng, Xiuzhen
    Zhang, Fengjuan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (04) : 2066 - 2076
  • [48] Importance of Data Standardization in Privacy-Preserving K-Means Clustering
    Su, Chunhua
    Zhan, Justin
    Sakurai, Kouichi
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2009, 5667 : 276 - +
  • [49] Privacy preserving k-means clustering in multi-party environment
    Samet, Saeed
    Miri, Ali
    Orozco-Barbosa, Luis
    SECRYPT 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SECURITY AND CRYPTOGRAPHY, 2007, : 381 - +
  • [50] K-Means Cloning: Adaptive Spherical K-Means Clustering
    Hedar, Abdel-Rahman
    Ibrahim, Abdel-Monem M.
    Abdel-Hakim, Alaa E.
    Sewisy, Adel A.
    ALGORITHMS, 2018, 11 (10):