A Local Differential Privacy Based Privacy-Preserving Grid Clustering Method

被引:0
|
作者
Zhang D.-Y. [1 ]
Ni W.-W. [1 ]
Zhang S. [1 ]
Fu N. [1 ]
Hou L.-H. [1 ]
机构
[1] Department of Computer Science and Engineering, Southeast University, Nanjing 211189, Key Laboratory of Computer Network and Information Integration in Southeast University, Ministry of Education, Nanjing 211189
来源
关键词
circular feedback mechanism; evaluation index of grid division; grid clustering; local differential privacy; privacy protection;
D O I
10.11897/SP.J.1016.2023.00422
中图分类号
学科分类号
摘要
With the continuous deepening of mobile Internet applications, a large number of individual data have been produced. Collecting data distributed on different terminals for clustering can find the behavior patterns of people and support the in-depth development of application services. However, these data often contain individual sensitive information. Directly collecting data for clustering has the risk of revealing individual data privacy in the case of a lack of trusted data collectors. In recent years, localized differential Privacy (local differential privacy, LDP) has been continuously concerned by researchers in privacy protection because of its rigorous mathematical theory. Most of the existing LDP-based clustering methods use partition-based clustering methods, which are only suitable for convex distribution data and have the problem of large clustering quality loss. To tackle these problems, we focus on grid clustering and propose a local differential privacy based privacy-preserving grid clustering method. Firstly, we design an evaluation index of grid division, which adjusts the grid density estimation error and the loss of cluster edge information to guide the selection of grid structure. Then, we construct a cyclic feedback mechanism between the server and the terminal, which uses data distribution information to iteratively optimize the disturbance granularity, reduce the amount of differential noise injection, and improve the accuracy of grid density estimation. Finally, we propose an adaptive grid aggregation method based on grid structure to improve the accuracy of privacy protection clustering on the server-side. Theoretical analysis and experimental results show that the proposed method considers the privacy of each terminal's individual data and has a good clustering effect on different distributed data. © 2023 Science Press. All rights reserved.
引用
收藏
页码:422 / 435
页数:13
相关论文
共 31 条
  • [1] Min E, Guo X, Liu Q, Et al., A survey of clustering with deep learning: From the perspective of network architecture, IEEE Access, 6, pp. 39501-39514, (2018)
  • [2] Zhang H, Lin Z, Zhang C, Et al., Robust latent low rank representation for subspace clustering, Neurocomputing, 145, pp. 369-373, (2014)
  • [3] Huang Mao-Feng, Ni Wei-Wei, Wang Jia-Jun, Et al., A logarithmic .spiral based data perturbation method for clustering, Chinese Journal of Computers, 35, 11, pp. 2275-2282, (2012)
  • [4] Kasiviswanathan S P, Lee H K, Nissim K, Et al., What can we learn privately?, SIAM Journal on Computing, 40, 3, pp. 793-826, (2011)
  • [5] Ding B, Kulkarni J, Yekhanin S., Collecting telemetry data privately, Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 5430-5439, (2017)
  • [6] Johnson N, Near J P, Song D., Towards practical differential privacy for SQL queries, Proceedings of the VLDB Endowment, 11, 5, pp. 526-539, (2018)
  • [7] Ouyang Jia, Yin Jian, Xiao Zheng-Hong, Et al., Transaction data collection for itemset mining under local differential privacy, Journal of Software, 32, 11, pp. 3541-3562, (2021)
  • [8] Zhang Xiao-Jian, Fu Nan, Meng Xiao-Feng, Key-value data accurate collection under local differential privacy, Journal of Computers, 43, 8, pp. 1479-1492, (2020)
  • [9] Erlingsson O, Pihur V, Korolova A., RAPPOR: Randomized aggregatable privacy-preserving ordinal response, Proceeding.s of the 2014 ACM SIGSAC Conference on Computer and Communications Security, pp. 1054-1067, (2014)
  • [10] Cormode G, Jha S, Kulkami T, Et al., Privacy at scale: Local differential privacy in practice, Proceedings of the 2018 International Conference on Management of Data, pp. 1655-1658, (2018)