LoHDP: Adaptive local differential privacy for high-dimensional data publishing

Citations: 0
Authors
Shen, Guohua [1, 2, 3, 4]
Cai, Mengnan [1]
Huang, Zhiqiu [1, 2, 3]
Yang, Yang [1 ]
Guo, Feifei [1 ]
Wei, Linlin [1 ]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] Minist Ind & Informat Technol, Key Lab Safety Crit Software, Nanjing, Peoples R China
[3] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
Source
Funding
National Natural Science Foundation of China;
Keywords
data publication; high-dimensional data; local differential privacy; marginal release; privacy protection;
DOI
10.1002/cpe.8039
Chinese Library Classification number
TP31 [Computer software];
Discipline classification codes
081202 ; 0835 ;
Abstract
The increasing availability of high-dimensional data collected from numerous users has created a need for multi-dimensional data publishing methods that protect individual privacy. In this paper, we investigate the use of local differential privacy for this purpose. Existing solutions compute pairwise attribute marginals to construct probabilistic graphical models for generating attribute clusters. These models are then used to derive low-dimensional marginals of the clusters, which approximate the distribution of the original dataset and support the generation of synthetic datasets. However, existing solutions have limitations in computing the marginals of attribute pairs and the multi-dimensional distributions over attribute clusters, and in constructing relational dependency graphs that contain large clusters. To address these problems, we propose LoHDP, a high-dimensional data publishing method composed of an adaptive marginal computing method and an effective attribute clustering method. The adaptive local marginal method computes any k-dimensional marginal required by the algorithm. In particular, it perturbs user data with methods such as sampling-based randomized response instead of splitting the privacy budget. The attribute clustering method measures the correlation between pairwise attributes, reduces the search space during construction of the dependency graph using high-pass filtering, and achieves dimensionality reduction by incorporating a sufficient triangulation operation. Extensive experiments on real datasets show that LoHDP outperforms existing methods in terms of synthetic dataset quality.
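The abstract contrasts sampling-based randomized response with privacy budget splitting but does not give the mechanism itself. The following is a minimal generic sketch of the sampling idea, not the LoHDP algorithm: each user reports a single randomly chosen attribute and spends the whole budget on it, instead of reporting every attribute with a fraction of the budget. The function names, the use of generalized randomized response, and the one-dimensional frequency estimator are all assumptions made for illustration.

```python
import math
import random
from collections import Counter


def grr_perturb(value, domain, epsilon, rng):
    """Generalized randomized response (assumed mechanism, not from the paper).

    Keeps the true value with probability p = e^eps / (e^eps + k - 1),
    otherwise reports one of the other k - 1 values uniformly at random.
    Assumes the domain has at least two values.
    """
    k = len(domain)
    p = math.exp(epsilon) / (math.exp(epsilon) + k - 1)
    if rng.random() < p:
        return value
    others = [v for v in domain if v != value]
    return rng.choice(others)


def sample_and_report(record, domains, epsilon, rng):
    """Sample ONE attribute uniformly and perturb it with the full budget."""
    j = rng.randrange(len(record))
    return j, grr_perturb(record[j], domains[j], epsilon, rng)


def estimate_frequencies(reports, domains, epsilon):
    """Unbiased per-attribute frequency estimates from the sampled reports."""
    estimates = []
    for j, domain in enumerate(domains):
        values = [v for (attr, v) in reports if attr == j]
        n = len(values)
        k = len(domain)
        p = math.exp(epsilon) / (math.exp(epsilon) + k - 1)
        q = (1.0 - p) / (k - 1)  # probability of reporting a specific other value
        counts = Counter(values)
        estimates.append(
            {v: ((counts[v] / n - q) / (p - q)) if n else 0.0 for v in domain}
        )
    return estimates
```

With d attributes, each attribute is reported by roughly n/d users at the full budget, whereas budget splitting has all n users report every attribute at budget epsilon/d. Which option gives lower error depends on d, epsilon, and the domain sizes, so this sketch should not be read as evidence for the paper's results.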
Pages: 19
Related papers
  • [1] Multi-Party High-Dimensional Data Publishing Under Differential Privacy
    Cheng, Xiang
    Tang, Peng
    Su, Sen
    Chen, Rui
    Wu, Zequn
    Zhu, Binyuan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (08) : 1557 - 1571
  • [2] Differential Privacy High-Dimensional Data Publishing Based on Feature Selection and Clustering
    Chu, Zhiguang
    He, Jingsha
    Zhang, Xiaolei
    Zhang, Xing
    Zhu, Nafei
    ELECTRONICS, 2023, 12 (09)
  • [3] LoPub: High-Dimensional Crowdsourced Data Publication With Local Differential Privacy
    Ren, Xuebin
    Yu, Chia-Mu
    Yu, Weiren
    Yang, Shusen
    Yang, Xinyu
    McCann, Julie A.
    Yu, Philip S.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2018, 13 (09) : 2151 - 2166
  • [4] A High-Dimensional Data Trust Publishing Method Based on Attention Mechanism and Differential Privacy
    Li, Taiqiang
    Zhang, Zhen
    Qian, Heng
    Wang, Qiuyue
    Su, Guanqun
    Meng, Lingzhen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IX, ICIC 2024, 2024, 14870 : 208 - 219
  • [5] Privacy-preserving high-dimensional data publishing for classification
    Wang, Rong
    Zhu, Yan
    Chang, Chin-Chen
    Peng, Qiang
    COMPUTERS & SECURITY, 2020, 93
  • [6] Collecting High-Dimensional and Correlation-Constrained Data with Local Differential Privacy
    Du, Rong
    Ye, Qingqing
    Fu, Yue
    Hu, Haibo
    2021 18TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON), 2021,
  • [7] Local Differential Privacy Protection of High-Dimensional Perceptual Data by the Refined Bayes Network
    Ju, Chunhua
    Gu, Qiuyang
    Wu, Gongxing
    Zhang, Shuangzhu
    SENSORS, 2020, 20 (09)
  • [8] High-Dimensional Crowdsourced Data Distribution Estimation with Local Privacy
    Ren, Xuebin
    Yu, Chia-Mu
    Yu, Weiren
    Yang, Shusen
    Yang, Xinyu
    McCann, Julie
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2016, : 226 - 233
  • [9] PU_Bpub: High-Dimensional Data Release Mechanism Based on Spectral Clustering with Local Differential Privacy
    Lin, Aixin
    Ma, Xuebin
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS (WASA 2022), PT II, 2022, 13472 : 572 - 581
  • [10] Dynamic Edge-Based High-Dimensional Data Aggregation with Differential Privacy
    Chen, Qian
    Ni, Zhiwei
    Zhu, Xuhui
    Lyu, Moli
    Liu, Wentao
    Xia, Pingfan
    ELECTRONICS, 2024, 13 (16)