LoHDP: Adaptive local differential privacy for high-dimensional data publishing

被引:0
|
作者
Shen, Guohua [1 ,2 ,3 ,4 ]
Cai, Mengnan [1 ]
Huang, Zhiqiu [1 ,2 ,3 ]
Yang, Yang [1 ]
Guo, Feifei [1 ]
Wei, Linlin [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] Minist Ind & Informat Technol, Key Lab Safety Crit Software, Nanjing, Peoples R China
[3] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
data publication; high-dimensional data; local differential privacy; marginal release; privacy protection;
D O I
10.1002/cpe.8039
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The increasing availability of high-dimensional data collected from numerous users has led to the need for multi-dimensional data publishing methods that protect individual privacy. In this paper, we investigate the use of local differential privacy for such purposes. Existing solutions calculate pairwise attribute marginals to construct probabilistic graphical models for generating attribute clusters. These models are then used to derive low-dimensional marginals of these clusters, allowing for an approximation of the distribution of the original dataset and the generation of synthetic datasets. Existing solutions have limitations in computing the marginals of pairwise attributes and multi-dimensional distribution on attribute clusters, as well as constructing relational dependency graphs that contain large clusters. To address these problems, we propose LoHDP, a high-dimensional data publishing method composed of adaptive marginal computing and an effective attribute clustering method. The adaptive local marginal calculates any k-dimensional marginals required in the algorithm. In particular, methods such as sampling-based randomized response are used instead of privacy budget splits to perturb user data. The attribute clustering method measures the correlation between pairwise attributes using an effective method, reduces the search space during the construction of the dependency graph using high-pass filtering technology, and realizes dimensionality reduction by combining sufficient triangulation operation. We demonstrate through extensive experiments on real datasets that our LoHDP method outperforms existing methods in terms of synthetic dataset quality.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Optimizing error of high-dimensional statistical queries under differential privacy
    McKenna, Ryan
    Miklau, Gerome
    Hay, Michael
    Machanavajjhala, Ashwin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (10): : 1206 - 1219
  • [22] Differential Privacy for Data and Model Publishing of Medical Data
    Sun, Zongkun
    Wang, Yinglong
    Shu, Minglei
    Liu, Ruixia
    Zhao, Huiqi
    IEEE ACCESS, 2019, 7 : 152103 - 152114
  • [23] Differentially Private Multi-Party High-Dimensional Data Publishing
    Su, Sen
    Tang, Peng
    Cheng, Xiang
    Chen, Rui
    Wu, Zequn
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 205 - 216
  • [24] Adaptive Clustering for Outlier Identification in High-Dimensional Data
    Thudumu, Srikanth
    Branch, Philip
    Jin, Jiong
    Singh, Jugdutt
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2019, PT II, 2020, 11945 : 215 - 228
  • [25] ADAPTIVE CHANGE POINT MONITORING FOR HIGH-DIMENSIONAL DATA
    Wu, Teng
    Wang, Runmin
    Yan, Hao
    Shao, Xiaofeng
    STATISTICA SINICA, 2022, 32 (03) : 1583 - 1610
  • [26] Adaptive Dimensionality Reduction Method for High-dimensional Data
    Duan, Shuyong
    Yang, Jianhua
    Han, Xu
    Liu, Guirong
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2024, 60 (17): : 283 - 296
  • [27] Adaptive Bayesian density regression for high-dimensional data
    Shen, Weining
    Ghosal, Subhashis
    BERNOULLI, 2016, 22 (01) : 396 - 420
  • [28] Privacy Preserving Trajectory Data Publishing with Personalized Differential Privacy
    Wen, Ruxue
    Cheng, Wenqing
    Huang, Haojun
    Miao, Wang
    Wang, Chen
    2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, : 313 - 320
  • [29] Differential Privacy in Power Big Data Publishing
    Kong, Ping
    Wang, Xiaochun
    Zhang, Boyi
    Li, Yidong
    PARALLEL ARCHITECTURE, ALGORITHM AND PROGRAMMING, PAAP 2017, 2017, 729 : 471 - 479
  • [30] High-Dimensional Function Optimization with a Self Adaptive Differential Evolution
    Worasucheep, Chukiat
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1, 2009, : 668 - 673