Differentially Private Publication of Vertically Partitioned Data

Cited by: 10
Authors
Tang, Peng [1]
Cheng, Xiang [1]
Su, Sen [1]
Chen, Rui [2]
Shao, Huaxi [1]
Affiliations
[1] Beijing University of Posts and Telecommunications, State Key Laboratory of Networking and Switching Technology, Beijing 100876, People's Republic of China
[2] Samsung Research America, Mountain View, CA 94043, USA
Funding
National Natural Science Foundation of China
Keywords
Publishing; Differential privacy; Distributed databases; Protocols; Privacy; Correlation; data publishing; vertical partitioning; latent tree model;
DOI
10.1109/TDSC.2019.2905237
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we study the problem of publishing vertically partitioned data under differential privacy, where different attributes of the same set of individuals are held by multiple parties. In this setting, with the assistance of a semi-trusted curator, the involved parties aim to collectively generate an integrated dataset while satisfying differential privacy for each local dataset. Based on the latent tree model (LTM), we present a differentially private latent tree (DPLT) approach, which is, to the best of our knowledge, the first approach to solving this challenging problem. In DPLT, the parties and the curator collaboratively identify the latent tree that best approximates the joint distribution of the integrated dataset, from which a synthetic dataset can be generated. The fundamental advantage of adopting LTM is that the connections between a small number of latent attributes derived from each local dataset capture the cross-dataset dependencies of the observed attributes in all local datasets. As a result, the joint distribution of the integrated dataset can be learned with little injected noise and low computation and communication costs. DPLT is supported by a series of novel techniques, including two-phase latent attribute generation (TLAG), tree index based correlation quantification (TICQ), and a distributed Laplace perturbation protocol (DLPP). Extensive experiments on real datasets demonstrate that DPLT offers desirable data utility with low computation and communication costs.
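The abstract names a distributed Laplace perturbation protocol but does not describe how it works. As a rough illustration of the general idea of distributed Laplace noise, and not the paper's DLPP, the sketch below relies on a standard fact: a Laplace(0, b) variable can be written as the sum of n independent terms, each the difference of two Gamma(1/n, b) draws. Every party can therefore add a small noise share to its own contribution, and the aggregate carries exactly Laplace noise. The function names `partial_noise` and `noisy_sum`, the additive statistic, and the parameter values are assumptions made for illustration. A real protocol would also need to hide the individual perturbed values from the curator (for example with secure aggregation), which this sketch omits.

```python
import random


def partial_noise(scale, n_parties, rng):
    """One party's noise share: the difference of two Gamma(1/n, scale) draws.

    The sum of n_parties independent shares is distributed as Laplace(0, scale),
    because the Laplace distribution is infinitely divisible.
    """
    shape = 1.0 / n_parties
    return rng.gammavariate(shape, scale) - rng.gammavariate(shape, scale)


def noisy_sum(local_values, sensitivity, epsilon, rng):
    """Hypothetical aggregation: each party perturbs its own value locally,
    and the curator only adds up the perturbed values."""
    scale = sensitivity / epsilon  # Laplace scale b for an epsilon-DP sum
    n = len(local_values)
    perturbed = [v + partial_noise(scale, n, rng) for v in local_values]
    return sum(perturbed)


if __name__ == "__main__":
    rng = random.Random(42)
    # Three parties, each holding one additive contribution (made-up numbers).
    print(noisy_sum([10.0, 20.0, 30.0], sensitivity=1.0, epsilon=0.5, rng=rng))
```

Under these assumptions the total noise has variance 2b² with b = sensitivity / epsilon, the same as a single centrally added Laplace draw, while no single party ever generates or sees the full noise value.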
Pages: 780-795
Page count: 16