Publishing Set-Valued Data via Differential Privacy

被引:5
|
作者
Chen, Rui [1 ]
Mohammed, Noman [1 ]
Fung, Benjamin C. M. [1 ]
Desai, Bipin C. [1 ]
Xiong, Li [2 ]
机构
[1] Concordia Univ, Montreal, PQ, Canada
[2] Emory Univ, Atlanta, GA 30322 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2011年 / 4卷 / 11期
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Set-valued data provides enormous opportunities for various data mining tasks. In this paper, we study the problem of publishing set-valued data for data mining tasks under the rigorous differential privacy model. All existing data publishing methods for set-valued data are based on partitionbased privacy models, for example k-anonymity, which are vulnerable to privacy attacks based on background knowledge. In contrast, differential privacy provides strong privacy guarantees independent of an adversary's background knowledge and computational power. Existing data publishing approaches for differential privacy, however, are not adequate in terms of both utility and scalability in the context of set-valued data due to its high dimensionality. We demonstrate that set-valued data could be efficiently released under differential privacy with guaranteed utility with the help of context-free taxonomy trees. We propose a probabilistic top-down partitioning algorithm to generate a differentially private release, which scales linearly with the input data size. We also discuss the applicability of our idea to the context of relational data. We prove that our result is (epsilon, delta)-useful for the class of counting queries, the foundation of many data mining tasks. We show that our approach maintains high utility for counting queries and frequent itemset mining and scales to large datasets through extensive experiments on real-life set-valued datasets.
引用
收藏
页码:1087 / 1098
页数:12
相关论文
共 50 条
  • [1] Privacy Protection Method on Publishing Dynamic Set-Valued Data
    Zhang, Jian
    Yang, Yu
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INFORMATION MANAGEMENT AND NETWORK SECURITY, 2016, 47 : 262 - 265
  • [2] Towards Privacy Preserving Publishing of Set-Valued Data on Hybrid Cloud
    Zhang, Hongli
    Zhou, Zhigang
    Ye, Lin
    Du, Xiaojiang
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2018, 6 (02) : 316 - 329
  • [3] PrivSet: Set-Valued Data Analyses with Local Differential Privacy
    Wang, Shaowei
    Huang, Liusheng
    Nie, Yiwen
    Wang, Pengzhan
    Xu, Hongli
    Yang, Wei
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2018), 2018, : 1070 - 1078
  • [4] Publishing Set Valued Data via m-Privacy
    Tiwari, Pratik Kumar
    Chaturvedi, Sushil
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ENGINEERING AND TECHNOLOGY RESEARCH (ICAETR), 2014,
  • [5] Publishing Anonymized Set-Valued Data via Disassociation towards Analysis
    Awad, Nancy
    Couchot, Jean-Francois
    Al Bouna, Bechara
    Philippe, Laurent
    [J]. FUTURE INTERNET, 2020, 12 (04):
  • [6] Joint Distribution Analysis for Set-Valued Data With Local Differential Privacy
    Huang, Yaxuan
    Xue, Kaiping
    Zhu, Bin
    Wei, David S. L.
    Sun, Qibin
    Lu, Jun
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 7106 - 7117
  • [7] A Privacy Preserving Method for Publishing Set-valued Data and Its Correlative Social Network
    Wang, Li-e
    Lin, Shan
    Bai, Yan
    Chang, Sang-Yoon
    Li, Xianxian
    Liu, Peng
    [J]. ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [8] Privacy-Aware Set-Valued Data Publishing on Cloud for Personal Healthcare Records
    Alexander, Elizabeth
    Sathyalakshmi
    [J]. ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2016, 2017, 517 : 323 - 334
  • [9] Set-valued data collection with local differential privacy based on category hierarchy
    Ouyang, Jia
    Xiao, Yinyin
    Liu, Shaopeng
    Xiao, Zhenghong
    Liao, Xiuxiu
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (03) : 2733 - 2763
  • [10] Publishing Set-Valued Data Against Realistic Adversaries
    Jun-Qiang Liu
    [J]. Journal of Computer Science and Technology, 2012, 27 : 24 - 36