Publishing histograms with outliers under data differential privacy

Cited by: 12
Authors
Han, Qilong [1]
Shao, Bo [1]
Li, Lijie [1]
Ma, Zhiqiang [1]
Zhang, Haitao [1]
Du, Xiaojiang [2]
Affiliations
[1] Harbin Engineering University, College of Computer Science and Technology, Harbin 150001, People's Republic of China
[2] Temple University, Department of Computer and Information Sciences, Philadelphia, PA 19122, USA
Funding
National Natural Science Foundation of China
Keywords
differential privacy; histogram; outlier; big data
DOI
10.1002/sec.1493
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Histograms are important tools for data mining and analysis, and several differentially private schemes for publishing them have been proposed recently. Existing schemes have shown that histogram reconstruction is a promising way to improve the accuracy of published histograms. However, none of them properly considers outliers in the original histogram, which can cause significant reconstruction errors. To address this problem, this paper puts forward Outlier-HistoPub, a method for publishing histograms with outliers under differential privacy. The method first processes the count sequence of the original histogram, using a global sort to reduce the degree of alternative distribution (a concept proposed in this paper), which may eliminate the influence of outliers during reconstruction. To avoid leaking individual privacy during reconstruction, an exponential mechanism selects, at each step, the most similar adjacent bins of the uniformity distribution histogram to merge, and the Laplace mechanism generates noise to perturb the count sequence of the reconstructed histogram. Experiments show that the proposed method can improve the efficiency and accuracy of histogram publication. Copyright (c) 2016 John Wiley & Sons, Ltd.
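The three steps named in the abstract (global sort, exponential-mechanism merging of adjacent bins, Laplace perturbation) can be illustrated with a minimal sketch. This is not the paper's Outlier-HistoPub algorithm: the function names, the utility function (negative gap between adjacent group means), the assumed utility sensitivity of 1, the even split of the merge budget, and the sensitivity of 1 for a group sum are all assumptions made here for illustration, and the sketch comes with no privacy proof.

```python
import math
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # is distributed as Laplace(0, scale).
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def exponential_choice(utilities, epsilon, sensitivity, rng):
    # Exponential mechanism: pick index i with probability proportional
    # to exp(epsilon * u_i / (2 * sensitivity)). Subtracting the maximum
    # utility keeps the exponentials numerically stable.
    top = max(utilities)
    weights = [math.exp(epsilon * (u - top) / (2.0 * sensitivity))
               for u in utilities]
    return rng.choices(range(len(utilities)), weights=weights, k=1)[0]


def sketch_publish(counts, k, eps_merge, eps_noise, seed=None):
    """Hypothetical sketch: sort bins, merge down to k groups, add noise."""
    if k < 1:
        raise ValueError("k must be at least 1")
    rng = random.Random(seed)

    # Step 1: global sort of the counts, remembering original positions,
    # so that outlier bins end up next to bins with similar counts.
    order = sorted(range(len(counts)), key=lambda i: counts[i])
    groups = [[i] for i in order]

    # Step 2: repeatedly merge one pair of adjacent groups, chosen by the
    # exponential mechanism. Utility and sensitivity are assumptions.
    merges = len(groups) - k
    for _ in range(merges):
        means = [sum(counts[i] for i in g) / len(g) for g in groups]
        utilities = [-abs(means[j] - means[j + 1])
                     for j in range(len(groups) - 1)]
        j = exponential_choice(utilities, eps_merge / merges, 1.0, rng)
        groups[j:j + 2] = [groups[j] + groups[j + 1]]

    # Step 3: perturb each group's sum with Laplace noise (assumed
    # sensitivity 1) and publish the noisy group mean for every member bin.
    published = [0.0] * len(counts)
    for g in groups:
        noisy_sum = sum(counts[i] for i in g) + laplace_noise(1.0 / eps_noise, rng)
        for i in g:
            published[i] = noisy_sum / len(g)
    return published
```

For example, `sketch_publish([3, 100, 4, 5, 98, 2], k=2, eps_merge=0.5, eps_noise=0.5, seed=0)` returns six values in the original bin order, with at most two distinct values. Averaging one noise draw over a merged group is what reduces per-bin noise, at the cost of the approximation error introduced by merging.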
Pages: 2313-2322
Page count: 10