Publishing histograms with outliers under data differential privacy

被引:12
|
作者
Han, Qilong [1 ]
Shao, Bo [1 ]
Li, Lijie [1 ]
Ma, Zhiqiang [1 ]
Zhang, Haitao [1 ]
Du, Xiaojiang [2 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA
基金
中国国家自然科学基金;
关键词
differential privacy; histogram; outlier; bigdata;
D O I
10.1002/sec.1493
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Histograms are important tools for data mining and analysis. Several differentially private publishing schemes for histograms have been proposed recently. Existing differentially private histogram publication schemes have shown that histogram reconstruction is a promising idea for the improvement of publication histograms' accuracy. However, none of these have properly considered the problem outliers in the original histogram, which can cause significant reconstruction errors. Based on the problem, the publication of histogram outliers under differential privacy, this paper puts forward a publication method for histograms with outliers under differential privacy: Outlier-HistoPub. Our method deals with the count sequence of the original histogram first, using a global sort to reduce the degree of alternative distribution (a concept proposed in this paper), which may eliminate the influence of outliers during reconstruction. To avoid individual privacy leakage in the reconstruction process, an exponential mechanism is used to select the most similar adjacent bins of the uniformity distribution histogram to merge each time, and the Laplace mechanism is utilized to generate noisy data to perturb the count sequence of the reconstruction histogram. Experiments prove that the method proposed in this paper can improve the efficiency and accuracy of histogram publication. Copyright (c) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:2313 / 2322
页数:10
相关论文
共 50 条
  • [21] Differential Privacy Data Publishing Method Based on Cell Merging
    Li, Qi
    Li, Yuqiang
    Zeng, Guicai
    Liu, Aihua
    PROCEEDINGS OF THE 2017 IEEE 14TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2017), 2017, : 778 - 782
  • [22] Differential privacy preserving data publishing based on Bayesian network
    Qi, Xuejian
    Ma, Xuebin
    Bai, Xiangyu
    Li, Wuyungerile
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 1718 - 1726
  • [23] Dynamic Data Publishing with Differential Privacy via Reinforcement Learning
    Gao, Ruichao
    Ma, Xuebin
    2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2019, : 746 - 752
  • [24] Differential Privacy via t-Closeness in Data Publishing
    Soria-Comas, Jordi
    Domingo-Ferrer, Josep
    2013 ELEVENTH ANNUAL INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY AND TRUST (PST), 2013, : 27 - 35
  • [25] Achieving differential privacy of trajectory data publishing in participatory sensing
    Li, Meng
    Zhu, Liehuang
    Zhang, Zijian
    Xu, Rixin
    INFORMATION SCIENCES, 2017, 400 : 1 - 13
  • [26] Privacy in Data Publishing
    di Vimercati, Sabrina De Capitani
    Foresti, Sara
    Livraga, Giovanni
    DATA PRIVACY MANAGEMENT AND AUTONOMOUS SPONTANEOUS SECURITY, 2011, 6514 : 8 - 21
  • [27] Privacy in Data Publishing
    Gehrke, Johannes
    Kifer, Daniel
    Machanavajjhala, Ashwin
    26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 1213 - 1213
  • [28] Differential privacy data publishing in the big data platform of precise poverty alleviation
    Suwei Gao
    Changchun Zhou
    Soft Computing, 2020, 24 : 8139 - 8147
  • [29] In-Storage Computation of Histograms with Differential Privacy
    Tosa, Andrei
    Hangan, Anca
    Sebestyen, Gheorghe
    Istvan, Zsolt
    2021 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT), 2021, : 246 - 249
  • [30] Differential privacy data publishing in the big data platform of precise poverty alleviation
    Gao, Suwei
    Zhou, Changchun
    SOFT COMPUTING, 2020, 24 (11) : 8139 - 8147