Answering Multi-Dimensional Range Queries under Local Differential Privacy

Cited by: 20
Authors
Yang, Jianyu [1, 2]
Wang, Tianhao [2 ]
Li, Ninghui [2 ]
Cheng, Xiang [1 ]
Su, Sen [1 ]
Affiliations
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing, Peoples R China
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2020 / Vol. 14 / Issue 03
Funding
National Science Foundation (USA); National Natural Science Foundation of China;
Keywords
DATA PUBLICATION; ERROR;
DOI
10.14778/3430915.3430927
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, we tackle the problem of answering multi-dimensional range queries under local differential privacy. There are three key technical challenges: capturing the correlations among attributes, avoiding the curse of dimensionality, and dealing with the large domains of attributes. None of the existing approaches deals satisfactorily with all three challenges. To overcome them, we first propose an approach called Two-Dimensional Grids (TDG). Its main idea is to use binning to partition the two-dimensional (2-D) domains of all attribute pairs into 2-D grids that can answer all 2-D range queries, and then to estimate the answer to a higher-dimensional range query from the answers to the associated 2-D range queries. However, to reduce errors due to noise, each attribute in the 2-D grids needs a coarse granularity, which loses fine-grained distribution information for individual attributes. To correct this deficiency, we further propose Hybrid-Dimensional Grids (HDG), which also introduces 1-D grids to capture finer-grained information on the distribution of each individual attribute and combines information from 1-D and 2-D grids to answer range queries. To make HDG consistently effective, we provide a guideline for properly choosing the granularities of grids, based on an analysis of how these choices affect different sources of error. Extensive experiments conducted on real and synthetic datasets show that HDG can give a significant improvement over the existing approaches.
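The grid idea in the abstract can be illustrated with a small sketch. It is not the paper's protocol: it assumes generalized randomized response as the local perturbation, a single attribute pair with a square integer domain, a grid size that divides the domain, and uniformity inside each cell. The function names (`grr_perturb`, `grr_estimate`, `cell_of`, `answer_range_query`) are hypothetical, and the granularity guideline, the 1-D grids of HDG, and the estimation of higher-dimensional queries are not reproduced.

```python
import math
import random


def grr_perturb(value, k, epsilon, rng):
    """Report the true cell with probability p, else another cell uniformly."""
    p = math.exp(epsilon) / (math.exp(epsilon) + k - 1)
    if rng.random() < p:
        return value
    other = rng.randrange(k - 1)  # pick among the k - 1 other cells
    return other if other < value else other + 1


def grr_estimate(reports, k, epsilon):
    """Unbiased frequency estimate for each of the k cells."""
    n = len(reports)
    p = math.exp(epsilon) / (math.exp(epsilon) + k - 1)
    q = 1.0 / (math.exp(epsilon) + k - 1)
    counts = [0] * k
    for r in reports:
        counts[r] += 1
    return [(c / n - q) / (p - q) for c in counts]


def cell_of(x, y, domain, g):
    """Map a point in [0, domain)^2 to its cell index in a g x g grid."""
    width = domain // g  # assumption: g divides domain
    return (x // width) * g + (y // width)


def overlap(lo, hi, cell_lo, cell_hi):
    """Count the integers shared by [lo, hi] and [cell_lo, cell_hi]."""
    return max(0, min(hi, cell_hi) - max(lo, cell_lo) + 1)


def answer_range_query(freqs, domain, g, x_lo, x_hi, y_lo, y_hi):
    """Answer an inclusive 2-D range query from cell frequencies,
    assuming values are uniformly spread inside each cell."""
    width = domain // g
    total = 0.0
    for i in range(g):
        fx = overlap(x_lo, x_hi, i * width, (i + 1) * width - 1) / width
        if fx == 0:
            continue
        for j in range(g):
            fy = overlap(y_lo, y_hi, j * width, (j + 1) * width - 1) / width
            total += freqs[i * g + j] * fx * fy
    return total


if __name__ == "__main__":
    rng = random.Random(7)
    domain, g, epsilon = 64, 4, 1.0
    # Synthetic, correlated data for illustration only.
    users = []
    for _ in range(20000):
        x = rng.randrange(domain)
        y = min(domain - 1, max(0, x + rng.randrange(-8, 9)))
        users.append((x, y))
    reports = [grr_perturb(cell_of(x, y, domain, g), g * g, epsilon, rng)
               for x, y in users]
    freqs = grr_estimate(reports, g * g, epsilon)
    est = answer_range_query(freqs, domain, g, 0, 31, 0, 31)
    true = sum(1 for x, y in users if x <= 31 and y <= 31) / len(users)
    print(f"estimated {est:.3f}, true {true:.3f}")
```

A coarser grid (smaller `g`) leaves fewer cells to share the noise but leans harder on the uniformity assumption, which is the tension the abstract describes between noise error and lost fine-grained information.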
Pages: 378 - 390
Number of pages: 13
Related papers
50 in total
  • [31] Parity-based inference control for multi-dimensional range sum queries
    Wang, Lingyu
    Li, Yingjiu
    Jajodia, Sushil
    Wijesekera, Duminda
    JOURNAL OF COMPUTER SECURITY, 2007, 15 (04) : 417 - 445
  • [32] PRoBe: Multi-dimensional range queries in P2P networks
    Sahin, OD
    Antony, S
    Agrawal, D
    El Abbadi, A
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 332 - 346
  • [33] Answering Multiple Aggregate Queries under a Specific Privacy Condition
    Aranda, Jordi
    Nin, Jordi
    Herranz, Javier
    2018 IEEE 42ND ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2018, : 661 - 666
  • [34] Linear Queries Estimation with Local Differential Privacy
    Bassily, Raef
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 721 - 729
  • [35] A workload-adaptive mechanism for linear queries under local differential privacy
    McKenna, Ryan
    Maity, Raj Kumar
    Mazumdar, Arya
    Miklau, Gerome
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (11): : 1905 - 1918
  • [36] Optimizing error of high-dimensional statistical queries under differential privacy
    McKenna, Ryan
    Miklau, Gerome
    Hay, Michael
    Machanavajjhala, Ashwin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (10): : 1206 - 1219
  • [37] The optimization of the Range-Count Queries in Differential Privacy
    Qian, Lei
    Song, Tao
    Liang, Alei
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 618 - 623
  • [38] Aggregate aware caching for multi-dimensional queries
    Deshpande, PM
    Naughton, JF
    ADVANCES IN DATABASE TECHNOLOGY-EDBT 2000, PROCEEDINGS, 2000, 1777 : 167 - 182
  • [39] A Data- and Workload-Aware Algorithm for Range Queries Under Differential Privacy
    Li, Chao
    Hay, Michael
    Miklau, Gerome
    Wang, Yue
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (05): : 341 - 352
  • [40] Cache Optimization for Multi-dimensional Data Queries
    Lu, Jiehua
    ICCSIT 2010 - 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 4, 2010, : 538 - 542