DPXPlain: Privately Explaining Aggregate Query Answers

被引:3
|
作者
Tao, Yuchao [1 ]
Gilad, Amir [1 ]
Machanavajjhala, Ashwin [1 ]
Roy, Sudeepa [1 ]
机构
[1] Duke Univ, Durham, NC 27708 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2022年 / 16卷 / 01期
关键词
DIFFERENTIAL PRIVACY; PROVENANCE; SECURE;
D O I
10.14778/3561261.3561271
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Differential privacy (DP) is the state-of-the-art and rigorous notion of privacy for answering aggregate database queries while preserving the privacy of sensitive information in the data. In today's era of data analysis, however, it poses new challenges for users to understand the trends and anomalies observed in the query results: Is the unexpected answer due to the data itself, or is it due to the extra noise that must be added to preserve DP? In the second case, even the observation made by the users on query results may be wrong. In the first case, can we still mine interesting explanations from the sensitive data while protecting its privacy? To address these challenges, we present a three-phase framework DPXPLAIN, which is the first system to the best of our knowledge for explaining group-by aggregate query answers with DP. In its three phases, DPXPLAIN (a) answers a group-by aggregate query with DP, (b) allows users to compare aggregate values of two groups and with high probability assesses whether this comparison holds or is flipped by the DP noise, and (c) eventually provides an explanation table containing the approximately 'top-k' explanation predicates along with their relative influences and ranks in the form of confidence intervals, while guaranteeing DP in all steps. We perform an extensive experimental analysis of DPXPLAIN with multiple use-cases on real and synthetic data showing that DPXPLAIN efficiently provides insightful explanations with good accuracy and utility.
引用
收藏
页码:113 / 126
页数:14
相关论文
共 50 条
  • [1] Explaining Differentially Private Query Results With DPXPlain
    Wang, Tingyu
    Tao, Yuchao
    Gilad, Amir
    Machanavajjhala, Ashwin
    Roy, Sudeepa
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (12): : 3962 - 3965
  • [2] LensXPlain: Visualizing and Explaining Contributing Subsets for Aggregate Query Answers
    Miao, Zhengjie
    Lee, Andrew
    Roy, Sudeepa
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (12): : 1898 - 1901
  • [3] Explaining Query Answers in Probabilistic Databases
    Debbi, Hichem
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, 8 (04): : 140 - 152
  • [4] Differentially private explanations for aggregate query answers
    Tao, Yuchao
    Gilad, Amir
    Machanavajjhala, Ashwin
    Roy, Sudeepa
    VLDB JOURNAL, 2025, 34 (02):
  • [5] Explaining Query Answers with Explanation-Ready Databases
    Roy, Sudeepa
    Orr, Laurel
    Suciu, Dan
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 9 (04): : 348 - 359
  • [6] Privacy Preserving Aggregate Query of OLAP for Accurate Answers
    Zhu, Youwen
    Huang, Liusheng
    Yang, Wei
    Dong, Fan
    JOURNAL OF COMPUTERS, 2010, 5 (11) : 1678 - 1685
  • [7] Interactive Summarization and Exploration of Top Aggregate Query Answers
    Wen, Yuhao
    Zhu, Xiaodan
    Roy, Sudeepa
    Yang, Jun
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (13): : 2196 - 2208
  • [8] Consistent query answers on numerical databases under aggregate constraints
    Flesca, S
    Furfaro, F
    Parisi, F
    DATABASE PROGRAMMING LANGUAGES, 2005, 3774 : 279 - 294
  • [9] QAGView: Interactively Summarizing High-Valued Aggregate Query Answers
    Wen, Yuhao
    Zhu, Xiaodan
    Roy, Sudeepa
    Yang, Jun
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 1709 - 1712
  • [10] Going Beyond Provenance: Explaining Query Answers with Pattern-based Counterbalances
    Miao, Zhengjie
    Zeng, Qitian
    Glavic, Boris
    Roy, Sudeepa
    SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 485 - 502