Noise-Aware Statistical Inference with Differentially Private Synthetic Data

被引:0
|
作者
Raisa, Ossi [1 ]
Jalko, Joonas [1 ]
Kaski, Samuel [2 ,3 ]
Honkela, Antti [1 ]
机构
[1] Univ Helsinki, Helsinki, Finland
[2] Aalto Univ, Espoo, Finland
[3] Univ Manchester, Manchester, England
基金
芬兰科学院;
关键词
FOUNDATIONS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While generation of synthetic data under differential privacy (DP) has received a lot of attention in the data privacy community, analysis of synthetic data has received much less. Existing work has shown that simply analysing DP synthetic data as if it were real does not produce valid inferences of population-level quantities. For example, confidence intervals become too narrow, which we demonstrate with a simple experiment. We tackle this problem by combining synthetic data analysis techniques from the field of multiple imputation (MI), and synthetic data generation using noise-aware (NA) Bayesian modeling into a pipeline NA+MI that allows computing accurate uncertainty estimates for population-level quantities from DP synthetic data. To implement NA+MI for discrete data generation using the values of marginal queries, we develop a novel noise-aware synthetic data generation algorithm NAPSU-MQ using the principle of maximum entropy. Our experiments demonstrate that the pipeline is able to produce accurate confidence intervals from DP synthetic data. The intervals become wider with tighter privacy to accurately capture the additional uncertainty stemming from DP noise.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] APEx: Accuracy-Aware Differentially Private Data Exploration
    Ge, Chang
    He, Xi
    Ilyas, Ihab F.
    Machanavajjhala, Ashwin
    SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 177 - 194
  • [32] Differentially private and utility-aware publication of trajectory data
    Liu, Qi
    Yu, Juan
    Han, Jianmin
    Yao, Xin
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 180
  • [33] Real-time noise-aware tone mapping
    Eilertsen, Gabriel
    Mantiuk, Rafal K.
    Unger, Jonas
    ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (06):
  • [34] Noise-aware power optimization for on-chip interconnect
    Kim, KW
    Jung, SO
    Narayanan, U
    Liu, CL
    Kang, SM
    ISLPED '00: PROCEEDINGS OF THE 2000 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2000, : 108 - 113
  • [35] Collaborative learning from distributed data with differentially private synthetic data
    Prediger, Lukas
    Jalko, Joonas
    Honkela, Antti
    Kaski, Samuel
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [36] A TSV Noise-Aware 3-D Placer
    Lee, Yu-Min
    Chen, Chun
    Song, JiaXing
    Pan, Kuan-Te
    2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2015, : 1653 - 1658
  • [37] Noise-aware power optimization for on-chip interconnect
    Kim, Ki-Wook
    Jung, Seong-Ook
    Narayanan, Unni
    Liu, C.L.
    Kang, Sung-Mo
    Proceedings of the International Symposium on Low Power Electronics and Design, Digest of Technical Papers, 2000, : 108 - 113
  • [38] NOISE-AWARE DATA PRESERVING SEQUENTIAL MTCMOS CIRCUITS WITH DYNAMIC FORWARD BODY BIAS
    Jiao, Hailong
    Kursun, Volkan
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2011, 20 (01) : 125 - 145
  • [39] QuantumNAT: Quantum Noise-Aware Training with Noise Injection, Quantization and Normalization
    Wang, Hanrui
    Gu, Jiaqi
    Ding, Yongshan
    Li, Zirui
    Chong, Frederic T.
    Pan, David Z.
    Han, Song
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 1 - 6
  • [40] Cleaning training-datasets with noise-aware algorithms
    Escalante, H. Jair
    SEVENTH MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE, PROCEEDINGS, 2006, : 151 - 158