Algorithmically Effective Differentially Private Synthetic Data

被引:0
|
作者
He, Yiyun [1 ]
Vershynin, Roman [1 ]
Zhu, Yizhe [1 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92717 USA
关键词
differential privacy; synthetic data; Wasserstein metric;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a highly effective algorithmic approach for generating epsilon-differentially private synthetic data in a bounded metric space with near-optimal utility guarantees under the 1-Wasserstein distance. In particular, for a dataset X in the hypercube [0, 1](d), our algorithm generates synthetic dataset Y such that the expected 1-Wasserstein distance between the empirical measure of X and Y is O((epsilon n)(-1/d)) for d >= 2, and is O(log(2) (epsilon n)(epsilon n)(-1)) for d = 1. The accuracy guarantee is optimal up to a constant factor for d >= 2, and up to a logarithmic factor for d = 1. Our algorithm has a fast running time of O(epsilon dn) for all d >= 1 and demonstrates improved accuracy compared to the method in (Boedihardjo et al., 2022c) for d >= 2.
引用
收藏
页数:28
相关论文
共 50 条
  • [21] Differentially Private Release of Synthetic Graphs
    Elias, Marek
    Kapralov, Michael
    Kulkarni, Janardhan
    Lee, Yin Tat
    PROCEEDINGS OF THE THIRTY-FIRST ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS (SODA'20), 2020, : 560 - 578
  • [22] Differentially private synthetic mixed-type data generation for unsupervised learning
    Tantipongpipat, Uthaipon Tao
    Waites, Chris
    Boob, Digvijay
    Siva, Amaresh Ankit
    Cummings, Rachel
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2021, 15 (04): : 779 - 807
  • [23] Differentially Private Auctions for Private Data Crowdsourcing
    Shi, Mingyu
    Qiao, Yu
    Wang, Xinbo
    2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1 - 8
  • [24] POSTER: A Unified Framework of Differentially Private Synthetic Data Release with Generative Adversarial Network
    Lu, Pei-Hsuan
    Yu, Chia-Mu
    CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, : 2547 - 2549
  • [25] Differentially Private Data Generation with Missing Data
    Mohapatra, Shubhankar
    Zong, Jianqiao
    Kerschbaum, Florian
    He, Xi
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (08): : 2022 - 2035
  • [26] PrivSyn: Differentially Private Data Synthesis
    Zhang, Zhikun
    Wang, Tianhao
    Li, Ninghui
    Honorio, Jean
    Backes, Michael
    He, Shibo
    Chen, Jiming
    Zhang, Yang
    PROCEEDINGS OF THE 30TH USENIX SECURITY SYMPOSIUM, 2021, : 929 - 946
  • [27] Differentially Private Topological Data Analysis
    Kang, Taegyu
    Kim, Sehwan
    Sohn, Jinwon
    Awan, Jordan
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [28] Differentially Private Multidimensional Data Publication
    Zhang Ji
    Dong Xin
    Yu Jiadi
    Luo Yuan
    Li Minglu
    Wu Bin
    CHINA COMMUNICATIONS, 2014, 11 (01) : 79 - 85
  • [29] Differentially Private Distributed Data Analysis
    Takabi, Hassan
    Koppikar, Samir
    Zargar, Saman Taghavi
    2016 IEEE 2ND INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (IEEE CIC), 2016, : 212 - 218
  • [30] Differentially private multidimensional data publishing
    Al-Hussaeni, Khalil
    Fung, Benjamin C. M.
    Iqbal, Farkhund
    Liu, Junqiang
    Hung, Patrick C. K.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 56 (03) : 717 - 752