Algorithmically Effective Differentially Private Synthetic Data

被引:0
|
作者
He, Yiyun [1 ]
Vershynin, Roman [1 ]
Zhu, Yizhe [1 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92717 USA
关键词
differential privacy; synthetic data; Wasserstein metric;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a highly effective algorithmic approach for generating epsilon-differentially private synthetic data in a bounded metric space with near-optimal utility guarantees under the 1-Wasserstein distance. In particular, for a dataset X in the hypercube [0, 1](d), our algorithm generates synthetic dataset Y such that the expected 1-Wasserstein distance between the empirical measure of X and Y is O((epsilon n)(-1/d)) for d >= 2, and is O(log(2) (epsilon n)(epsilon n)(-1)) for d = 1. The accuracy guarantee is optimal up to a constant factor for d >= 2, and up to a logarithmic factor for d = 1. Our algorithm has a fast running time of O(epsilon dn) for all d >= 1 and demonstrates improved accuracy compared to the method in (Boedihardjo et al., 2022c) for d >= 2.
引用
收藏
页数:28
相关论文
共 50 条
  • [11] Analysis of Differentially Private Synthetic Data: A Measurement Error Approach
    Jiang, Yangdi
    Liu, Yi
    Yan, Xiaodong
    Charest, Anne-Sophie
    Kong, Linglong
    Jiang, Bei
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21206 - 21213
  • [12] Differentially Private Verification of Regression Predictions from Synthetic Data
    Yu, Haoyang
    Reiter, Jerome P.
    TRANSACTIONS ON DATA PRIVACY, 2018, 11 (03) : 279 - 297
  • [13] Differentially Private Synthetic Control
    Rho, Saeyoung
    Cummings, Rachel
    Misra, Vishal
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [14] Effective Route Recommendation Leveraging Differentially Private Location Data
    Kim, Jongwook
    MATHEMATICS, 2024, 12 (19)
  • [15] An Effective Differentially Private Data Releasing Algorithm for Decision Tree
    Zhu, Tianqing
    Xiong, Ping
    Xiang, Yang
    Zhou, Wanlei
    2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 388 - 395
  • [16] DP-CGAN : Differentially Private Synthetic Data and Label Generation
    Torkzadehmahani, Reihaneh
    Kairouz, Peter
    Paten, Benedict
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 98 - 104
  • [17] Noise-Aware Statistical Inference with Differentially Private Synthetic Data
    Raisa, Ossi
    Jalko, Joonas
    Kaski, Samuel
    Honkela, Antti
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [18] Differentially private synthetic medical data generation using convolutional GANs
    Torfi, Amirsina
    Fox, Edward A.
    Reddy, Chandan K.
    INFORMATION SCIENCES, 2022, 586 : 485 - 500
  • [19] Benchmarking Evaluation Protocols for Classifiers Trained on Differentially Private Synthetic Data
    Movahedi, Parisa
    Nieminen, Valtteri
    Perez, Ileana Montoya
    Daafane, Hiba
    Sukhwal, Dishant
    Pahikkala, Tapio
    Airola, Antti
    IEEE ACCESS, 2024, 12 : 118637 - 118648
  • [20] Differentially Private Release of Synthetic Graphs
    Elias, Marek
    Kapralov, Michael
    Kulkarni, Janardhan
    Lee, Yin Tat
    PROCEEDINGS OF THE 2020 ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2020, : 560 - 578