Algorithmically Effective Differentially Private Synthetic Data

被引:0
|
作者
He, Yiyun [1 ]
Vershynin, Roman [1 ]
Zhu, Yizhe [1 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92717 USA
关键词
differential privacy; synthetic data; Wasserstein metric;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a highly effective algorithmic approach for generating epsilon-differentially private synthetic data in a bounded metric space with near-optimal utility guarantees under the 1-Wasserstein distance. In particular, for a dataset X in the hypercube [0, 1](d), our algorithm generates synthetic dataset Y such that the expected 1-Wasserstein distance between the empirical measure of X and Y is O((epsilon n)(-1/d)) for d >= 2, and is O(log(2) (epsilon n)(epsilon n)(-1)) for d = 1. The accuracy guarantee is optimal up to a constant factor for d >= 2, and up to a logarithmic factor for d = 1. Our algorithm has a fast running time of O(epsilon dn) for all d >= 1 and demonstrates improved accuracy compared to the method in (Boedihardjo et al., 2022c) for d >= 2.
引用
收藏
页数:28
相关论文
共 50 条
  • [41] PrivPfC: differentially private data publication for classification
    Dong Su
    Jianneng Cao
    Ninghui Li
    Min Lyu
    The VLDB Journal, 2018, 27 : 201 - 223
  • [42] Differentially Private Knowledge Distillation via Synthetic Text Generation
    University of Southern California, United States
    Proc. Annu. Meet. Assoc. Comput Linguist., (12957-12968):
  • [43] Differentially Private Knowledge Distillation via Synthetic Text Generation
    Flemings, James
    Annavaram, Murali
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 12957 - 12968
  • [44] GlucoSynth: Generating Differentially-Private Synthetic Glucose Traces
    Lamp, Josephine
    Derdzinski, Mark
    Hannemann, Christopher
    van der Linden, Joost
    Feng, Lu
    Wang, Tianhao
    Evans, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [45] Differentially Private Publication of Vertically Partitioned Data
    Tang, Peng
    Cheng, Xiang
    Su, Sen
    Chen, Rui
    Shao, Huaxi
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (02) : 780 - 795
  • [46] A differentially private method for crowdsourcing data submission
    Zhang, Lefeng
    Xiong, Ping
    Ren, Wei
    Zhu, Tianqing
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (19):
  • [47] Research on Differentially Private Trajectory Data Publishing
    Feng Dengguo
    Zhang Min
    Ye Yutong
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (01) : 74 - 88
  • [48] Differentially Private Feature Selection for Data Mining
    Anandan, Balamurugan
    Clifton, Chris
    IWSPA '18: PROCEEDINGS OF THE FOURTH ACM INTERNATIONAL WORKSHOP ON SECURITY AND PRIVACY ANALYTICS, 2018, : 43 - 53
  • [49] Differentially Private Distance Learning in Categorical Data
    Battaglia, Elena
    Celano, Simone
    Pensa, Ruggero G.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 35 (05) : 2050 - 2088
  • [50] Saibot: A Differentially Private Data Search Platform
    Huang, Zezhou
    Liu, Jiaxiang
    Alabi, Daniel Gbenga
    Fernandez, Raul Castro
    Wu, Eugene
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (11): : 3057 - 3070