Data-Driven Approach for Evaluating Risk of Disclosure and Utility in Differentially Private Data Release

被引:6
|
作者
Chen, Kang-Cheng [1 ]
Yu, Chia-Mu [2 ]
Tai, Bo-Chen [3 ]
Li, Szu-Chuang [3 ]
Tsou, Yao-Tung [4 ]
Huang, Yennun [3 ]
Lin, Chia-Ming [5 ]
机构
[1] Yuan Ze Univ, Dept Comp Sci & Engn, Taoyuan, Taiwan
[2] Natl Chung Hsing Univ, Dept Comp Sci & Engn, Taichung, Taiwan
[3] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
[4] Feng Chia Univ, Dept Commun Engn, Taichung, Taiwan
[5] Inst Informat Ind Dept, Data Analyt Technol & Applicat, Taipei, Taiwan
关键词
D O I
10.1109/AINA.2017.172
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Differential privacy (DP) is a popular technique for protecting individual privacy and at the same for releasing data for public use. However, very few research efforts are devoted to the balance between the corresponding risk of data disclosure (RoD) and data utility. In this paper, we propose data-driven approaches for differentially private data release to evaluate RoD, and offer algorithms to evaluate whether the differentially private synthetic dataset has sufficient privacy. In addition to the privacy, the utility of the synthetic dataset is an important metric for differentially private data release. Thus, we also propose the data-driven algorithm via curve fitting to measure and predict the error of the statistical result incurred by random noise added to the original dataset. Finally, we present an algorithm for choosing appropriate privacy budget epsilon with the balance between the privacy and utility.
引用
收藏
页码:1130 / 1137
页数:8
相关论文
共 50 条
  • [41] A Data-Driven Approach for GPS Trajectory Data Cleaning
    Li, Lun
    Chen, Xiaohang
    Liu, Qizhi
    Bao, Zhifeng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 3 - 19
  • [42] A Missing Data Approach to Data-Driven Filtering and Control
    Markovsky, Ivan
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (04) : 1972 - 1978
  • [43] A Causal, Data-driven Approach to Modeling the Kepler Data
    Wang, Dun
    Hogg, David W.
    Foreman-Mackey, Daniel
    Schoelkopf, Bernhard
    PUBLICATIONS OF THE ASTRONOMICAL SOCIETY OF THE PACIFIC, 2016, 128 (967)
  • [44] Evaluating Variational Autoencoder as a Private Data Release Mechanism for Tabular Data
    Li, Szu-Chuang
    Tai, Bo-Chen
    Huang, Yennun
    2019 IEEE 24TH PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING (PRDC 2019), 2019, : 198 - 206
  • [45] Evaluating Classifiers Trained on Differentially Private Synthetic Health Data
    Movahedi, Parisa
    Nieminen, Valtteri
    Perez, Ileana Montoya
    Pahikkala, Tapio
    Airola, Antti
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 748 - 753
  • [46] Clover: An unbiased method for prioritizing differentially expressed genes using a data-driven approach
    Oba, Gina Miku
    Nakato, Ryuichiro
    GENES TO CELLS, 2024, 29 (06) : 456 - 470
  • [47] Data-Driven Precision Implementation Approach
    Cullen, Laura
    Hanrahan, Kirsten
    Tucker, Sharon J.
    Gallagher-Ford, Lynn
    AMERICAN JOURNAL OF NURSING, 2019, 119 (08) : 60 - 63
  • [48] Evaluating sustainable energy security in China and Kazakhstan: A comprehensive data-driven approach
    Darke, Walker
    Karatayev, Marat
    ENVIRONMENTAL AND SUSTAINABILITY INDICATORS, 2025, 26
  • [49] Controller implementability: a data-driven approach
    Padoan, Alberto
    Coulson, Jeremy
    Dorfler, Florian
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6098 - 6103
  • [50] A data-driven approach to nonlinear elasticity
    Nguyen, Lu Trong Khiem
    Keip, Marc-Andre
    COMPUTERS & STRUCTURES, 2018, 194 : 97 - 115