Data-Driven Approach for Evaluating Risk of Disclosure and Utility in Differentially Private Data Release

Cited by: 6
Authors
Chen, Kang-Cheng [1]
Yu, Chia-Mu [2]
Tai, Bo-Chen [3]
Li, Szu-Chuang [3]
Tsou, Yao-Tung [4]
Huang, Yennun [3]
Lin, Chia-Ming [5]
Affiliations
[1] Yuan Ze Univ, Dept Comp Sci & Engn, Taoyuan, Taiwan
[2] Natl Chung Hsing Univ, Dept Comp Sci & Engn, Taichung, Taiwan
[3] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
[4] Feng Chia Univ, Dept Commun Engn, Taichung, Taiwan
[5] Inst Informat Ind, Dept Data Analyt Technol & Applicat, Taipei, Taiwan
DOI
10.1109/AINA.2017.172
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Differential privacy (DP) is a popular technique for protecting individual privacy while releasing data for public use. However, few research efforts have been devoted to balancing the corresponding risk of data disclosure (RoD) against data utility. In this paper, we propose data-driven approaches for evaluating RoD in differentially private data release, and offer algorithms to evaluate whether a differentially private synthetic dataset provides sufficient privacy. Besides privacy, the utility of the synthetic dataset is an important metric for differentially private data release. We therefore also propose a data-driven algorithm that uses curve fitting to measure and predict the error in statistical results incurred by the random noise added to the original dataset. Finally, we present an algorithm for choosing an appropriate privacy budget epsilon that balances privacy and utility.
Pages: 1130-1137
Page count: 8
Related papers
  • [21] Approach to data-driven learning
    Markov, Z.
    International Workshop on Fundamentals of Artificial Intelligence Research, 1991,
  • [22] AN APPROACH TO DATA-DRIVEN LEARNING
    MARKOV, Z
    LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, 1991, 535 : 127 - 140
  • [23] Data-Driven MoE: A Data-Driven Approach to Construct MoE by a Single LLM
    Teng, Zeyu
    Yan, Zhiwei
    Song, Yong
    Ye, Xiaozhou
    Ouyang, Ye
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14878 : 352 - 363
  • [24] Data-Driven Approach for Evaluating the Energy Efficiency in Multifamily Residential Buildings
    Seyrfar, Abolfazl
    Ataei, Hossein
    Movahedi, Ali
    Derrible, Sybil
    PRACTICE PERIODICAL ON STRUCTURAL DESIGN AND CONSTRUCTION, 2021, 26 (02)
  • [25] Evaluating the Potential of a Data-Driven Approach in Digital Service (Re)Design
    Mijac, Tea
    Jadric, Mario
    Cukusic, Maja
    CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS (CECIIS 2018), 2018, : 187 - 194
  • [26] Differentially Private Data Release over Multiple Tables
    Ghazi, Badih
    Hu, Xiao
    Kumar, Ravi
    Manurangsi, Pasin
    PROCEEDINGS OF THE 42ND ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, PODS 2023, 2023, : 207 - 219
  • [27] Differentially Private Data Release Via Wavelet Transforms
    Deng, Yu
    Zhuang, Yi-Feng
    Qian, Lei
    2015 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND INFORMATION SYSTEM (SEIS 2015), 2015, : 196 - 200
  • [28] Differentially Private Data Release through Multidimensional Partitioning
    Xiao, Yonghui
    Xiong, Li
    Yuan, Chun
    SECURE DATA MANAGEMENT, 2010, 6358 : 150 - +
  • [29] A Quantitative Approach for Evaluating the Utility of a Differentially Private Behavioral Science Dataset
    Hill, Raquel
    Hansen, Michael
    Janssen, Erick
    Sanders, Stephanie A.
    Heiman, Julia R.
    Xiong, Li
    2014 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2014, : 276 - 284
  • [30] Data-Driven Model for Risk Assessment of Cable Fire in Utility Tunnels Using Evidential Reasoning Approach
    彭欣
    姚帅寓
    胡昊
    杜守继
    Journal of Donghua University (English Edition), 2023, 40 (02) : 202 - 215