A Game Theoretic Framework for Analyzing Re-Identification Risk

被引:22
|
作者
Wan, Zhiyu [1 ]
Vorobeychik, Yevgeniy [1 ]
Xia, Weiyi [1 ]
Clayton, Ellen Wright [2 ]
Kantarcioglu, Murat [3 ]
Ganta, Ranjit [3 ]
Heatherly, Raymond [4 ]
Malin, Bradley A. [4 ]
机构
[1] Vanderbilt Univ, Dept Elect Engn & Comp Sci, Nashville, TN 37235 USA
[2] Vanderbilt Univ, Ctr Biomed Eth & Soc, Nashville, TN 37235 USA
[3] Univ Texas Dallas, Dept Comp Sci, Richardson, TX 75083 USA
[4] Vanderbilt Univ, Dept Biomed Informat, Nashville, TN 37235 USA
来源
PLOS ONE | 2015年 / 10卷 / 03期
基金
美国国家科学基金会;
关键词
PRIVACY; NEIGHBORHOOD; RECORDS; SIZE;
D O I
10.1371/journal.pone.0120592
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Given the potential wealth of insights in personal data the big databases can provide, many organizations aim to share data while protecting privacy by sharing de-identified data, but are concerned because various demonstrations show such data can be re-identified. Yet these investigations focus on how attacks can be perpetrated, not the likelihood they will be realized. This paper introduces a game theoretic framework that enables a publisher to balance re-identification risk with the value of sharing data, leveraging a natural assumption that a recipient only attempts re-identification if its potential gains outweigh the costs. We apply the framework to a real case study, where the value of the data to the publisher is the actual grant funding dollar amounts from a national sponsor and the re-identification gain of the recipient is the fine paid to a regulator for violation of federal privacy rules. There are three notable findings: 1) it is possible to achieve zero risk, in that the recipient never gains from re-identification, while sharing almost as much data as the optimal solution that allows for a small amount of risk; 2) the zero-risk solution enables sharing much more data than a commonly invoked de-identification policy of the U.S. Health Insurance Portability and Accountability Act (HIPAA); and 3) a sensitivity analysis demonstrates these findings are robust to order-of-magnitude changes in player losses and gains. In combination, these findings provide support that such a framework can enable pragmatic policy decisions about de-identified data sharing.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Unified Framework for Joint Attribute Classification and Person Re-identification
    Sun, Chenxin
    Jiang, Na
    Zhang, Lei
    Wang, Yuehua
    Wu, Wei
    Zhou, Zhong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 637 - 647
  • [22] DeepBrainPrint: A Novel Contrastive Framework for Brain MRI Re-Identification
    Puglisi, Lemuel
    Barkhof, Frederik
    Alexander, Daniel C.
    Parker, Geoffrey J. M.
    Eshaghi, Arman
    Ravi, Daniele
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 716 - 729
  • [23] Estimating the re-identification risk of clinical data sets
    Dankar, Fida Kamal
    El Emam, Khaled
    Neisa, Angelica
    Roffey, Tyson
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2012, 12
  • [24] ICD-10-CM and the Risk of Re-Identification
    O'Neill, Liam
    INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2015, 10 (01) : IV - VII
  • [25] The risk of node re-identification in labeled social graphs
    Sameera Horawalavithana
    Juan Arroyo Flores
    John Skvoretz
    Adriana Iamnitchi
    Applied Network Science, 4
  • [26] Software Engineering Process for Developing a Person Re-identification Framework
    Fonseca Bustos, Jesus
    de la Torre Gomora, Miguel Angel
    Cervantes Alvarez, Salvador
    2018 7TH INTERNATIONAL CONFERENCE ON SOFTWARE PROCESS IMPROVEMENT (CIMPS): APPLICATIONS IN SOFTWARE ENGINEERING, 2018, : 69 - 77
  • [27] Quantifying the Re-identification Risk in Published Process Models
    Maatouk, Karim
    Mannhardt, Felix
    PROCESS MINING WORKSHOPS, ICPM 2021, 2022, 433 : 382 - 394
  • [28] Estimating Re-identification Risk by Means of Formal Conceptualization
    Aranda-Corral, Gonzalo A.
    Borrego-Diaz, Joaquin
    Galan-Paez, Juan
    14TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN SECURITY FOR INFORMATION SYSTEMS AND 12TH INTERNATIONAL CONFERENCE ON EUROPEAN TRANSNATIONAL EDUCATIONAL (CISIS 2021 AND ICEUTE 2021), 2022, 1400 : 13 - 22
  • [29] The re-identification risk of Canadians from longitudinal demographics
    Khaled El Emam
    David Buckeridge
    Robyn Tamblyn
    Angelica Neisa
    Elizabeth Jonker
    Aman Verma
    BMC Medical Informatics and Decision Making, 11
  • [30] Estimating the re-identification risk of clinical data sets
    Fida Kamal Dankar
    Khaled El Emam
    Angelica Neisa
    Tyson Roffey
    BMC Medical Informatics and Decision Making, 12