Synthetic Data Digital Twins and Data Trusts Control for Privacy in Health Data Sharing

被引:0
|
作者
Lomotey, Richard K. [1 ]
Kumi, Sandra [2 ]
Ray, Madhurima [3 ]
Deters, Ralph [2 ]
机构
[1] Penn State Univ, Informat Sci & Tech, Monaca, PA 15061 USA
[2] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK, Canada
[3] Penn State Univ, Dept Comp Sci, Monaca, PA USA
关键词
Synthetic Health Data; Digital Twins; Data Trusts; Machine Learning; Artificial Intelligence; Privacy; Middleware; FRAMEWORK;
D O I
10.1145/3643650.3658605
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Health data sharing is very valuable for medical research since it has the propensity to improve diagnostics, policy, medication, and so on. At the same time, sharing health data needs to be done without compromising the privacy of patients and stakeholders. However, recent advances in AI/ML and sophisticated analytics have proven to introduce biases that can easily identify patients based on their healthcare data, which violates privacy. In this work, we sort to address this major issue by exploring two emerging topics that are gaining attention from industry, academia, and governments, i.e., digital twins and data trusts. First, we proposed the use of digital twins (DTs) to generate synthetic records of patient's heart rate data. DTs are virtual replicas of the actual data and were created using two synthetic data generative models - Gaussian Copula (GC) and Tabular Variational Autoencoder (TVAE). The GC and TVAE achieved a maximum data quality score of 88% and 96% respectively. Next, we posit that the DTs should be shared with a data trusts layer. Data trusts are fiduciary frameworks that govern multi-party data sharing. The data trusts enforce access controls (based on metrics such as location, role-based, and policy-based) to the synthetic health data and reports to the data subject. The preliminary evaluations of the work show that merging the two techniques (i.e., synthetic data digital twins and data trusts) enforces better privacy for health data access. The synthetic data ensures more anonymization while the data trusts provide easy auditing, tracking, and efficient reporting to the patient or data subject. The paper also detailed the architectural design of the data trusts and evaluated the efficiency of the access control techniques.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [21] Towards effective data sharing in ophthalmology: data standardization and data privacy
    Halfpenny, William
    Baxter, Sally L.
    [J]. CURRENT OPINION IN OPHTHALMOLOGY, 2022, 33 (05) : 418 - 424
  • [22] Spatial data trusts: an emerging governance framework for sharing spatial data
    Radosevic, Nenad
    Duckham, Matt
    Rahaman, Mohammad Saiedur
    Ho, Serene
    Williams, Katherine
    Hashem, Tanzima
    Tao, Yaguang
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (01) : 1607 - 1639
  • [23] Mastering data privacy: leveraging K-anonymity for robust health data sharing
    Karagiannis, Stylianos
    Ntantogian, Christoforos
    Magkos, Emmanouil
    Tsohou, Aggeliki
    Ribeiro, Luis Landeiro
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2024, 23 (03) : 2189 - 2201
  • [24] Generation and evaluation of privacy preserving synthetic health data
    Yale, Andrew
    Dash, Saloni
    Dutta, Ritik
    Guyon, Isabelle
    Pavao, Adrien
    Bennett, Kristin P.
    [J]. NEUROCOMPUTING, 2020, 416 : 244 - 255
  • [25] Edge Centric Secure Data Sharing with Digital Twins in Smart Ecosystems
    Cathey, Glen
    Benson, James
    Gupta, Maanak
    Sandhu, Ravi
    [J]. 2021 THIRD IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS (TPS-ISA 2021), 2021, : 70 - 79
  • [26] Data Visualization for Digital Twins
    Comba, Joao L. D.
    Santos, Nicolau O.
    Rivera, Jonathan C.
    Romeu, Regis K.
    Abel, Mara
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2023, 25 (02) : 58 - 63
  • [27] Digital Twins for Data Centers
    Athavale, Jyotika
    Bash, Cullen
    Brewer, Wesley
    Maiterth, Matthias
    Milojicic, Dejan
    Petty, Harry
    Sarkar, Soumyendu
    [J]. Computer, 2024, 57 (10) : 151 - 158
  • [28] Enabling Health Data Sharing with Fine-Grained Privacy
    Bonomi, Luca
    Gousheh, Sepand
    Fan, Liyue
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 131 - 141
  • [29] Privacy Risks of Sharing Data from Environmental Health Studies
    Boronow, Katherine E.
    Perovich, Laura J.
    Sweeney, Latanya
    Yoo, Ji Su
    Rudel, Ruthann A.
    Brown, Phil
    Brody, Julia Green
    [J]. ENVIRONMENTAL HEALTH PERSPECTIVES, 2020, 128 (01)
  • [30] On Measuring the Privacy of Anonymized Data in Multiparty Network Data Sharing
    Chen Xiaoyun
    Su Yujie
    Tang Xiaosheng
    Huang Xiaohong
    Ma Yan
    [J]. CHINA COMMUNICATIONS, 2013, 10 (05) : 120 - 127