Privacy-preserving data sharing infrastructures for medical research: systematization and comparison

被引:33
|
作者
Wirth, Felix Nikolaus [1 ]
Meurers, Thierry [1 ]
Johns, Marco [1 ]
Prasser, Fabian [1 ]
机构
[1] Univmed Berlin, Berlin Inst Hlth, Charitepl 1, D-10117 Berlin, Germany
关键词
Biomedical data sharing; Privacy; Usefulness; Systematization; Distributed computing; Secure multi-party computing; Data enclave; HEALTH; ANALYTICS; ANONYMITY; RISK;
D O I
10.1186/s12911-021-01602-x
中图分类号
R-058 [];
学科分类号
摘要
Background Data sharing is considered a crucial part of modern medical research. Unfortunately, despite its advantages, it often faces obstacles, especially data privacy challenges. As a result, various approaches and infrastructures have been developed that aim to ensure that patients and research participants remain anonymous when data is shared. However, privacy protection typically comes at a cost, e.g. restrictions regarding the types of analyses that can be performed on shared data. What is lacking is a systematization making the trade-offs taken by different approaches transparent. The aim of the work described in this paper was to develop a systematization for the degree of privacy protection provided and the trade-offs taken by different data sharing methods. Based on this contribution, we categorized popular data sharing approaches and identified research gaps by analyzing combinations of promising properties and features that are not yet supported by existing approaches. Methods The systematization consists of different axes. Three axes relate to privacy protection aspects and were adopted from the popular Five Safes Framework: (1) safe data, addressing privacy at the input level, (2) safe settings, addressing privacy during shared processing, and (3) safe outputs, addressing privacy protection of analysis results. Three additional axes address the usefulness of approaches: (4) support for de-duplication, to enable the reconciliation of data belonging to the same individuals, (5) flexibility, to be able to adapt to different data analysis requirements, and (6) scalability, to maintain performance with increasing complexity of shared data or common analysis processes. Results Using the systematization, we identified three different categories of approaches: distributed data analyses, which exchange anonymous aggregated data, secure multi-party computation protocols, which exchange encrypted data, and data enclaves, which store pooled individual-level data in secure environments for access for analysis purposes. We identified important research gaps, including a lack of approaches enabling the de-duplication of horizontally distributed data or providing a high degree of flexibility. Conclusions There are fundamental differences between different data sharing approaches and several gaps in their functionality that may be interesting to investigate in future work. Our systematization can make the properties of privacy-preserving data sharing infrastructures more transparent and support decision makers and regulatory authorities with a better understanding of the trade-offs taken.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Similarity Test for Privacy-Preserving Medical Data Sharing Based on NTRU Encryption
    Xie, Shaofen
    Wu, Faguo
    Zhang, Xiao
    Yao, Wang
    Zheng, Zhiming
    [J]. PROCEEDINGS OF 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2019), 2019, : 20 - 23
  • [22] Toward Secure, Privacy-Preserving, and Interoperable Medical Data Sharing via Blockchain
    Jin, Hao
    Xu, Chen
    Luo, Yan
    Li, Peilong
    Cao, Yu
    Mathew, Jomol
    [J]. 2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 852 - 861
  • [23] PriveTAB : Secure and Privacy-Preserving sharing of Tabular Data
    Kotal, Anantaa
    Piplai, Aritran
    Chukkapalli, Sai Sree Laya
    Joshi, Anupam
    [J]. PROCEEDINGS OF THE 2022 ACM INTERNATIONAL WORKSHOP ON SECURITY AND PRIVACY ANALYTICS (IWSPA '22), 2022, : 35 - 45
  • [24] A Framework for Privacy-Preserving Data Sharing in the Smart Grid
    Alharbi, Khalil
    Lin, Xiaodong
    Shao, Jun
    [J]. 2014 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2014, : 214 - 219
  • [25] Privacy-Preserving Data Sharing in Smart Grid Systems
    Yang, Lei
    Xue, Hao
    Li, Fengjun
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2014, : 878 - 883
  • [26] A Privacy-Preserving Data Sharing Solution for Mobile Healthcare
    Huang, Chanying
    Yan, Kedong
    Wei, Songjie
    Lee, Dong Hoon
    [J]. PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC 2017), 2017, : 260 - 265
  • [27] Scalable and Privacy-Preserving Data Sharing Based on Blockchain
    Zheng, Bao-Kun
    Zhu, Lie-Huang
    Shen, Meng
    Gao, Feng
    Zhang, Chuan
    Li, Yan-Dong
    Yang, Jing
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2018, 33 (03) : 557 - 567
  • [28] PRShare: A Framework for Privacy-preserving, Interorganizational Data Sharing
    Idan, Lihi
    Feigenbaum, Joan
    [J]. ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2022, 25 (04)
  • [29] Privacy-preserving data sharing via probabilistic modeling
    Jalko, Joonas
    Lagerspetz, Eemil
    Haukka, Jari
    Tarkoma, Sasu
    Honkela, Antti
    Kaski, Samuel
    [J]. PATTERNS, 2021, 2 (07):
  • [30] Towards Privacy-preserving Data Sharing in Smart Environments
    Hernandez-Ramos, Jose L.
    Bernal Bernabe, Jorge
    Skarmeta, Antonio F.
    [J]. 2014 EIGHTH INTERNATIONAL CONFERENCE ON INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING (IMIS), 2014, : 334 - 339