Privacy-preserving data sharing infrastructures for medical research: systematization and comparison

被引:33
|
作者
Wirth, Felix Nikolaus [1 ]
Meurers, Thierry [1 ]
Johns, Marco [1 ]
Prasser, Fabian [1 ]
机构
[1] Univmed Berlin, Berlin Inst Hlth, Charitepl 1, D-10117 Berlin, Germany
关键词
Biomedical data sharing; Privacy; Usefulness; Systematization; Distributed computing; Secure multi-party computing; Data enclave; HEALTH; ANALYTICS; ANONYMITY; RISK;
D O I
10.1186/s12911-021-01602-x
中图分类号
R-058 [];
学科分类号
摘要
Background Data sharing is considered a crucial part of modern medical research. Unfortunately, despite its advantages, it often faces obstacles, especially data privacy challenges. As a result, various approaches and infrastructures have been developed that aim to ensure that patients and research participants remain anonymous when data is shared. However, privacy protection typically comes at a cost, e.g. restrictions regarding the types of analyses that can be performed on shared data. What is lacking is a systematization making the trade-offs taken by different approaches transparent. The aim of the work described in this paper was to develop a systematization for the degree of privacy protection provided and the trade-offs taken by different data sharing methods. Based on this contribution, we categorized popular data sharing approaches and identified research gaps by analyzing combinations of promising properties and features that are not yet supported by existing approaches. Methods The systematization consists of different axes. Three axes relate to privacy protection aspects and were adopted from the popular Five Safes Framework: (1) safe data, addressing privacy at the input level, (2) safe settings, addressing privacy during shared processing, and (3) safe outputs, addressing privacy protection of analysis results. Three additional axes address the usefulness of approaches: (4) support for de-duplication, to enable the reconciliation of data belonging to the same individuals, (5) flexibility, to be able to adapt to different data analysis requirements, and (6) scalability, to maintain performance with increasing complexity of shared data or common analysis processes. Results Using the systematization, we identified three different categories of approaches: distributed data analyses, which exchange anonymous aggregated data, secure multi-party computation protocols, which exchange encrypted data, and data enclaves, which store pooled individual-level data in secure environments for access for analysis purposes. We identified important research gaps, including a lack of approaches enabling the de-duplication of horizontally distributed data or providing a high degree of flexibility. Conclusions There are fundamental differences between different data sharing approaches and several gaps in their functionality that may be interesting to investigate in future work. Our systematization can make the properties of privacy-preserving data sharing infrastructures more transparent and support decision makers and regulatory authorities with a better understanding of the trade-offs taken.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Privacy-preserving data sharing infrastructures for medical research: systematization and comparison
    Felix Nikolaus Wirth
    Thierry Meurers
    Marco Johns
    Fabian Prasser
    [J]. BMC Medical Informatics and Decision Making, 21
  • [2] A Review of Secure and Privacy-Preserving Medical Data Sharing
    Jin, Hao
    Luo, Yan
    Li, Peilong
    Mathew, Jomol
    [J]. IEEE ACCESS, 2019, 7 : 61656 - 61669
  • [3] A Privacy-Preserving Medical Data Sharing Scheme Based on Blockchain
    Xu, Guangquan
    Qi, Chen
    Dong, Wenyu
    Gong, Lixiao
    Liu, Shaoying
    Chen, Si
    Liu, Jian
    Zheng, Xi
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (02) : 698 - 709
  • [4] Data Sharing and Privacy-Preserving of Medical Records Using Blockchain
    Kavathekar, Shraddha Suhas
    Patil, Rahul
    [J]. SUSTAINABLE COMMUNICATION NETWORKS AND APPLICATION, ICSCN 2019, 2020, 39 : 65 - 72
  • [5] Privacy-Preserving Federated Data Sharing
    Fioretto, Ferdinando
    Van Hentenryck, Pascal
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 638 - 646
  • [6] RESEARCH ON PRIVACY-PRESERVING PROBLEM FOR DATA-SHARING IN IDS
    Wang, Wenbin
    Sun, Qibo
    Yan, Danfeng
    [J]. CIICT 2008: PROCEEDINGS OF CHINA-IRELAND INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATIONS TECHNOLOGIES 2008, 2008, : 320 - 323
  • [7] MedShare: A Privacy-Preserving Medical Data Sharing System by Using Blockchain
    Wang, Mingyue
    Guo, Yu
    Zhang, Chen
    Wang, Cong
    Huang, Hejiao
    Jia, Xiaohua
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (01) : 438 - 451
  • [8] Lightweight Privacy-Preserving Data Sharing Scheme for Internet of Medical Things
    Zhao, Zhuo
    Hsu, Chingfang
    Harn, Lein
    Yang, Qing
    Ke, Lulu
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [9] A Secure and Privacy-Preserving Medical Data Sharing via Consortium Blockchain
    Zhang, Duo
    Wang, Shangping
    Zhang, Yinglong
    Zhang, Qian
    Zhang, Yaling
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [10] A Privacy-Preserving Medical Data Sharing Scheme Based on Consortium Blockchain
    Liu, Jingwei
    Liang, Tianyu
    Sun, Rong
    Du, Xiaojiang
    Guizani, Mohsen
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,