Privacy-preserving record linkage in large databases using secure multiparty computation

Cited by: 16
Authors
Laud, Peeter [1]
Pankova, Alisa [1, 2]
Affiliations
[1] Cybernet AS, Ulikooli 2, EE-51003 Tartu, Estonia
[2] STACC, Ulikooli 2, EE-51003 Tartu, Estonia
Source
BMC MEDICAL GENOMICS | 2018 / Vol. 11
Keywords
Secure multiparty computation; Privacy-preserving record linkage; Deduplication; Privacy;
DOI
10.1186/s12920-018-0400-8
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Background: Practical applications of data analysis may require combining multiple databases that belong to different owners, such as health centers. The analysis should be performed without violating the privacy of either the centers themselves or the patients whose records these centers store. To avoid biased analysis results, it may be important to remove duplicate records among the centers, so that each patient's data is taken into account only once. This task is very closely related to privacy-preserving record linkage. Methods: This paper presents a solution to privacy-preserving deduplication among the records of several databases using secure multiparty computation. It is built upon one of the fastest practical secure multiparty computation platforms, called Sharemind. Results: Tests on ca. 10 million records of simulated databases, with 1000 health centers of 10000 records each, show that the computation is feasible in practice. The expected running time of the experiment is ca. 30 min for computing servers connected over a 100 Mbit/s WAN, the expected error of the results is 2^-40, and no errors were detected for the particular test set used for the benchmarks. Conclusions: The solution is ready for practical use. It has well-defined security properties, implied by the properties of the Sharemind platform. The solution assumes that exact matching of records is required; a possible direction for future research is extending it to approximate matching.
Pages: 14
Related papers
50 in total
  • [1] Privacy-preserving record linkage in large databases using secure multiparty computation
    Peeter Laud
    Alisa Pankova
    [J]. BMC Medical Genomics, 11
  • [2] Privacy-Preserving Biometric Identification Using Secure Multiparty Computation
    Bringer, Julien
    Chabanne, Herve
    Patey, Alain
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2013, 30 (02) : 42 - 52
  • [3] Privacy-Preserving Feature Selection with Secure Multiparty Computation
    Li, Xiling
    Dowsley, Rafael
    De Cock, Martine
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [4] Secure multiparty computation for privacy-preserving drug discovery
    Ma, Rong
    Li, Yi
    Li, Chenxing
    Wan, Fangping
    Hu, Hailin
    Xu, Wei
    Zeng, Jianyang
    [J]. BIOINFORMATICS, 2020, 36 (09) : 2872 - 2880
  • [6] Mainzelliste SecureEpiLinker (MainSEL): privacy-preserving record linkage using secure multi-party computation
    Stammler, Sebastian
    Kussel, Tobias
    Schoppmann, Phillipp
    Stampe, Florian
    Tremper, Galina
    Katzenbeisser, Stefan
    Hamacher, Kay
    Lablans, Martin
    [J]. BIOINFORMATICS, 2022, 38 (06) : 1657 - 1668
  • [7] Fast Privacy-Preserving Text Classification Based on Secure Multiparty Computation
    Resende, Amanda
    Railsback, Davis
    Dowsley, Rafael
    Nascimento, Anderson C. A.
    Aranha, Diego E.
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 428 - 442
  • [8] Privacy-Preserving Deep Learning Based on Multiparty Secure Computation: A Survey
    Zhang, Qiao
    Xin, Chunsheng
    Wu, Hongyi
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (13) : 10412 - 10429
  • [9] MPloC: Privacy-Preserving IP Verification using Logic Locking and Secure Multiparty Computation
    Mouris, Dimitris
    Gouert, Charles
    Tsoutsos, Nektarios Georgios
    [J]. 2023 IEEE 29TH INTERNATIONAL SYMPOSIUM ON ON-LINE TESTING AND ROBUST SYSTEM DESIGN, IOLTS, 2023,
  • [10] Accurate privacy-preserving record linkage for databases with missing values
    Vaiwsri, Sirintra
    Ranbaduge, Thilina
    Christen, Peter
    Schnell, Rainer
    [J]. INFORMATION SYSTEMS, 2022, 106