Composing Differential Privacy and Secure Computation: A case study on scaling private record linkage

被引:42
|
作者
He, Xi [1 ]
Machanavajjhala, Ashwin [1 ]
Flynn, Cheryl [2 ]
Srivastava, Divesh [2 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] AT&T Labs Res, Florham Pk, NJ USA
基金
美国国家科学基金会;
关键词
D O I
10.1145/3133956.3134030
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Private record linkage (PRL) is the problem of identifying pairs of records that are similar as per an input matching rule from databases held by two parties that do not trust one another. We identify three key desiderata that a PRL solution must ensure: (1) perfect precision and high recall of matching pairs, (2) a proof of end-to-end privacy, and (3) communication and computational costs that scale subquadratically in the number of input records. We show that all of the existing solutions for PRL-including secure 2-party computation (S2PC), and their variants that use non-private or differentially private (DP) blocking to ensure subquadratic cost -violate at least one of the three desiderata. In particular, S2PC techniques guarantee end-to-end privacy but have either low recall or quadratic cost. In contrast, no end-to-end privacy guarantee has been formalized for solutions that achieve subquadratic cost. This is true even for solutions that compose DP and S2PC: DP does not permit the release of any exact information about the databases, while S2PC algorithms for PRL allow the release of matching records. In light of this deficiency, we propose a novel privacy model, called output constrained differential privacy, that shares the strong privacy protection of DP, but allows for the truthful release of the output of a certain function applied to the data. We apply this to PRL, and show that protocols satisfying this privacy model permit the disclosure of the true matching records, but their execution is insensitive to the presence or absence of a single non-matching record. We find that prior work that combine DP and S2PC techniques even fail to satisfy this end-to-end privacy model. Hence, we develop novel protocols that provably achieve this end-to-end privacy guarantee, together with the other two desiderata of PRL. Our empirical evaluation also shows that our protocols obtain high recall, scale near linearly in the size of the input databases and the output set of matching pairs, and have communication and computational costs that are at least 2 orders of magnitude smaller than S2PC baselines.
引用
收藏
页码:1389 / 1406
页数:18
相关论文
共 50 条
  • [1] Privacy-preserving record linkage in large databases using secure multiparty computation
    Peeter Laud
    Alisa Pankova
    [J]. BMC Medical Genomics, 11
  • [2] Privacy-preserving record linkage in large databases using secure multiparty computation
    Laud, Peeter
    Pankova, Alisa
    [J]. BMC MEDICAL GENOMICS, 2018, 11
  • [4] Mainzelliste SecureEpiLinker (MainSEL): privacy-preserving record linkage using secure multi-party computation
    Stammler, Sebastian
    Kussel, Tobias
    Schoppmann, Phillipp
    Stampe, Florian
    Tremper, Galina
    Katzenbeisser, Stefan
    Hamacher, Kay
    Lablans, Martin
    [J]. BIOINFORMATICS, 2022, 38 (06) : 1657 - 1668
  • [5] Differential Privacy for the Analyst via Private Equilibrium Computation
    Hsu, Justin
    Roth, Aaron
    Ullman, Jonathan
    [J]. STOC'13: PROCEEDINGS OF THE 2013 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2013, : 341 - 350
  • [6] Secure Approximate String Matching for Privacy-Preserving Record Linkage
    Essex, Aleksander
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (10) : 2623 - 2632
  • [7] A Highly Efficient and Secure Multidimensional Blocking Approach For Private Record Linkage
    Karakasidis, Alexandros
    Verykios, Vassilios S.
    [J]. 2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012, : 428 - 435
  • [8] Cryptographically Secure Private Record Linkage Using Locality-Sensitive Hashing
    Wei, Ruidi
    Kerschbaum, Florian
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 17 (02): : 79 - 91
  • [9] Differential Cryptanalysis of Bloom Filters for Privacy-Preserving Record Linkage
    Yin, Weifeng
    Yuan, Lifeng
    Ren, Yizhi
    Meng, Weizhi
    Wang, Dong
    Wang, Qiuhua
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 6665 - 6678
  • [10] SPGC: An Integrated Framework of Secure Computation and Differential Privacy for Collaborative Learning
    Iwahana, Kazuki
    Yanai, Naoto
    Cruz, Jason Paul
    Fujiwara, Toru
    [J]. DATA PRIVACY MANAGEMENT, CRYPTOCURRENCIES AND BLOCKCHAIN TECHNOLOGY, ESORICS 2021, 2022, 13140 : 89 - 105