Cryptographically Secure Private Record Linkage Using Locality-Sensitive Hashing

Cited by: 0
Authors
Wei, Ruidi [1]
Kerschbaum, Florian [1]
Affiliations
[1] Univ Waterloo, Waterloo, ON, Canada
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2023 / Vol. 17 / No. 02
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC);
Keywords
DOI
10.14778/3626292.3626293
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Private record linkage (PRL) is the problem of identifying pairs of records that approximately match across datasets in a secure, privacy-preserving manner. Two-party PRL allows each party to obtain records from the other party only if each such record matches one of its own. The privacy goal is that no information about the datasets other than the matching records should be released. A fundamental challenge is to avoid leaking information while also avoiding a comparison of all pairs of records. In plaintext record linkage this is done using a blocking strategy, e.g., locality-sensitive hashing. One recent approach proposed by He et al. (ACM CCS 2017) uses locality-sensitive hashing and then releases a provably differentially private representation of the hash bins. However, differential privacy still leaks some information, albeit a provably bounded amount, and does not protect against attacks such as property inference attacks. Another recent approach by Khurram and Kerschbaum (IEEE ICDE 2020) uses locality-preserving hashing and provides cryptographic security, i.e., it releases no information except the output. However, locality-preserving hash functions are much harder to construct than locality-sensitive hash functions, and hence the accuracy of this approach is limited, particularly on larger datasets. In this paper, we address the open problem of providing cryptographic security for PRL while using locality-sensitive hash functions. Using recent results in oblivious algorithms, we design a new cryptographically secure PRL protocol with locality-sensitive hash functions. Our prototype implementation can match 40000 records in the British National Library/Toronto Public Library and the North Carolina Voter Registry datasets with 99.3% and 99.9% accuracy, respectively, in less than an hour, which is more than an order of magnitude faster than Khurram and Kerschbaum's work at a higher accuracy.
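The plaintext blocking step mentioned in the abstract can be illustrated with a minimal sketch. It is not the protocol from the paper and provides no privacy: it only shows how locality-sensitive hashing (here MinHash over character bigrams with banding, chosen for illustration) limits comparisons to records that share a hash bin. All function names and parameters (`shingles`, `minhash_signature`, `candidate_pairs`, `bands`, `rows`) are hypothetical.

```python
import hashlib
from collections import defaultdict


def shingles(text, k=2):
    """Return the set of lowercase character k-grams of a record string."""
    s = text.lower()
    return {s[i:i + k] for i in range(max(len(s) - k + 1, 1))}


def _hash(seed, token):
    """Deterministic 64-bit hash of a token under a given seed (illustrative)."""
    digest = hashlib.sha256(f"{seed}:{token}".encode()).digest()
    return int.from_bytes(digest[:8], "big")


def minhash_signature(text, num_hashes=20):
    """MinHash signature: for each seed, the minimum hash over all shingles."""
    grams = shingles(text)
    return tuple(min(_hash(seed, g) for g in grams) for seed in range(num_hashes))


def candidate_pairs(records_a, records_b, bands=10, rows=2):
    """Plaintext LSH blocking: pair up records that share at least one band bin."""
    # Each bin holds the indices from dataset A and dataset B that hashed into it.
    bins = defaultdict(lambda: (set(), set()))
    for side, records in enumerate((records_a, records_b)):
        for idx, record in enumerate(records):
            sig = minhash_signature(record, bands * rows)
            for b in range(bands):
                key = (b, sig[b * rows:(b + 1) * rows])
                bins[key][side].add(idx)
    pairs = set()
    for left, right in bins.values():
        pairs.update((i, j) for i in left for j in right)
    return pairs
```

Calling `candidate_pairs(["john smith"], ["jon smith", "mary jones"])` returns the set of index pairs that fall into a common bin; only those pairs would then be passed to an exact similarity comparison. More bands with fewer rows make the blocking more permissive, while fewer bands with more rows make it stricter.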
Pages: 79 - 91
Page count: 13
Related papers
50 in total
  • [41] On the Problem of $p_1^{-1}$ in Locality-Sensitive Hashing
    Ahle, Thomas Dybdahl
    [J]. SIMILARITY SEARCH AND APPLICATIONS, SISAP 2020, 2020, 12440 : 85 - 93
  • [42] Locality-Sensitive Hashing for Finding Nearest Neighbors in Probability Distributions
    Tang, Yi-Kun
    Mao, Xian-Ling
    Hao, Yi-Jing
    Xu, Cheng
    Huang, Heyan
    [J]. SOCIAL MEDIA PROCESSING, SMP 2017, 2017, 774 : 3 - 15
  • [43] Digital Watermarks for Videos Based on a Locality-Sensitive Hashing Algorithm
    Sun, Yajuan
    Srivastava, Gautam
    [J]. MOBILE NETWORKS & APPLICATIONS, 2023, 28 (05): 1724 - 1737
  • [44] Fast Access for Star Catalog Based on Locality-Sensitive Hashing
    Zhu H.
    Liang B.
    Zhang T.
    [J]. Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2018, 36 (05): 988 - 994
  • [45] Fast anomaly detection with locality-sensitive hashing and hyperparameter autotuning
    Meira, Jorge
    Eiras-Franco, Carlos
    Bolon-Canedo, Veronica
    Marreiros, Goreti
    Alonso-Betanzos, Amparo
    [J]. INFORMATION SCIENCES, 2022, 607 : 1245 - 1264
  • [46] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Liu, Renyang
    Zhao, Jun
    Chu, Xing
    Liang, Yu
    Zhou, Wei
    He, Jing
    [J]. SOFT COMPUTING, 2024, 28 (02) : 887 - 902
  • [47] A Scalable ECG Identification System Based on Locality-Sensitive Hashing
    Chu, Hui-Yu
    Lin, Tzu-Yun
    Lee, Song-Hong
    Chiu, Jui-Kun
    Nien, Cing-Ping
    Wu, Shun-Chi
    [J]. 2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [48] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Renyang Liu
    Jun Zhao
    Xing Chu
    Yu Liang
    Wei Zhou
    Jing He
    [J]. Soft Computing, 2024, 28 : 1041 - 1053
  • [49] A novel locality-sensitive hashing for large scale image retrieval
    Li, Junyi
    Li, Jianhua
    Ni, Bingbing
    Yan, Shuicheng
    [J]. Journal of Computational Information Systems, 2012, 8 (23): 9611 - 9617
  • [50] Maintaining Academic Integrity in Programming: Locality-Sensitive Hashing and Recommendations
    Karnalim, Oscar
    [J]. EDUCATION SCIENCES, 2023, 13 (01):