Private Blocking Technique for Multi-party Privacy-Preserving Record Linkage

被引:5
|
作者
Han S. [1 ]
Shen D. [1 ]
Nie T. [1 ]
Kou Y. [1 ]
Yu G. [1 ]
机构
[1] College of Computer Science and Engineering, Northeastern University, Shenyang
基金
中国国家自然科学基金;
关键词
k-anonymity; Paillier cryptosystem; Privacy-preserving record linkage; Private blocking; Scalability;
D O I
10.1007/s41019-017-0041-5
中图分类号
学科分类号
摘要
The process of matching and integrating records that relate to the same entity from one or more datasets is known as record linkage, and it has become an increasingly important subject in many application areas, including business, government and health system. The data from these areas often contain sensitive information. To prevent privacy breaches, ideally records should be linked in a private way such that no information other than the matching result is leaked in the process, and this technique is called privacy-preserving record linkage (PPRL). With the increasing data, scalability becomes the main challenge of PPRL, and many private blocking techniques have been developed for PPRL. They are aimed at reducing the number of record pairs to be compared in the matching process by removing obvious non-matching pairs without compromising privacy. However, most of them are designed for two databases and they vary widely in their ability to balance competing goals of accuracy, efficiency and security. In this paper, we propose a novel private blocking approach for PPRL based on dynamic k-anonymous blocking and Paillier cryptosystem which can be applied on two or multiple databases. In dynamic k-anonymous blocking, our approach dynamically generates blocks satisfying k-anonymity and more accurate values to represent the blocks with varying k. We also propose a novel similarity measure method which performs on the numerical attributes and combines with Paillier cryptosystem to measure the similarity of two or more blocks in security, which provides strong privacy guarantees that none information reveals even collusion. Experiments conducted on a public dataset of voter registration records validate that our approach is scalable to large databases and keeps a high quality of blocking. We compare our method with other techniques and demonstrate the increases in security and accuracy. © 2017, The Author(s).
引用
收藏
页码:187 / 196
页数:9
相关论文
共 50 条
  • [1] Hashing-Based Distributed Multi-party Blocking for Privacy-Preserving Record Linkage
    Ranbaduge, Thilina
    Vatsalan, Dinusha
    Christen, Peter
    Verykios, Vassilios
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT II, 2016, 9652 : 415 - 427
  • [2] MERLIN - A Tool for Multi-party Privacy-preserving Record Linkage
    Ranbaduge, Thilina
    Vatsalan, Dinusha
    Christen, Peter
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1640 - 1643
  • [3] Incremental clustering techniques for multi-party Privacy-Preserving Record Linkage
    Vatsalan, Dinusha
    Christen, Peter
    Rahm, Erhard
    [J]. DATA & KNOWLEDGE ENGINEERING, 2020, 128 (128)
  • [4] A Multi-Party Privacy-Preserving Record Linkage Method Based on Secondary Encoding
    Han, Shumin
    Wang, Yizi
    Shen, Derong
    Wang, Chuang
    [J]. MATHEMATICS, 2024, 12 (12)
  • [5] A Parallel Multi-Party Privacy-Preserving Record Linkage Method Based on a Consortium Blockchain
    Han, Shumin
    Wang, Zikang
    Shen, Dengrong
    Wang, Chuang
    [J]. MATHEMATICS, 2024, 12 (12)
  • [6] Clustering-Based Scalable Indexing for Multi-party Privacy-Preserving Record Linkage
    Ranbaduge, Thilina
    Vatsalan, Dinusha
    Christen, Peter
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART II, 2015, 9078 : 549 - 561
  • [7] Enhanced Multi-Party Privacy-Preserving Record Linkage Using Trusted Execution Environments
    Han, Shumin
    Shen, Kuixing
    Shen, Derong
    Wang, Chuang
    [J]. MATHEMATICS, 2024, 12 (15)
  • [8] Mainzelliste SecureEpiLinker (MainSEL): privacy-preserving record linkage using secure multi-party computation
    Stammler, Sebastian
    Kussel, Tobias
    Schoppmann, Phillipp
    Stampe, Florian
    Tremper, Galina
    Katzenbeisser, Stefan
    Hamacher, Kay
    Lablans, Martin
    [J]. BIOINFORMATICS, 2022, 38 (06) : 1657 - 1668
  • [9] A Tutorial on Blocking Methods for Privacy-Preserving Record Linkage
    Karapiperis, Dimitrios
    Verykios, Vassilios S.
    Katsiri, Eleftheria
    Delis, Alex
    [J]. ALGORITHMIC ASPECTS OF CLOUD COMPUTING, ALGOCLOUD 2015, 2016, 9511 : 3 - 15
  • [10] Privacy-Preserving Multi-Party Clustering: An Empirical Study
    Silva, Arlei
    Bellala, Gowtham
    [J]. 2017 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2017, : 326 - 333