Privacy-Preserving Temporal Record Linkage

被引:4
|
作者
Ranbaduge, Thilina [1 ]
Christen, Peter [1 ]
机构
[1] Australian Natl Univ, Res Sch Comp Sci, Canberra, ACT 0200, Australia
基金
澳大利亚研究理事会;
关键词
D O I
10.1109/ICDM.2018.00053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Record linkage (RL) is the process of identifying matching records from different databases that refer to the same entity. It is common that the attribute values of records that belong to the same entity do evolve over time, for example people can change their surname or address. Therefore, to identify the records that refer to the same entity over time, RL should make use of temporal information such as the time-stamp of when a record was created and/or update last. However, if RL needs to be conducted on information about people, due to privacy and confidentiality concerns organizations are often not willing or allowed to share sensitive data in their databases, such as personal medical records, or location and financial details, with other organizations. This paper is the first to propose a privacy-preserving temporal record linkage (PPTRL) protocol that can link records across different databases while ensuring the privacy of the sensitive data in these databases. We propose a novel protocol based on Bloom filter encoding which incorporates the temporal information available in records during the linkage process. Our approach uses homomorphic encryption to securely calculate the probabilities of entities changing attribute values in their records over a period of time. Based on these probabilities we generate a set of masking Bloom filters to adjust the similarities between record pairs. We provide a theoretical analysis of the complexity and privacy of our technique and conduct an empirical study on large real databases containing several millions of records. The experimental results show that our approach can achieve better linkage quality compared to non-temporal PPRL while providing privacy to individuals in the databases that are being linked.
引用
收藏
页码:377 / 386
页数:10
相关论文
共 50 条
  • [1] A scalable privacy-preserving framework for temporal record linkage
    Thilina Ranbaduge
    Peter Christen
    [J]. Knowledge and Information Systems, 2020, 62 : 45 - 78
  • [2] A scalable privacy-preserving framework for temporal record linkage
    Ranbaduge, Thilina
    Christen, Peter
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (01) : 45 - 78
  • [3] Privacy-preserving record linkage
    Verykios, Vassilios S.
    Christen, Peter
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 3 (05) : 321 - 332
  • [4] Privacy-Preserving Record Linkage
    Hall, Rob
    Fienberg, Stephen E.
    [J]. PRIVACY IN STATISTICAL DATABASES, 2010, 6344 : 269 - +
  • [5] Privacy-Preserving Record Linkage with Spark
    Valkering, Onno
    Belloum, Adam
    [J]. 2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 440 - 448
  • [6] Privacy-preserving record linkage using autoencoders
    Victor Christen
    Tim Häntschel
    Peter Christen
    Erhard Rahm
    [J]. International Journal of Data Science and Analytics, 2023, 15 : 347 - 357
  • [7] A taxonomy of privacy-preserving record linkage techniques
    Vatsalan, Dinusha
    Christen, Peter
    Verykios, Vassilios S.
    [J]. INFORMATION SYSTEMS, 2013, 38 (06) : 946 - 969
  • [8] Privacy-preserving record linkage using autoencoders
    Christen, Victor
    Haentschel, Tim
    Christen, Peter
    Rahm, Erhard
    [J]. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2023, 15 (04) : 347 - 357
  • [9] Privacy-Preserving Record Linkage for Cardinality Counting
    Wu, Nan
    Vatsalan, Dinusha
    Kaafar, Mohamed Ali
    Ramesh, Sanath Kumar
    [J]. PROCEEDINGS OF THE 2023 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, ASIA CCS 2023, 2023, : 53 - 64
  • [10] Towards Privacy-Preserving Record Linkage with Record-Wise Linkage Policy
    Kaiho, Takahito
    Lu, Wen-jie
    Amagasa, Toshiyuki
    Sakuma, Jun
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2017, PT I, 2017, 10438 : 233 - 248