A scalable privacy-preserving framework for temporal record linkage

被引:0
|
作者
Thilina Ranbaduge
Peter Christen
机构
[1] Australian National University,Research School of Computer Science
来源
关键词
Secure multiparty computation; Encryption; Temporal records;
D O I
暂无
中图分类号
学科分类号
摘要
Record linkage (RL) is the process of identifying matching records from different databases that refer to the same entity. In many applications, it is common that the attribute values of records that belong to the same entity evolve over time, for example people can change their surname or address. Therefore, to identify the records that refer to the same entity over time, RL should make use of temporal information such as the time-stamp of when a record was created and/or update last. However, if RL needs to be conducted on information about people, due to privacy and confidentiality concerns organisations are often not willing or allowed to share sensitive data in their databases, such as personal medical records or location and financial details, with other organisations. This paper proposes a scalable framework for privacy-preserving temporal record linkage that can link different databases while ensuring the privacy of sensitive data in these databases. We propose two protocols that can be used in different linkage scenarios with and without a third party. Our protocols use Bloom filter encoding which incorporates the temporal information available in records during the linkage process. Our approaches first securely calculate the probabilities of entities changing attribute values in their records over a period of time. Based on these probabilities, we then generate a set of masking Bloom filters to adjust the similarities between record pairs. We provide a theoretical analysis of the complexity and privacy of our techniques and conduct an empirical study on large real databases containing several millions of records. The experimental results show that our approaches can achieve better linkage quality compared to non-temporal PPRL while providing privacy to individuals in the databases that are being linked.
引用
收藏
页码:45 / 78
页数:33
相关论文
共 50 条
  • [1] A scalable privacy-preserving framework for temporal record linkage
    Ranbaduge, Thilina
    Christen, Peter
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (01) : 45 - 78
  • [2] Privacy-Preserving Temporal Record Linkage
    Ranbaduge, Thilina
    Christen, Peter
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 377 - 386
  • [3] ScaDS Research on Scalable Privacy-preserving Record Linkage
    Martin Franke
    Marcel Gladbach
    Ziad Sehili
    Florens Rohde
    Erhard Rahm
    [J]. Datenbank-Spektrum, 2019, 19 (1) : 31 - 40
  • [4] A Vulnerability Assessment Framework for Privacy-preserving Record Linkage
    Vidanage, Anushka
    Christen, Peter
    Ranbaduge, Thilina
    Schnell, Rainer
    [J]. ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2023, 26 (03)
  • [5] Privacy-preserving record linkage
    Verykios, Vassilios S.
    Christen, Peter
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 3 (05) : 321 - 332
  • [6] Privacy-Preserving Record Linkage
    Hall, Rob
    Fienberg, Stephen E.
    [J]. PRIVACY IN STATISTICAL DATABASES, 2010, 6344 : 269 - +
  • [7] Semantic privacy-preserving framework for electronic health record linkage
    Lu, Yang
    Sinnott, Richard O.
    [J]. TELEMATICS AND INFORMATICS, 2018, 35 (04) : 737 - 752
  • [8] Privacy-Preserving Record Linkage with Spark
    Valkering, Onno
    Belloum, Adam
    [J]. 2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 440 - 448
  • [9] FEDERAL: A Framework for Distance-Aware Privacy-Preserving Record Linkage
    Karapiperis, Dimitrios
    Gkoulalas-Divanis, Aris
    Verykios, Vassilios S.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (02) : 292 - 304
  • [10] A Practical and Scalable Privacy-preserving Framework
    Avgerinos, Nikos
    D'Antonio, Salvatore
    Kamara, Irene
    Kotselidis, Christos
    Lazarou, Ioannis
    Mannarino, Teresa
    Meditskos, Georgios
    Papachristopoulou, Konstantina
    Papoutsis, Angelos
    Roccetti, Paolo
    Zuber, Martin
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2023, : 598 - 603