Record Linkage Using Graph Consistency

被引:0
|
作者
Schraagen, Marijn [1 ]
Kosters, Walter [1 ]
机构
[1] Leiden Univ, Leiden Inst Adv Comp Sci, NL-2300 RA Leiden, Netherlands
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper provides a method for automated record linkage in the historical domain based on collective entity resolution. Multiple records are considered for linkage simultaneously, using plausible record sequences as a substitute for pair-wise record similarity measures such as string edit distance. The method is applied to the problem of family reconstruction from historical archives. A benchmark evaluation shows that the approach provides a computationally efficient way to produce family reconstructions which are useful in practise. Further improvements in linkage accuracy are expected by addressing data issues and linkage assumption violations.
引用
收藏
页码:471 / 483
页数:13
相关论文
共 50 条
  • [1] Knowledge graph based methods for record linkage
    Gautam B.
    Ramos Terrades O.
    Pujadas-Mora J.M.
    Valls M.
    Ramos Terrades, Oriol (oriolrt@cvc.uab.cat), 2020, Elsevier B.V., Netherlands (136) : 127 - 133
  • [2] SuperPart: Supervised graph partitioning for record linkage
    Reas, Russell
    Ash, Steve
    Barton, Rob
    Borthwick, Andrew
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 387 - 396
  • [3] Enhanced graph based genealogical record linkage
    Sweet, Cary
    Oezyer, Tansel
    Alhajj, Reda
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 476 - +
  • [4] Robust Temporal Graph Clustering for Group Record Linkage
    Nanayakkara, Charini
    Christen, Peter
    Ranbaduge, Thilina
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT II, 2019, 11440 : 526 - 538
  • [5] IMPROVING ACCURACY OF RECORD LINKAGE USING GRAPH STRUCTURES: RELEVANCE FOR HEALTH OUTCOMES RESEARCH?
    IJzerman, N.
    Lin, P.
    IJzerman, M.
    Aickelin, U.
    VALUE IN HEALTH, 2020, 23 : S323 - S323
  • [6] Use of graph theory measures to identify errors in record linkage
    Randall, Sean M.
    Boyd, James H.
    Ferrante, Anna M.
    Bauer, Jacqueline K.
    Semmens, James B.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2014, 115 (02) : 55 - 63
  • [7] A Graph Matching Attack on Privacy-Preserving Record Linkage
    Vidanage, Anushka
    Christen, Peter
    Ranbaduge, Thilina
    Schnell, Rainer
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1485 - 1494
  • [8] Clink A Novel Record Linkage Methodology based on Graph Interactions
    Boghdady, Mahmoud
    El-Tazi, Neamat
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2017, : 165 - 171
  • [9] A Proposition for Resilient Graph-Based Record Linkage Using Parallel Processing on Distributed Networks
    Jupin, Joseph
    Shi, Justin Y.
    2015 RESILIENCE WEEK (RSW), 2015, : 188 - 190
  • [10] Efficient Record Linkage Algorithms Using Complete Linkage Clustering
    Mamun, Abdullah-Al
    Aseltine, Robert
    Rajasekaran, Sanguthevar
    PLOS ONE, 2016, 11 (04):