An efficient validation method of probabilistic record linkage including readmissions and twins

Cited by: 69
Authors
Tromp, M. [1, 3]
Ravelli, A. C. J. [1]
Meray, N. [1]
Reitsma, J. B. [2]
Bonsel, G. J. [3]
Affiliations
[1] Univ Amsterdam, Acad Med Ctr, Dept Med Informat, NL-1105 AZ Amsterdam, Netherlands
[2] Univ Amsterdam, Acad Med Ctr, Dept Clin Epidemiol Biostat & Bioinformat, NL-1105 AZ Amsterdam, Netherlands
[3] Univ Amsterdam, Acad Med Ctr, Dept Publ Hlth Epidemiol, NL-1105 AZ Amsterdam, Netherlands
Keywords
epidemiologic methods; medical record linkage; pediatrics; registries; validation study;
DOI
10.3414/ME0489
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Objective: To describe an efficient, generalizable approach to validate probabilistic record linkage results, in particular by a model-guided detection of linking errors, and to apply this approach to validate linkage of admissions of newborns. Methods: Our double-blind validation procedure consisted of three steps: sample selection, data collection and data analysis. The linked Dutch national newborn admission registry contained 30,082 records for 2001 including readmissions (7.4%) and twins (9.7%). A highly informative sample was selected from the linked file by oversampling uncertain links based on model-derived linking weight. Four hundred and eight fax forms with minimal registry information (admissions of 191 children) were sent out to different pediatric units. The pediatricians were asked to create a short, detailed patient history from independent sources. The linkage status and additional record data were validated against this external information. Results: Response rate was 97% (395/408 faxes). Accuracy of the linkage of singleton admissions was high: except for some expected errors in the uncertain area (0.02% of record pairs), linkage was error-free. Validation of multiple birth readmissions showed 37% linkage errors due to low data quality of the multiple birth variables. The quality of the linked registry file was still high; only 1.7% of the children were from a multiple birth with multiple admissions, resulting in less than 1% linking error. Conclusions: Our external validation procedure of record linkage was feasible, efficient, and informative about identifying the source of the errors.
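The sample-selection step in the Methods (oversampling uncertain links based on linking weight) can be illustrated with a minimal sketch. This is not the authors' procedure: the weight scale, the band limits `lower` and `upper`, the stratum sizes, and all names (`LinkedPair`, `oversample_uncertain`) are assumptions made for illustration only.

```python
import random
from typing import NamedTuple


class LinkedPair(NamedTuple):
    pair_id: int
    weight: float  # model-derived linking weight (hypothetical scale)


def oversample_uncertain(pairs, lower, upper, n_uncertain, n_certain, seed=0):
    """Draw a validation sample that oversamples the uncertain weight band.

    Pairs with lower <= weight <= upper form the "uncertain" stratum; all
    other pairs form the "certain" stratum. Returns (pair, sampling_fraction)
    tuples so that results can be reweighted to the full linked file.
    """
    rng = random.Random(seed)
    uncertain = [p for p in pairs if lower <= p.weight <= upper]
    certain = [p for p in pairs if not (lower <= p.weight <= upper)]

    sample = []
    for stratum, n in ((uncertain, n_uncertain), (certain, n_certain)):
        k = min(n, len(stratum))  # never ask for more than the stratum holds
        fraction = k / len(stratum) if stratum else 0.0
        sample.extend((p, fraction) for p in rng.sample(stratum, k))
    return sample


# Toy data: eight linked pairs with made-up weights.
weights = [-12.0, -1.5, 0.3, 2.1, 15.0, 18.2, 20.5, 25.0]
pairs = [LinkedPair(i, w) for i, w in enumerate(weights)]

# Take every pair in the assumed uncertain band [-2, 3], but only 2 of the rest.
sample = oversample_uncertain(pairs, lower=-2.0, upper=3.0,
                              n_uncertain=3, n_certain=2)
```

Keeping the sampling fraction next to each sampled pair is a design choice of this sketch: because the uncertain band is deliberately overrepresented, error rates observed in the sample would need to be weighted by the inverse of that fraction before being stated for the whole linked file.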
Pages: 356-363
Number of pages: 8