An efficient validation method of probabilistic record linkage including readmissions and twins

Cited by: 69
Authors
Tromp, M. [1, 3]
Ravelli, A. C. J. [1]
Meray, N. [1]
Reitsma, J. B. [2]
Bonsel, G. J. [3]
Affiliations
[1] Univ Amsterdam, Acad Med Ctr, Dept Med Informat, NL-1105 AZ Amsterdam, Netherlands
[2] Univ Amsterdam, Acad Med Ctr, Dept Clin Epidemiol Biostat & Bioinformat, NL-1105 AZ Amsterdam, Netherlands
[3] Univ Amsterdam, Acad Med Ctr, Dept Publ Hlth Epidemiol, NL-1105 AZ Amsterdam, Netherlands
Keywords
epidemiologic methods; medical record linkage; pediatrics; registries; validation study
DOI
10.3414/ME0489
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: To describe an efficient, generalizable approach to validating probabilistic record linkage results, in particular by model-guided detection of linking errors, and to apply this approach to validate the linkage of newborn admissions. Methods: Our double-blind validation procedure consisted of three steps: sample selection, data collection, and data analysis. The linked Dutch national newborn admission registry contained 30,082 records for 2001, including readmissions (7.4%) and twins (9.7%). A highly informative sample was selected from the linked file by oversampling uncertain links based on the model-derived linking weight. Four hundred and eight fax forms with minimal registry information (admissions of 191 children) were sent out to different pediatric units. The pediatricians were asked to create a short, detailed patient history from independent sources. The linkage status and additional record data were validated against this external information. Results: The response rate was 97% (395/408 faxes). Accuracy of the linkage of singleton admissions was high: except for some expected errors in the uncertain area (0.02% of record pairs), linkage was error-free. Validation of multiple birth readmissions showed 37% linkage errors due to low data quality of the multiple birth variables. The quality of the linked registry file was still high; only 1.7% of the children were from a multiple birth with multiple admissions, resulting in less than 1% linking error. Conclusions: Our external validation procedure for record linkage was feasible, efficient, and informative in identifying the source of the errors.
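The sample-selection step described in the abstract (oversampling uncertain links according to their linking weight) can be illustrated with a minimal sketch. This is not the authors' procedure: the function name `sample_links`, the thresholds `lower` and `upper` that bound the uncertain area, the sample sizes, and the synthetic weights are all assumptions made for illustration only.

```python
import random


def sample_links(pairs, lower, upper, n_uncertain, n_certain, seed=0):
    """Draw a validation sample that oversamples the uncertain weight band.

    pairs: list of (pair_id, weight) tuples, where weight stands in for a
    model-derived linking weight.
    lower, upper: hypothetical thresholds bounding the uncertain area.
    Returns (uncertain_sample, certain_sample).
    """
    rng = random.Random(seed)
    # Stratify record pairs by whether the weight falls in the uncertain band.
    uncertain = [p for p in pairs if lower <= p[1] <= upper]
    certain = [p for p in pairs if p[1] < lower or p[1] > upper]
    # Sample each stratum separately; the uncertain stratum gets a larger
    # quota relative to its size, which is the oversampling.
    picked_uncertain = rng.sample(uncertain, min(n_uncertain, len(uncertain)))
    picked_certain = rng.sample(certain, min(n_certain, len(certain)))
    return picked_uncertain, picked_certain


# Synthetic weights for illustration only; not registry data.
_rng = random.Random(42)
pairs = [(i, _rng.uniform(-20.0, 40.0)) for i in range(1000)]
uncertain_sample, certain_sample = sample_links(
    pairs, lower=5.0, upper=15.0, n_uncertain=50, n_certain=20
)
```

Because the strata are sampled at different rates, an error rate for the whole linked file estimated from such a sample would generally need to be weighted back by stratum size.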
Pages: 356-363
Page count: 8
Related papers
50 items in total
  • [1] Probabilistic record linkage
    Sayers, Adrian
    Ben-Shlomo, Yoav
    Blom, Ashley W.
    Steele, Fiona
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2016, 45 (03) : 954 - 964
  • [2] Probabilistic record linkage and a method to calculate the positive predictive value
    Blakely, T
    Salmond, C
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2002, 31 (06) : 1246 - 1252
  • [3] Validating distance-based record linkage with probabilistic record linkage
    Domingo-Ferrer, J
    Torra, V
    [J]. TOPICS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 2504 : 207 - 215
  • [4] Efficient Private Record Linkage
    Yakout, Mohamed
    Atallah, Mikhail J.
    Elmagarmid, Ahmed
    [J]. ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 1283 - 1286
  • [5] A METHOD OF RECORD LINKAGE
    OSHIMA, A
    SAKAGAMI, F
    HANAI, A
    FUJIMOTO, I
    [J]. ENVIRONMENTAL HEALTH PERSPECTIVES, 1979, 32 (OCT) : 221 - 230
  • [6] A study on the probabilistic record linkage and its application
    Choi, Yeonok
    Lee, Sangin
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2021, 34 (05) : 849 - 861
  • [7] A Probabilistic Record Linkage Model for Survival Data
    Hof, Michel H.
    Ravelli, Anita C.
    Zwinderman, Aeilko H.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (520) : 1504 - 1515
  • [8] Probabilistic Record Linkage for Disclosure Risk Assessment
    Shlomo, Natalie
    [J]. PRIVACY IN STATISTICAL DATABASES, PSD 2014, 2014, 8744 : 269 - 282
  • [9] Results from simulated data sets: probabilistic record linkage outperforms deterministic record linkage
    Tromp, Miranda
    Ravelli, Anita C.
    Bonsel, Gouke J.
    Hasman, Arie
    Reitsma, Johannes B.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2011, 64 (05) : 565 - 572
  • [10] An Introduction to Probabilistic Record Linkage with a Focus on Linkage Processing for WTC Registries
    Asher, Jana
    Resnick, Dean
    Brite, Jennifer
    Brackbill, Robert
    Cone, James
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (18) : 1 - 16