Deterministic Record Linkage versus Similarity Functions: a Study in Health Databases from Brazil

被引:2
|
作者
Firmino Suzuki, Katia Mitiko [1 ]
Porto Filho, Carlos Humberto
Cozin, Luis Fernando
Pereyra, Lucas Calabrez [2 ]
de Azevedo Marques, Paulo Mazzoncini [2 ]
机构
[1] Univ Sao Paulo, Sch Med Ribeirao Preto, Av Bandeirantes 3900,Campus USP, BR-14049900 Rib Preto, SP, Brazil
[2] HCFMRP, Sch Med Ribeirao Preto, Ctr Med, Rib Preto, SP, Brazil
关键词
Information systems; database record linkage; deterministic record linkage; similarity function;
D O I
10.3233/978-1-61499-289-9-562
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The record linkage is a strategy that allows linking different databases of information from patient records. Adopting the deterministic method and similarity functions (Dice, Jaro, Jaro-Winkler and Levenshtein) for the integration of heterogeneous databases aimed at different levels of health care Brazilian (primary, secondary and tertiary). The sensitivity of deterministic method was 54.5% (95% CI: 50.4 to 58.5). The best result obtained with the dissent of only one variable (mother's name) was 80.6% (95% CI: 77.2 to 83.6) and the best result obtained using the similarity function Jaro-Winkler was 91.8% (95% CI: 89.4 to 93.9). The deterministic method has high specificity but sensitivity can be reduced by the existence of spellings and typing errors in the databases. Thus, the step-by-step approach where there was disagreement in at least one of the relationship variable can increase the sensitivity of the method and the use of similarity functions.
引用
收藏
页码:562 / 566
页数:5
相关论文
共 50 条
  • [1] Assessing record linkage between health care and Vital Statistics databases using deterministic methods
    Li, Bing
    Quan, Hude
    Fong, Andrew
    Lu, Mingshan
    [J]. BMC HEALTH SERVICES RESEARCH, 2006, 6 (1)
  • [2] Assessing record linkage between health care and Vital Statistics databases using deterministic methods
    Bing Li
    Hude Quan
    Andrew Fong
    Mingshan Lu
    [J]. BMC Health Services Research, 6
  • [3] Learnable similarity functions and their applications to clustering and record linkage
    Bilenko, M
    [J]. PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 981 - 982
  • [4] Results from simulated data sets: probabilistic record linkage outperforms deterministic record linkage
    Tromp, Miranda
    Ravelli, Anita C.
    Bonsel, Gouke J.
    Hasman, Arie
    Reitsma, Johannes B.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2011, 64 (05) : 565 - 572
  • [5] Application and analysis of record linkage techniques to integrate Brazilian health databases
    Ferreira da Silva Barros, Maicon Herverton Lino
    da Silva Leite, Morgana Thalita
    Sampaio, Vanderson
    Lynn, Theo
    Endo, Patricia Takako
    [J]. 2020 INTERNATIONAL CONFERENCE ON CYBER SITUATIONAL AWARENESS, DATA ANALYTICS AND ASSESSMENT (CYBER SA 2020), 2020,
  • [6] Accuracy of probabilistic record linkage applied to health databases: systematic review
    da Silveira, Daniele Pinto
    Artmann, Elizabeth
    [J]. REVISTA DE SAUDE PUBLICA, 2009, 43 (05): : 875 - 882
  • [7] Validation of a Hierarchical Deterministic Record-Linkage Algorithm Using Data From 2 Different Cohorts of Human Immunodeficiency Virus-Infected Persons and Mortality Databases in Brazil
    Pacheco, Antonio G.
    Saraceni, Valeria
    Tuboi, Suely H.
    Moulton, Lawrence H.
    Chaisson, Richard E.
    Cavalcante, Solange C.
    Durovni, Betina
    Faulhaber, Jose C.
    Golub, Jonathan E.
    King, Bonnie
    Schechter, Mauro
    Harrison, Lee H.
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2008, 168 (11) : 1326 - 1332
  • [8] SELF-REPORTED MENTAL HEALTH VERSUS PSYCHOTROPIC MEDICATION RECORD AS A PREDICTOR OF SUICIDE: A RECORD LINKAGE STUDY
    Onyeka, I. N.
    Maguire, A.
    O'Reilly, D.
    [J]. JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 2020, 74 : A5 - A5
  • [9] Achievements and challenges for employing record linkage techniques in health research and evaluation in Brazil
    Coeli, Claudia Medina
    Pinheiro, Rejane Sobrino
    de Camargo, Kenneth Rochel, Jr.
    [J]. EPIDEMIOLOGIA E SERVICOS DE SAUDE, 2015, 24 (04): : 795 - 802
  • [10] Stroke and mental health care: a record linkage study
    Driessen, G
    Evers, S
    Verhey, F
    van Os, J
    [J]. SOCIAL PSYCHIATRY AND PSYCHIATRIC EPIDEMIOLOGY, 2001, 36 (12) : 608 - 612