Robustness of prevalence estimates derived from misclassified data from administrative databases

被引:45
|
作者
Ladouceur, Martin
Rahme, Elham
Pineau, Christian A.
Joseph, Lawrence
机构
[1] Montreal Gen Hosp, Div Clin Epidemiol, Montreal, PQ H3A 1A1, Canada
[2] McGill Univ, Dept Epidemiol & Biostat, Montreal, PQ H3A 1A2, Canada
[3] Montreal Gen Hosp, Dept Med, Montreal, PQ H3G 1A4, Canada
[4] Montreal Gen Hosp, Div Rheumatol, Dept Med, Montreal, PQ H3G 1A4, Canada
关键词
administrative databases; Bayesian latent class models; diagnosis; misclassification; prevalence; robustness; sensitivity; specificity;
D O I
10.1111/j.1541-0420.2006.00665.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Because primary data collection can be expensive, researchers are increasingly using information collected in medical administrative databases for scientific purposes. This information, however, is typically collected for reasons other than research, and many such databases have been shown to contain substantial proportions of misclassification errors. For example, many administrative databases contain fields for patient diagnostic codes, but these are often missing or inaccurate, in part because physician reimbursement schemes depend on medical acts performed rather than any diagnosis. Errors in ascertaining which individuals have a given disease bias not only prevalence estimates, but also estimates of associations between the disease and other variables, such as medication rise. We attempt to estimate the prevalence of osteoarthritis (OA) among elderly Quebeckers using a government administrative database. We compare a naive estimate relying solely on the physician diagnoses of CA listed in the database to estimates from several different Bayesian latent class models which adjust for misclassified physician diagnostic codes via use of other available diagnostic clues. We find that the prevalence estimates vary widely, depending on the model used and assumptions made. We conclude that any inferences from these databases need to be interpreted with great caution, until further work estimating the reliability of database items is carried out.
引用
收藏
页码:272 / 279
页数:8
相关论文
共 50 条
  • [31] Enrollment factors and bias of disease prevalence estimates in administrative claims data
    Jensen, Elizabeth T.
    Cook, Suzanne F.
    Allen, Jeffery K.
    Logie, John
    Brookhart, Maurice Alan
    Kappelman, Michael D.
    Dellon, Evan S.
    ANNALS OF EPIDEMIOLOGY, 2015, 25 (07) : 519 - 525
  • [32] Prevalence of multimorbidity in medical inpatients Validation of estimates based on administrative data
    Schneider, Florian
    Kaplan, Vladimir
    Rodak, Roksana
    Battegay, Edouard
    Holzer, Barbara
    SWISS MEDICAL WEEKLY, 2012, 142
  • [33] EVAPOTRANSPIRATION ESTIMATES DERIVED FROM SUBSOIL SALINITY DATA
    MATTHIAS, AD
    HASSAN, HM
    HU, YQ
    WATSON, JE
    WARRICK, AW
    JOURNAL OF HYDROLOGY, 1986, 85 (3-4) : 209 - 223
  • [34] ESTIMATION OF TEST ERROR RATES, DISEASE PREVALENCE AND RELATIVE RISK FROM MISCLASSIFIED DATA - A REVIEW
    WALTER, SD
    IRWIG, LM
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 1988, 41 (09) : 923 - 937
  • [35] Evaluating Clinical Outcomes From Administrative Databases
    Weintraub, William S.
    Bellows, Brandon K.
    JACC-CARDIOVASCULAR INTERVENTIONS, 2020, 13 (15) : 1786 - 1788
  • [36] The burden of clostridium difficile infection: estimates of the incidence of CDI from U.S. Administrative databases
    Margaret A. Olsen
    Yinong Young-Xu
    Dustin Stwalley
    Ciarán P. Kelly
    Dale N. Gerding
    Mohammed J. Saeed
    Cedric Mahé
    Erik R. Dubberke
    BMC Infectious Diseases, 16
  • [37] Prevalence and incidence of hidradenitis suppurativa in the United States: Evidence from large administrative claims databases
    Sundaram, Murali
    Styles, Amy
    Guerin, Annie
    Aberki, Clemence
    Okun, Martin
    Hayes, Oscar
    Mulani, Parvez
    JOURNAL OF THE AMERICAN ACADEMY OF DERMATOLOGY, 2013, 68 (04) : AB58 - AB58
  • [38] Multimorbidity in Australia: Comparing estimates derived using administrative data sources and survey data
    Lujic, Sanja
    Simpson, Judy M.
    Zwar, Nicholas
    Hosseinzadeh, Hassan
    Jorm, Louisa
    PLOS ONE, 2017, 12 (08):
  • [39] The Analysis of Misclassified Ordinal Data from Designed Experiments
    Hamada, M. S.
    Ryan, K. J.
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2016, 32 (01) : 223 - 229
  • [40] Fetal alcohol syndrome in Alaska, 1977 through 1992. An administrative prevalence derived from multiple data sources
    Egeland, GM
    Perham-Hester, KA
    Gessner, BD
    Ingle, D
    Berner, JE
    Middaugh, JP
    AMERICAN JOURNAL OF PUBLIC HEALTH, 1998, 88 (05) : 781 - 786