Informative presence bias in analyses of electronic health records-derived data: a cautionary note

被引:7
|
作者
Harton, Joanna [1 ]
Mitra, Nandita [1 ]
Hubbard, Rebecca A. [1 ]
机构
[1] Univ Penn, Dept Biostat Epidemiol & Informat, 423 Guardian Dr, Philadelphia, PA 19104 USA
基金
美国国家卫生研究院;
关键词
longitudinal data; survival analysis; electronic health records; misclassification; bias; LONGITUDINAL DATA SUBJECT; OBSERVATION TIMES; MODEL;
D O I
10.1093/jamia/ocac050
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective Electronic health record (EHR)-derived data are extensively used in health research. However, the pattern of patient interaction with the healthcare system can result in informative presence bias if those who have poorer health have more data recorded than healthier patients. We aimed to determine how informative presence affects bias across multiple scenarios informed by real-world healthcare utilization patterns. Materials and methods We conducted an analysis of EHR data from a pediatric healthcare system as well as simulation studies to characterize conditions under which informative presence bias is likely to occur. This analysis extends prior work by examining a variety of scenarios for the relationship between a biomarker and a health event of interest and the healthcare visit process. Results Using biomarker values gathered at both informative and noninformative visits when estimating the effect of the biomarker on the event of interest resulted in minimal bias when the biomarker was relatively stable over time but produced substantial bias when the biomarker was more volatile. Adjusting analyses for the number of prior visits within a fixed look-back window was able to reduce but not eliminate this bias. Discussion These results suggest that bias may arise frequently in commonly encountered scenarios and may not be eliminated by adjusting for prior visit intensity. Conclusion Depending on the context, the estimated effect from analyses using data from all visits available may diverge from the true effect. Sensitivity analyses using only visits likely to be informative or noninformative based on visit type may aid in the assessment of the magnitude of potential bias.
引用
收藏
页码:1191 / 1199
页数:9
相关论文
共 50 条
  • [1] On the Nature of Informative Presence Bias in Analyses of Electronic Health Records
    McGee, Glen
    Haneuse, Sebastien
    Coull, Brent A.
    Weisskopf, Marc G.
    Rotem, Ran S.
    [J]. EPIDEMIOLOGY, 2022, 33 (01) : 105 - 113
  • [2] A Quantitative Bias Analysis Approach to Informative Presence Bias in Electronic Health Records
    Zhang, Hanxi
    Clark, Amy S.
    Hubbard, Rebecca A.
    [J]. EPIDEMIOLOGY, 2024, 35 (03) : 349 - 358
  • [3] Characterization of a novel oncology electronic health records-derived data linkage to commercial claims
    Ascha, Mustafa
    Yerram, Prakirthi
    Wang, Jocelyn
    Reiss, Samantha
    Royce, Trevor
    Brake, Sarah
    Calip, Gregory
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2022, 31 : 546 - 546
  • [4] Adapting electronic health records-derived phenotypes to claims data: Lessons learned in using limited clinical data for phenotyping
    Ostropolets, Anna
    Reich, Christian
    Ryan, Patrick
    Shang, Ning
    Hripcsak, George
    Weng, Chunhua
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 102
  • [5] Equity and bias in electronic health records data
    Boyd, Andrew D.
    Gonzalez-Guarda, Rosa
    Lawrence, Katharine
    Patil, Crystal L.
    Ezenwa, Miriam O.
    O'Brien, Emily C.
    Paek, Hyung
    Braciszewski, Jordan M.
    Adeyemi, Oluwaseun
    Cuthel, Allison M.
    Darby, Juanita E.
    Zigler, Christina K.
    Ho, P. Michael
    Faurot, Keturah R.
    Staman, Karen
    Leigh, Jonathan W.
    Dailey, Dana L.
    Cheville, Andrea
    Del Fiol, Guilherme
    Knisely, Mitchell R.
    Marsolo, Keith
    Richesson, Rachel L.
    Schlaeger, Judith M.
    [J]. CONTEMPORARY CLINICAL TRIALS, 2023, 130
  • [6] Bias correction models for electronic health records data in the presence of non-random sampling
    Kim, Jiyu
    Anthopolos, Rebecca
    Zhong, Judy
    [J]. BIOMETRICS, 2024, 80 (01)
  • [7] Accounting for misclassification in electronic health records-derived exposures using generalized linear finite mixture models
    Hubbard R.A.
    Johnson E.
    Chubak J.
    Wernli K.J.
    Kamineni A.
    Bogart A.
    Rutter C.M.
    [J]. Health Services and Outcomes Research Methodology, 2017, 17 (2) : 101 - 112
  • [8] Cautionary note:Electronic medical records,a potential disaster in the making?
    Bruce Rothschild
    [J]. World Journal of Rheumatology, 2013, (01) : 1 - 2
  • [9] A cautionary note on electronic cigarettes and vascular health
    Fetterman, Jessica L.
    Hamburg, Naomi M.
    [J]. VASCULAR MEDICINE, 2018, 23 (05) : 426 - 427
  • [10] How and when informative visit processes can bias inference when using electronic health records data for clinical research
    Goldstein, Benjamin A.
    Phelan, Matthew
    Pagidipati, Neha J.
    Peskoe, Sarah B.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2019, 26 (12) : 1609 - 1617