A framework for understanding selection bias in real-world healthcare data

被引:1
|
作者
Kundu, Ritoban [1 ]
Shi, Xu [1 ]
Morrison, Jean [1 ]
Barrett, Jessica [2 ]
Mukherjee, Bhramar [3 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] Univ Cambridge, MRC, Biostat Unit, Cambridge, England
[3] Univ Michigan, Dept Biostat & Epidemiol, 1415 Washington Hts,SPH 1, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
calibration; directed acyclic graphs; inverse probability weighting; Michigan Genomics Initiative; nonprobability sample; poststratification; GENOME-WIDE ASSOCIATION; CALIBRATION ESTIMATORS; RECORDS; IMPUTATION; REGRESSION; MODELS;
D O I
10.1093/jrsssa/qnae039
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Using administrative patient-care data such as Electronic Health Records (EHR) and medical/pharmaceutical claims for population-based scientific research has become increasingly common. With vast sample sizes leading to very small standard errors, researchers need to pay more attention to potential biases in the estimates of association parameters of interest, specifically to biases that do not diminish with increasing sample size. Of these multiple sources of biases, in this paper, we focus on understanding selection bias. We present an analytic framework using directed acyclic graphs for guiding applied researchers to dissect how different sources of selection bias may affect estimates of the association between a binary outcome and an exposure (continuous or categorical) of interest. We consider four easy-to-implement weighting approaches to reduce selection bias with accompanying variance formulae. We demonstrate through a simulation study when they can rescue us in practice with analysis of real-world data. We compare these methods using a data example where our goal is to estimate the well-known association of cancer and biological sex, using EHR from a longitudinal biorepository at the University of Michigan Healthcare system. We provide annotated R codes to implement these weighted methods with associated inference.
引用
收藏
页码:606 / 635
页数:30
相关论文
共 50 条
  • [1] Real-World Data and Real-World Evidence in Healthcare in the United States and Europe Union
    Zou, Kelly H.
    Berger, Marc L.
    [J]. BIOENGINEERING-BASEL, 2024, 11 (08):
  • [2] A DIAGNOSTIC FRAMEWORK TO EVALUATE REAL-WORLD DATA SOURCES FOR REAL-WORLD EVIDENCE GENERATION
    Denysyk, L.
    Doyle, J.
    Sood, R.
    [J]. VALUE IN HEALTH, 2018, 21 : S89 - S89
  • [3] Addressing Posttreatment Selection Bias in Comparative Effectiveness Research, Using Real-World Data and Simulation
    Belviso, Nicholas
    Zhang, Yichi
    Aronow, Herbert D.
    Wyss, Richard
    Barbour, Marilyn
    Kogut, Stephen
    Lawal, Oluwadolapo D.
    Zhan, Si Y.
    Don, Prabhani Kuruppumullage
    Wen, Xuerong
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2022, 191 (02) : 331 - 340
  • [4] Real-World Battles with Real-World Data
    Brown, Jeffrey
    Bate, Andrew
    Platt, Robert
    Raebel, Marsha
    Sauer, Brian
    Trifiro, Gianluca
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2017, 26 : 254 - 255
  • [5] METRICS AND MAPPINGS - A FRAMEWORK FOR UNDERSTANDING REAL-WORLD QUANTITATIVE ESTIMATION
    BROWN, NR
    SIEGLER, RS
    [J]. PSYCHOLOGICAL REVIEW, 1993, 100 (03) : 511 - 534
  • [6] Disability theorising and real-world educational practice: a framework for understanding
    Gable, Alison S.
    [J]. DISABILITY & SOCIETY, 2014, 29 (01) : 86 - 100
  • [7] Real-world study: from real-world data to real-world evidence
    Wen, Yi
    [J]. TRANSLATIONAL BREAST CANCER RESEARCH, 2020, 1
  • [8] Pharmacogenetics of warfarin and healthcare costs - Real-world data analysis
    Lahteenmaki, Jaakko
    Vuorinen, Anna-Leena
    Lehto, Mika
    Niemi, Mikko
    Forsberg, Markus M.
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2023, 32 (03) : 382 - 386
  • [9] Real-World Data for Healthcare Research in China: Call for Actions
    Xie, Jipan
    Wu, Eric Q.
    Wang, Shan
    Cheng, Tao
    Zhou, Zhou
    Zhong, Jia
    Liu, Larry
    [J]. VALUE IN HEALTH REGIONAL ISSUES, 2022, 27 : 72 - 81
  • [10] ASSESSING THE QUALITY OF REAL-WORLD DATA AND REAL-WORLD EVIDENCE IN ONCOLOGY RESEARCH: A COHESIVE FRAMEWORK FOR RESEARCHERS
    Su, Z.
    Dye, J.
    Wilson, T.
    Amirian, E. S.
    O'Sullivan, A.
    [J]. VALUE IN HEALTH, 2023, 26 (06) : S377 - S377