The Role of Prediction Modeling in Propensity Score Estimation: An Evaluation of Logistic Regression, bCART, and the Covariate-Balancing Propensity Score

被引:77
|
作者
Wyss, Richard [1 ]
Ellis, Alan R. [2 ]
Brookhart, M. Alan [1 ]
Girman, Cynthia J. [1 ,3 ]
Funk, Michele Jonsson [1 ]
LoCasale, Robert [4 ]
Stuermer, Til [1 ]
机构
[1] Univ N Carolina, Gillings Sch Global Publ Hlth, Dept Epidemiol, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Cecil G Sheps Ctr Hlth Serv Res, Chapel Hill, NC 27599 USA
[3] Merck Sharp & Dohme Corp, Merck Res Labs, Ctr Observat & Real World Evidence, Data Analyt & Observat Methods, N Wales, PA USA
[4] Merck Sharp & Dohme Corp, Merck Res Labs, Dept Epidemiol, N Wales, PA USA
基金
美国医疗保健研究与质量局;
关键词
cardiovascular disease; covariate balance; diabetes; epidemiologic methods; propensity score; regression; simulation;
D O I
10.1093/aje/kwu181
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
The covariate-balancing propensity score (CBPS) extends logistic regression to simultaneously optimize covariate balance and treatment prediction. Although the CBPS has been shown to perform well in certain settings, its performance has not been evaluated in settings specific to pharmacoepidemiology and large database research. In this study, we use both simulations and empirical data to compare the performance of the CBPS with logistic regression and boosted classification and regression trees. We simulated various degrees of model misspecification to evaluate the robustness of each propensity score (PS) estimation method. We then applied these methods to compare the effect of initiating glucagonlike peptide-1 agonists versus sulfonylureas on cardiovascular events and all-cause mortality in the US Medicare population in 2007-2009. In simulations, the CBPS was generally more robust in terms of balancing covariates and reducing bias compared with misspecified logistic PS models and boosted classification and regression trees. All PS estimation methods performed similarly in the empirical example. For settings common to pharmacoepidemiology, logistic regression with balance checks to assess model specification is a valid method for PS estimation, but it can require refitting multiple models until covariate balance is achieved. The CBPS is a promising method to improve the robustness of PS models.
引用
收藏
页码:645 / 655
页数:11
相关论文
共 50 条
  • [21] HYBRID APPLICATIONS OF EXACT COVARIATE MATCHING AND PROPENSITY SCORE IN THE EVALUATION OF PATIENT PERSISTENCE
    Yang, G.
    Henderson, S. C.
    [J]. VALUE IN HEALTH, 2009, 12 (03) : A103 - A103
  • [22] Comparison of Propensity Score Methods and Covariate Adjustment Evaluation in 4 Cardiovascular Studies
    Elze, Markus C.
    Gregson, John
    Baber, Usman
    Williamson, Elizabeth
    Sartori, Samantha
    Mehran, Roxana
    Nichols, Melissa
    Stone, Gregg W.
    Pocock, Stuart J.
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2017, 69 (03) : 345 - 357
  • [23] Covariate balancing functional propensity score for functional treatments in cross-sectional observational studies
    Zhang, Xiaoke
    Xue, Wu
    Wang, Qiyue
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 163
  • [24] Misuse of Regression Adjustment for Additional Confounders Following Insufficient Propensity Score Balancing
    Shinozaki, Tomohiro
    Nojima, Masanori
    [J]. EPIDEMIOLOGY, 2019, 30 (04) : 541 - 548
  • [25] Abstract: Data Mining Alternatives to Logistic Regression for Propensity Score Estimation: Neural Networks and Support Vector Machines
    Keller, Bryan S. B.
    Kim, Jee-Seon
    Steiner, Peter M.
    [J]. MULTIVARIATE BEHAVIORAL RESEARCH, 2013, 48 (01) : 164 - 164
  • [26] Balancing properties: A need for the application of propensity score methods in estimation of treatment effects
    Urkaregi, Arantza
    Martinez-Indart, Lorea
    Pijoán, JoséIgnacio
    [J]. SORT, 2014, 38 (02): : 271 - 284
  • [27] Balancing properties: A need for the application of propensity score methods in estimation of treatment effects
    Urkaregi, Arantza
    Martinez-Indart, Lorea
    Ignacio Pijoan, Jose
    [J]. SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2014, 38 (02) : 271 - 284
  • [28] The Comparative Performance of Logistic Regression and Random Forest in Propensity Score Methods: a Simulation Study
    Ali, M. Sanni
    Khalid, Sara
    Collins, Gary S.
    Prieto-Alhambra, Daniel
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2017, 26 : 489 - 489
  • [29] Systematic differences in treatment effect estimates between propensity score methods and logistic regression
    Martens, Edwin P.
    Pestman, Wiebe R.
    de Boer, Anthonius
    Belitser, Svetlana V.
    Klungel, Olaf H.
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2008, 37 (05) : 1142 - 1147
  • [30] Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation
    Wyss, Richard
    Schneeweiss, Sebastian
    van der Laan, Mark
    Lendle, Samuel D.
    Ju, Cheng
    Franklin, Jessica M.
    [J]. EPIDEMIOLOGY, 2018, 29 (01) : 96 - 106