The Role of Prediction Modeling in Propensity Score Estimation: An Evaluation of Logistic Regression, bCART, and the Covariate-Balancing Propensity Score

被引:77
|
作者
Wyss, Richard [1 ]
Ellis, Alan R. [2 ]
Brookhart, M. Alan [1 ]
Girman, Cynthia J. [1 ,3 ]
Funk, Michele Jonsson [1 ]
LoCasale, Robert [4 ]
Stuermer, Til [1 ]
机构
[1] Univ N Carolina, Gillings Sch Global Publ Hlth, Dept Epidemiol, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Cecil G Sheps Ctr Hlth Serv Res, Chapel Hill, NC 27599 USA
[3] Merck Sharp & Dohme Corp, Merck Res Labs, Ctr Observat & Real World Evidence, Data Analyt & Observat Methods, N Wales, PA USA
[4] Merck Sharp & Dohme Corp, Merck Res Labs, Dept Epidemiol, N Wales, PA USA
基金
美国医疗保健研究与质量局;
关键词
cardiovascular disease; covariate balance; diabetes; epidemiologic methods; propensity score; regression; simulation;
D O I
10.1093/aje/kwu181
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
The covariate-balancing propensity score (CBPS) extends logistic regression to simultaneously optimize covariate balance and treatment prediction. Although the CBPS has been shown to perform well in certain settings, its performance has not been evaluated in settings specific to pharmacoepidemiology and large database research. In this study, we use both simulations and empirical data to compare the performance of the CBPS with logistic regression and boosted classification and regression trees. We simulated various degrees of model misspecification to evaluate the robustness of each propensity score (PS) estimation method. We then applied these methods to compare the effect of initiating glucagonlike peptide-1 agonists versus sulfonylureas on cardiovascular events and all-cause mortality in the US Medicare population in 2007-2009. In simulations, the CBPS was generally more robust in terms of balancing covariates and reducing bias compared with misspecified logistic PS models and boosted classification and regression trees. All PS estimation methods performed similarly in the empirical example. For settings common to pharmacoepidemiology, logistic regression with balance checks to assess model specification is a valid method for PS estimation, but it can require refitting multiple models until covariate balance is achieved. The CBPS is a promising method to improve the robustness of PS models.
引用
收藏
页码:645 / 655
页数:11
相关论文
共 50 条