Logistic regression frequently outperformed propensity score methods especially for large datasets: a simulation study

被引:19
|
作者
Wilkinson, Jack D. [1 ]
Mamas, Mamas A. [2 ]
Kontopantelis, Evangelos [3 ]
机构
[1] Univ Manchester, Fac Biol, Ctr Biostat, Manchester Acad Hlth Sci Ctr, Rm 1-307 Jean McFarlane Bldg,Univ Pl,Oxford Rd, Manchester M13 9PL, England
[2] Keele Univ, Ctr Prognosis Res, Keele Cardiovasc Res Grp, Keele, England
[3] Univ Manchester, Div Informat Imaging & Data Sci, Manchester, England
基金
英国惠康基金;
关键词
Confounding; Propensity scores; Odds ratio; Marginal odds ratio; Regression standardization; Logistic regression; Simulation study; ADJUSTMENT;
D O I
10.1016/j.jclinepi.2022.09.009
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives: In observational studies, researchers must select a method to control for confounding. Options include propensity score (PS) methods and regression. It remains unclear how dataset characteristics (size, overlap in PSs, and exposure prevalence) influence the relative performance of the methods. Study Design and Setting: A simulation study to evaluate the role of dataset characteristics on the performance of PS methods, compared to logistic regression, for estimating a marginal odds ratio was conducted. Dataset size, overlap in PSs, and exposure prevalence were varied. Results: Regression showed poor coverage for small sample sizes, but with large sample sizes was relatively robust to imbalance in PSs and low exposure prevalence. PS methods displayed suboptimal coverage as overlap in PSs decreased, which was exacerbated at larger sample sizes. Power of matching methods was particularly affected by a lack of overlap, low exposure prevalence, and small sample size. The advantage of regression for large data size was reduced in sensitivity analysis with a complementary log -log outcome generation mechanism and unmeasured confounding, with superior bias and error but inferior coverage to matching methods. Conclusion: Dataset characteristics influence performance of methods for confounder adjustment. In many scenarios, regression may be the preferable option. (c) 2022 The Author(s). Published by Elsevier Inc.
引用
收藏
页码:176 / 184
页数:9
相关论文
共 50 条
  • [31] Logistic Regression With Multiple Random Effects: A Simulation Study of Estimation Methods and Statistical Packages REPLY
    Kim, Yoonsang
    Emery, Sherry
    AMERICAN STATISTICIAN, 2014, 68 (02): : 130 - 131
  • [32] Robust Estimators in Logistic Regression: A Comparative Simulation Study
    Ahmad, Sanizah
    Ramli, Norazan Mohamed
    Midi, Habshah
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2010, 9 (02) : 502 - 511
  • [33] Sample size determination for logistic regression: A simulation study
    School of Mathematical Sciences, University of Technology Sydney, Broadway, Australia
    Commun. Stat. Simul. Comput., 2 (360-373):
  • [34] To tune or not to tune, a case study of ridge logistic regression in small or sparse datasets
    Hana Šinkovec
    Georg Heinze
    Rok Blagus
    Angelika Geroldinger
    BMC Medical Research Methodology, 21
  • [35] Sample Size Determination for Logistic Regression: A Simulation Study
    Bush, Stephen
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2015, 44 (02) : 360 - 373
  • [36] Example of an over-fitting issue in a logistic regression estimating a high-dimensional propensity score
    Foch, Caroline
    Batech, Michael
    Gottwald-Hostalek, Ulrike
    Verpillat, Patrice
    Boutmy, Emmanuelle
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 383 - 384
  • [37] To tune or not to tune, a case study of ridge logistic regression in small or sparse datasets
    Sinkovec, Hana
    Heinze, Georg
    Blagus, Rok
    Geroldinger, Angelika
    BMC MEDICAL RESEARCH METHODOLOGY, 2021, 21 (01)
  • [38] The performance of different propensity score methods for estimating absolute effects of treatments on survival outcomes: A simulation study
    Austin, Peter C.
    Schuster, Tibor
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (05) : 2214 - 2237
  • [39] Abstract: Data Mining Alternatives to Logistic Regression for Propensity Score Estimation: Neural Networks and Support Vector Machines
    Keller, Bryan S. B.
    Kim, Jee-Seon
    Steiner, Peter M.
    MULTIVARIATE BEHAVIORAL RESEARCH, 2013, 48 (01) : 164 - 164
  • [40] Comparison of multivariable-adjusted logistic regression model with propensity score techniques using pharmacy claims data
    Khoza, Star
    Barner, Jamie C.
    Richards, Kristin M.
    JOURNAL OF PHARMACEUTICAL HEALTH SERVICES RESEARCH, 2011, 2 (04) : 233 - 242