Logistic regression frequently outperformed propensity score methods especially for large datasets: a simulation study
被引:19
|
作者:
Wilkinson, Jack D.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Manchester, Fac Biol, Ctr Biostat, Manchester Acad Hlth Sci Ctr, Rm 1-307 Jean McFarlane Bldg,Univ Pl,Oxford Rd, Manchester M13 9PL, EnglandUniv Manchester, Fac Biol, Ctr Biostat, Manchester Acad Hlth Sci Ctr, Rm 1-307 Jean McFarlane Bldg,Univ Pl,Oxford Rd, Manchester M13 9PL, England
Wilkinson, Jack D.
[1
]
Mamas, Mamas A.
论文数: 0引用数: 0
h-index: 0
机构:
Keele Univ, Ctr Prognosis Res, Keele Cardiovasc Res Grp, Keele, EnglandUniv Manchester, Fac Biol, Ctr Biostat, Manchester Acad Hlth Sci Ctr, Rm 1-307 Jean McFarlane Bldg,Univ Pl,Oxford Rd, Manchester M13 9PL, England
Mamas, Mamas A.
[2
]
论文数: 引用数:
h-index:
机构:
Kontopantelis, Evangelos
[3
]
机构:
[1] Univ Manchester, Fac Biol, Ctr Biostat, Manchester Acad Hlth Sci Ctr, Rm 1-307 Jean McFarlane Bldg,Univ Pl,Oxford Rd, Manchester M13 9PL, England
[2] Keele Univ, Ctr Prognosis Res, Keele Cardiovasc Res Grp, Keele, England
[3] Univ Manchester, Div Informat Imaging & Data Sci, Manchester, England
Objectives: In observational studies, researchers must select a method to control for confounding. Options include propensity score (PS) methods and regression. It remains unclear how dataset characteristics (size, overlap in PSs, and exposure prevalence) influence the relative performance of the methods. Study Design and Setting: A simulation study to evaluate the role of dataset characteristics on the performance of PS methods, compared to logistic regression, for estimating a marginal odds ratio was conducted. Dataset size, overlap in PSs, and exposure prevalence were varied. Results: Regression showed poor coverage for small sample sizes, but with large sample sizes was relatively robust to imbalance in PSs and low exposure prevalence. PS methods displayed suboptimal coverage as overlap in PSs decreased, which was exacerbated at larger sample sizes. Power of matching methods was particularly affected by a lack of overlap, low exposure prevalence, and small sample size. The advantage of regression for large data size was reduced in sensitivity analysis with a complementary log -log outcome generation mechanism and unmeasured confounding, with superior bias and error but inferior coverage to matching methods. Conclusion: Dataset characteristics influence performance of methods for confounder adjustment. In many scenarios, regression may be the preferable option. (c) 2022 The Author(s). Published by Elsevier Inc.