Optimal allocation in stratified cluster-based outcome-dependent sampling designs

被引:3
|
作者
Sauer, Sara [1 ]
Hedt-Gauthier, Bethany [1 ,2 ]
Haneuse, Sebastien [1 ]
机构
[1] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Harvard Med Sch, Dept Global Hlth & Social Med, Boston, MA USA
基金
美国国家卫生研究院;
关键词
cluster-based sampling; generalized estimating equations; Health Management Information Systems; optimal allocation; outcome-dependent sampling; BINARY RESPONSE DATA; 2-PHASE;
D O I
10.1002/sim.9016
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In public health research, finite resources often require that decisions be made at the study design stage regarding which individuals to sample for detailed data collection. At the same time, when study units are naturally clustered, as patients are in clinics, it may be preferable to sample clusters rather than the study units, especially when the costs associated with travel between clusters are high. In this setting, aggregated data on the outcome and select covariates are sometimes routinely available through, for example, a country's Health Management Information System. If used wisely, this information can be used to guide decisions regarding which clusters to sample, and potentially obtain gains in efficiency over simple random sampling. In this article, we derive a series of formulas for optimal allocation of resources when a single-stage stratified cluster-based outcome-dependent sampling design is to be used and a marginal mean model is specified to answer the question of interest. Specifically, we consider two settings: (i) when a particular parameter in the mean model is of primary interest; and, (ii) when multiple parameters are of interest. We investigate the finite population performance of the optimal allocation framework through a comprehensive simulation study. Our results show that there are trade-offs that must be considered at the design stage: optimizing for one parameter yields efficiency gains over balanced and simple random sampling, while resulting in losses for the other parameters in the model. Optimizing for all parameters simultaneously yields smaller gains in efficiency, but mitigates the losses for the other parameters in the model.
引用
收藏
页码:4090 / 4107
页数:18
相关论文
共 50 条
  • [1] Practical strategies for operationalizing optimal allocation in stratified cluster-based outcome-dependent sampling designs
    Sauer, Sara
    Hedt-Gauthier, Bethany
    Haneuse, Sebastien
    [J]. STATISTICS IN MEDICINE, 2023, 42 (07) : 917 - 935
  • [2] Optimal sampling allocation for outcome-dependent designs in cluster-correlated data settings
    Rivera-Rodriguez, Claudia
    Haneuse, Sebastien
    Sauer, Sara
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2022, 31 (12) : 2400 - 2414
  • [3] Likelihood-based analysis of outcome-dependent sampling designs with longitudinal data
    Zelnick, Leila R.
    Schildcrout, Jonathan S.
    Heagerty, Patrick J.
    [J]. STATISTICS IN MEDICINE, 2018, 37 (13) : 2120 - 2133
  • [4] Causal inference in outcome-dependent two-phase sampling designs
    Wang, Weiwei
    Scharfstein, Daniel
    Tan, Zhiqiang
    MacKenzie, Ellen J.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 : 947 - 969
  • [5] Selection Bias with Outcome-dependent Sampling
    Sjolander, Arvid
    [J]. EPIDEMIOLOGY, 2023, 34 (02) : 186 - 191
  • [6] Small-sample inference for cluster-based outcome-dependent sampling schemes in resource-limited settings: Investigating low birthweight in Rwanda
    Sauer, Sara
    Hedt-Gauthier, Bethany
    Rivera-Rodriguez, Claudia
    Haneuse, Sebastien
    [J]. BIOMETRICS, 2022, 78 (02) : 701 - 715
  • [7] Model misspecification and robust analysis for outcome-dependent sampling designs under generalized linear models
    Maronge, Jacob M.
    Schildcrout, Jonathan S.
    Rathouz, Paul J.
    [J]. STATISTICS IN MEDICINE, 2023, 42 (09) : 1338 - 1352
  • [8] Outcome-dependent sampling in cluster-correlated data settings with application to hospital profiling
    McGee, Glen
    Schildcrout, Jonathan
    Normand, Sharon-Lise
    Haneuse, Sebastien
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2020, 183 (01) : 379 - 402
  • [9] Outcome-dependent sampling in cluster-correlated data settings with application to hospital profiling
    McGee, Glen
    Schildcrout, Jonathan
    Normand, Sharon-Lise
    Haneuse, Sebastien
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2019, : 379 - 402
  • [10] Statistical inference methods and applications of outcome-dependent sampling designs under generalized linear models
    YAN Shu
    DING JieLi
    LIU YanYan
    [J]. Science China Mathematics, 2017, 60 (07) : 1219 - 1238