Optimal allocation in stratified cluster-based outcome-dependent sampling designs

被引:3
|
作者
Sauer, Sara [1 ]
Hedt-Gauthier, Bethany [1 ,2 ]
Haneuse, Sebastien [1 ]
机构
[1] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Harvard Med Sch, Dept Global Hlth & Social Med, Boston, MA USA
基金
美国国家卫生研究院;
关键词
cluster-based sampling; generalized estimating equations; Health Management Information Systems; optimal allocation; outcome-dependent sampling; BINARY RESPONSE DATA; 2-PHASE;
D O I
10.1002/sim.9016
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In public health research, finite resources often require that decisions be made at the study design stage regarding which individuals to sample for detailed data collection. At the same time, when study units are naturally clustered, as patients are in clinics, it may be preferable to sample clusters rather than the study units, especially when the costs associated with travel between clusters are high. In this setting, aggregated data on the outcome and select covariates are sometimes routinely available through, for example, a country's Health Management Information System. If used wisely, this information can be used to guide decisions regarding which clusters to sample, and potentially obtain gains in efficiency over simple random sampling. In this article, we derive a series of formulas for optimal allocation of resources when a single-stage stratified cluster-based outcome-dependent sampling design is to be used and a marginal mean model is specified to answer the question of interest. Specifically, we consider two settings: (i) when a particular parameter in the mean model is of primary interest; and, (ii) when multiple parameters are of interest. We investigate the finite population performance of the optimal allocation framework through a comprehensive simulation study. Our results show that there are trade-offs that must be considered at the design stage: optimizing for one parameter yields efficiency gains over balanced and simple random sampling, while resulting in losses for the other parameters in the model. Optimizing for all parameters simultaneously yields smaller gains in efficiency, but mitigates the losses for the other parameters in the model.
引用
收藏
页码:4090 / 4107
页数:18
相关论文
共 50 条
  • [21] Adaptive Optimal Allocation in Stratified Sampling Methods
    Pierre Étoré
    Benjamin Jourdain
    [J]. Methodology and Computing in Applied Probability, 2010, 12 : 335 - 360
  • [22] Adaptive Optimal Allocation in Stratified Sampling Methods
    Etore, Pierre
    Jourdain, Benjamin
    [J]. METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2010, 12 (03) : 335 - 360
  • [23] Energy efficiency based optimal task allocation for cluster-based target tracing
    Zhang, Xihuang
    Jin, Chen
    [J]. DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 1223 - 1226
  • [24] Recent progresses in outcome-dependent sampling with failure time data
    Jieli Ding
    Tsui-Shan Lu
    Jianwen Cai
    Haibo Zhou
    [J]. Lifetime Data Analysis, 2017, 23 : 57 - 82
  • [25] Two-wave two-phase outcome-dependent sampling designs, with applications to longitudinal binary data
    Tao, Ran
    Mercaldo, Nathaniel D.
    Haneuse, Sebastien
    Maronge, Jacob M.
    Rathouz, Paul J.
    Heagerty, Patrick J.
    Schildcrout, Jonathan S.
    [J]. STATISTICS IN MEDICINE, 2021, 40 (08) : 1863 - 1876
  • [26] Semiparametric accelerated failure time modeling for multivariate failure times under multivariate outcome-dependent sampling designs
    Lu, Tsui-Shan
    Kang, Sangwook
    Zhou, Haibo
    [J]. STATISTICS AND ITS INTERFACE, 2020, 13 (03) : 373 - 383
  • [27] On kernel density estimation based on different stratified sampling with optimal allocation
    Samawi, Hani
    Chatterjee, Arpita
    Yin, JingJing
    Rochani, Haresh
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (22) : 10973 - 10990
  • [28] Recent progresses in outcome-dependent sampling with failure time data
    Ding, Jieli
    Lu, Tsui-Shan
    Cai, Jianwen
    Zhou, Haibo
    [J]. LIFETIME DATA ANALYSIS, 2017, 23 (01) : 57 - 82
  • [29] Improving decision tree performance through induction- and cluster-based stratified sampling
    Gill, AA
    Smith, GD
    Bagnall, AJ
    [J]. INTELLIGENT DAA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 339 - 344
  • [30] An estimated likelihood method for continuous outcome regression models with outcome-dependent sampling
    Weaver, MA
    Zhou, HB
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 459 - 469