Response-dependent two-phase sampling designs for biomarker studies

被引:15
|
作者
McIsaac, Michael A. [1 ]
Cook, Richard J. [2 ]
机构
[1] Queens Univ, Dept Publ Hlth Sci, Kingston, ON K7L 3N6, Canada
[2] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Asymptotic relative efficiency; augmented inverse probability weighted estimating functions; inverse probability weighting; maximum likelihood estimation; response-dependent sampling; two-phase design; SEMIPARAMETRIC REGRESSION-MODELS; SURVEY CALIBRATION ESTIMATORS; DOUBLY ROBUST ESTIMATION; LOGISTIC-REGRESSION; PSORIATIC-ARTHRITIS; MAXIMUM-LIKELIHOOD; MISSING DATA; CONNECTIONS;
D O I
10.1002/cjs.11207
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Two-phase sampling designs are developed and investigated for use in the context of a rheumatology study where interest lies in the association between a biomarker with an expensive assay and disease progression. We derive optimal phase-II stratum-specific sampling probabilities for analyses from parametric maximum likelihood (ML), mean score (MS), inverse probability weighted (IPW) and augmented inverse probability weighted estimating equations (AIPW). The easy-to-implement optimally efficient design for the MS estimator is found to be asymptotically optimal for the IPW and AIPW estimators we consider, and is shown to result in efficiency gains over balanced and simple random sampling even when analyses are likelihood-based. We further demonstrate the robustness of this optimal design and show that it results in very efficient estimation even when the model or parameters used in its derivation are misspecified. The Canadian Journal of Statistics 42: 268-284; 2014 (c) 2014 Statistical Society of Canada Resume Des plans d'echantillonnage a deux phases sont developpes et etudies dans le contexte d'une etude en rhumatologie dont l'interet principal porte sur l'association entre un biomarqueur dont la bioanalyse est dispendieuse et la progression de la maladie. Les auteurs calculent les probabilites d'echantillonnage optimales specifiques a chaque strate en phase II pour les equations d'estimation basees sur le maximum de vraisemblance (MV), le score moyen (SM), la ponderation par l'inverse des probabilites (PIP), et la PIP augmentee (PIPA). Facile a implementer et optimalement efficace, le plan pour l'estimateur SM est aussi asymptotiquement optimal pour les estimateurs PIP et PIPA consideres par les auteurs, en plus de presenter un gain d'efficacite par rapport aux plans bases sur un echantillon aleatoire simple ou balance lorsque l'estimation est effectuee par le MV. Les auteurs demontrent aussi la robustesse de ce plan optimal qui conduit a des estimateurs tres efficaces meme lorsque le modele ou ses parametres sont mal specifies. La revue canadienne de statistique 42: 268-284; 2014 (c) 2014 Societe statistique du Canada
引用
收藏
页码:268 / 284
页数:17
相关论文
共 50 条
  • [1] Evaluation of Designs and Estimation Methods Under Response-Dependent Two-Phase Sampling for Genetic Association Studies
    Brady Ryan
    Ananthika Nirmalkanna
    Candemir Cigsar
    Yildiz E. Yilmaz
    Statistics in Biosciences, 2023, 15 : 510 - 539
  • [2] Evaluation of Designs and Estimation Methods Under Response-Dependent Two-Phase Sampling for Genetic Association Studies
    Ryan, Brady
    Nirmalkanna, Ananthika
    Cigsar, Candemir
    Yilmaz, Yildiz E.
    STATISTICS IN BIOSCIENCES, 2023, 15 (02) : 510 - 539
  • [3] Adaptive response-dependent two-phase designs: Some results on robustness and efficiency
    Yang, Ce
    Diao, Liqun
    Cook, Richard J.
    STATISTICS IN MEDICINE, 2022, 41 (22) : 4403 - 4425
  • [4] Adaptive sampling in two-phase designs: a biomarker study for progression in arthritis
    McIsaac, Michael A.
    Cook, Richard J.
    STATISTICS IN MEDICINE, 2015, 34 (21) : 2899 - 2912
  • [5] Causal inference in outcome-dependent two-phase sampling designs
    Wang, Weiwei
    Scharfstein, Daniel
    Tan, Zhiqiang
    MacKenzie, Ellen J.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 : 947 - 969
  • [6] Optimality in two-phase sampling designs for post-GWAS studies
    Espin-Garcia, Osvaldo
    Craiu, Radu V.
    Bull, Shelley B.
    GENETIC EPIDEMIOLOGY, 2018, 42 (07) : 698 - 698
  • [7] Optimization of Two-Phase Sampling Designs With Application to Naturalistic Driving Studies
    Imberg, Henrik
    Lisovskaja, Vera
    Selpi
    Nerman, Olle
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (04) : 3575 - 3588
  • [8] Score tests for association under response-dependent sampling designs for expensive covariates
    Derkach, Andriy
    Lawless, Jerald F.
    Sun, Lei
    BIOMETRIKA, 2015, 102 (04) : 988 - 994
  • [9] A two-phase sampling scheme and πps designs
    Laitila, Thomas
    Olofsson, Jens
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2011, 141 (05) : 1646 - 1654
  • [10] Optimal Designs of Two-Phase Studies
    Tao, Ran
    Zeng, Donglin
    Lin, Dan-Yu
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1946 - 1959