Two-Phase Stratified Sampling Designs for Regional Sequencing

Cited by: 5
Authors
Chen, Zhijian [1]
Craiu, Radu V. [2]
Bull, Shelley B. [1,3]
Affiliations
[1] Mt Sinai Hosp, Samuel Lunenfeld Res Inst, Toronto, ON M5T 3L9, Canada
[2] Univ Toronto, Dept Stat, Toronto, ON, Canada
[3] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
Funding
Canada Foundation for Innovation; Canadian Institutes of Health Research
Keywords
fine-mapping; genetic association studies; two-phase design; optimal allocation; quantitative trait; quantitative trait loci; genetic association; replication; efficient
DOI
10.1002/gepi.21624
Chinese Library Classification number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
By systematic examination of common tag single-nucleotide polymorphisms (SNPs) across the genome, the genome-wide association study (GWAS) has proven to be a successful approach to identify genetic variants that are associated with complex diseases and traits. Although the per base pair cost of sequencing has dropped dramatically with the advent of the next-generation technologies, it may still only be feasible to obtain DNA sequence data for a portion of available study subjects due to financial constraints. Two-phase sampling designs have been used frequently in large-scale surveys and epidemiological studies where certain variables are too costly to be measured on all subjects. We consider two-phase stratified sampling designs for genetic association, in which tag SNPs for candidate genes or regions are genotyped on all subjects in phase 1, and a proportion of subjects are selected into phase 2 based on genotypes at one or more tag SNPs. Deep sequencing in the region is then applied to genotype phase 2 subjects at sequence SNPs. We investigate alternative sampling designs for selection of phase 2 subjects within strata defined by tag SNP genotypes and develop methods of inference for sequence SNP variant associations using data from both phases. In comparison to methods that use data from phase 2 alone, the combined analysis improves efficiency. Genet. Epidemiol. 36:320-332, 2012. (c) 2012 Wiley Periodicals, Inc.
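The phase 2 selection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' method or code: the function names (`stratify`, `allocate`, `select_phase2`), the two allocation schemes ("proportional" and "balanced"), the simulated genotype frequencies, and the sample sizes are all assumptions made for illustration. The sketch covers only stratified selection on a single tag SNP and the resulting inverse sampling weights; it does not implement the paper's optimal allocation or its combined two-phase inference.

```python
import random
from collections import defaultdict


def stratify(genotypes):
    """Group subject indices by tag SNP genotype (0, 1, or 2 minor alleles)."""
    strata = defaultdict(list)
    for idx, g in enumerate(genotypes):
        strata[g].append(idx)
    return dict(strata)


def allocate(strata, n_phase2, scheme="proportional"):
    """Return how many phase 2 subjects to draw from each stratum.

    Both schemes are illustrative assumptions, not the paper's designs.
    """
    total = sum(len(members) for members in strata.values())
    if scheme == "proportional":
        raw = {g: n_phase2 * len(m) / total for g, m in strata.items()}
    elif scheme == "balanced":
        raw = {g: n_phase2 / len(strata) for g in strata}
    else:
        raise ValueError(f"unknown scheme: {scheme}")

    # Round down and never ask for more subjects than a stratum contains.
    alloc = {g: min(len(strata[g]), int(raw[g])) for g in strata}

    # Hand any leftover slots to strata that still have unsampled subjects.
    remaining = n_phase2 - sum(alloc.values())
    while remaining > 0:
        open_strata = [g for g in strata if alloc[g] < len(strata[g])]
        if not open_strata:
            break  # every phase 1 subject is already selected
        open_strata.sort(key=lambda g: len(strata[g]) - alloc[g], reverse=True)
        for g in open_strata:
            if remaining == 0:
                break
            alloc[g] += 1
            remaining -= 1
    return alloc


def select_phase2(strata, alloc, rng):
    """Sample without replacement within strata; map subject -> inverse sampling weight."""
    selected = {}
    for g, members in strata.items():
        k = alloc[g]
        if k == 0:
            continue
        weight = len(members) / k  # 1 / within-stratum sampling fraction
        for idx in rng.sample(members, k):
            selected[idx] = weight
    return selected


# Simulated phase 1 data: 1000 subjects, one tag SNP with assumed
# genotype frequencies 0.64 / 0.32 / 0.04.
rng = random.Random(0)
genotypes = rng.choices([0, 1, 2], weights=[0.64, 0.32, 0.04], k=1000)

strata = stratify(genotypes)
alloc = allocate(strata, n_phase2=150, scheme="balanced")
phase2 = select_phase2(strata, alloc, rng)

for g in sorted(strata):
    print(f"genotype {g}: phase 1 n={len(strata[g])}, phase 2 n={alloc[g]}")
```

Under the balanced scheme the rare-homozygote stratum is oversampled relative to its phase 1 frequency, which is why each selected subject carries a weight equal to the inverse of its within-stratum sampling fraction.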
Pages: 320-332
Page count: 13