Family-Based Association Studies for Next-Generation Sequencing

被引:32
|
作者
Zhu, Yun [1 ,2 ]
Xiong, Momiao [1 ,2 ]
机构
[1] Univ Texas Houston, Sch Publ Hlth, Ctr Human Genet, Houston, TX 77030 USA
[2] Univ Texas Houston, Sch Publ Hlth, Div Biostat, Houston, TX 77030 USA
基金
美国国家卫生研究院;
关键词
RARE VARIANTS; ARCHITECTURE; GENOMICS; GENES; RAP80;
D O I
10.1016/j.ajhg.2012.04.022
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
An individual's disease risk is determined by the compounded action of both common variants, inherited from remote ancestors, that segregated within the population and rare variants, inherited from recent ancestors, that segregated mainly within pedigrees. Next-generation sequencing (NGS) technologies generate high-dimensional data that allow a nearly complete evaluation of genetic variation. Despite their promise, NGS technologies also suffer from remarkable limitations: high error rates, enrichment of rare variants, and a large proportion of missing values, as well as the fact that most current analytical methods are designed for population-based association studies. To meet the analytical challenges raised by NGS, we propose a general framework for sequence-based association studies that can use various types of family and unrelated-individual data sampled from any population structure and a universal procedure that can transform any population-based association test statistic for use in family-based association tests. We develop family-based functional principal-component analysis (FPCA) with or without smoothing, a generalized T-2, combined multivariate and collapsing (CMC) method, and single-marker association test statistics. Through intensive simulations, we demonstrate that the family-based smoothed FPCA (SFPCA) has the correct type I error rates and much more power to detect association of (1) common variants, (2) rare variants, (3) both common and rare variants, and (4) variants with opposite directions of effect from other population-based or family-based association analysis methods. The proposed statistics are applied to two data sets with pedigree structures. The results show that the smoothed FPCA has a much smaller p value than other statistics.
引用
收藏
页码:1028 / 1045
页数:18
相关论文
共 50 条
  • [41] Next-generation DNA sequencing
    Shendure, Jay
    Ji, Hanlee
    [J]. NATURE BIOTECHNOLOGY, 2008, 26 (10) : 1135 - 1145
  • [42] Next-Generation Sequencing Challenges
    Baker S.C.
    [J]. 2017, Mary Ann Liebert Inc. (37): : 1and14 - 15
  • [43] Combinatorics and next-generation sequencing
    Nick Patterson
    Stacey Gabriel
    [J]. Nature Biotechnology, 2009, 27 : 826 - 827
  • [44] Next-Generation Sequencing Strategies
    Levy, Shawn E.
    Boone, Braden E.
    [J]. COLD SPRING HARBOR PERSPECTIVES IN MEDICINE, 2019, 9 (07):
  • [45] Next-Generation Sequencing Platforms
    Mardis, Elaine R.
    [J]. ANNUAL REVIEW OF ANALYTICAL CHEMISTRY, VOL 6, 2013, 6 : 287 - 303
  • [46] Next-generation sequencing and norovirus
    Cotten, Matthew
    Koopmans, Marion
    [J]. FUTURE VIROLOGY, 2016, 11 (11) : 719 - 722
  • [47] NEXT-GENERATION SEQUENCING, THE BASICS
    Corbett, Mark
    [J]. CLINICAL AND EXPERIMENTAL OPHTHALMOLOGY, 2011, 39 : 89 - 89
  • [48] Next-generation DNA sequencing
    Jay Shendure
    Hanlee Ji
    [J]. Nature Biotechnology, 2008, 26 : 1135 - 1145
  • [49] Democratizing Next-Generation Sequencing
    [J]. Genetic Engineering and Biotechnology News, 2022, 42 (07): : 24 - 25
  • [50] Next-Generation Sequencing and the Cytopathologist
    Dumur, Catherine I.
    Kraft, Adele O.
    [J]. CANCER CYTOPATHOLOGY, 2015, 123 (02) : 69 - 70