Medium-coverage DNA sequencing in the design of the genetic association study

被引:1
|
作者
Xu, Chao [1 ,2 ]
Zhang, Ruiyuan [1 ]
Shen, Hui [1 ]
Deng, Hong-Wen [1 ,3 ]
机构
[1] Tulane Univ, Ctr Bioinformat & Genom, Sch Publ Hlth & Trop Med, Dept Biostat & Data Sci, New Orleans, LA 70112 USA
[2] Univ Oklahoma, Dept Biostat & Epidemiol, Hlth Sci Ctr, Oklahoma City, OK 73104 USA
[3] Cent South Univ, Sch Basic Med Sci, Changsha 410013, Peoples R China
基金
美国国家卫生研究院;
关键词
GENOME; IMPUTATION; POWER;
D O I
10.1038/s41431-020-0656-2
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
DNA sequencing is a widely used tool in genetic association study. Sequencing cost remains a major concern in sequencing-based study, although the application of next generation sequencing has dramatically decreased the sequencing cost and increased the efficiency. The choice of sequencing depth and the sequencing sample size will largely determine the final study investment and performance. Many studies have been conducted to find a cost-effective design of sequencing depth that can achieve certain sequencing accuracy using minimal sequencing cost. The strategies previously studied can be classified into two groups: (1) single-stage to sequence all the samples using either high (>similar to 30x) or low (<similar to 10x) sequencing depth; and (2) two-stage to sequence an affordable number of individuals at a high-coverage followed by a large sample of low-coverage sequencing. However, limited studies examined the performance of the medium-coverage (10-30x) sequencing depth for a genetic association study, where the optimum sequencing depth may exist. In this study, using a published simulation framework, we comprehensively compared the medium-coverage sequencing (MCS) to the single- and two-stage high/low-coverage sequencing in terms of the power and type I error of the variant discovery and association testing. We found, given certain sequencing effort, MCS yielded a comparable discovery power and better type I error control compared with the best (highest power) scenarios using other high- and low-coverage single-stage or two-stage designs. However, MCS was not as competent as other designs with respect to the association power, especially for the rare variants and when the sequencing investment was limited.
引用
收藏
页码:1459 / 1466
页数:8
相关论文
共 50 条
  • [1] Medium-coverage DNA sequencing in the design of the genetic association study
    Chao Xu
    Ruiyuan Zhang
    Hui Shen
    Hong-Wen Deng
    European Journal of Human Genetics, 2020, 28 : 1459 - 1466
  • [2] Low-, high-coverage, and two-stage DNA sequencing in the design of the genetic association study
    Xu, Chao
    Wu, Kehao
    Zhang, Ji-Gang
    Shen, Hui
    Deng, Hong-Wen
    GENETIC EPIDEMIOLOGY, 2017, 41 (03) : 187 - 197
  • [3] AbCD: arbitrary coverage design for sequencing-based genetic studies
    Kang, Jian
    Huang, Kuan-Chieh
    Xu, Zheng
    Wang, Yunfei
    Abecasis, Goncalo R.
    Li, Yun
    BIOINFORMATICS, 2013, 29 (06) : 799 - 801
  • [4] Low-coverage sequencing: Implications for design of complex trait association studies
    Li, Yun
    Sidore, Carlo
    Kang, Hyun Min
    Boehnke, Michael
    Abecasis, Goncalo R.
    GENOME RESEARCH, 2011, 21 (06) : 940 - 951
  • [5] Aspects of coverage in medical DNA sequencing
    Michael C Wendl
    Richard K Wilson
    BMC Bioinformatics, 9
  • [6] Aspects of coverage in medical DNA sequencing
    Wendl, Michael C.
    Wilson, Richard K.
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [7] A New Method for Genetic Region Association Testing with Massively Different Sequencing Depths of Coverage
    Hendricks, Audrey E.
    Billups, Stephen
    Zeggini, Eleftheria
    Barroso, Ines
    Santorico, Stephanie A.
    Dupuis, Josee
    GENETIC EPIDEMIOLOGY, 2016, 40 (07) : 615 - 615
  • [8] A general coverage theory for shotgun DNA sequencing
    Wendl, Michael C.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (06) : 1177 - 1196
  • [9] Detection of large sequence insertions by a hybrid approach that combine de novo assembly and resequencing of medium-coverage genome sequences
    Lee, Taeyoung
    Kim, Moon Young
    Ha, Jungmin
    Lee, Suk-Ha
    GENOME, 2018, 61 (10) : 745 - 754
  • [10] Statistic Analysis for Genetic Epidemiology Study After DNA Sequencing
    Li, Hongbin
    He, Guangzhong
    Zhang, Meile
    Wang, Xulan
    Guo, Qiuting
    Fan, Yonggang
    INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING BIOMEDICAL ENGINEERING, AND INFORMATICS (SPBEI 2013), 2014, : 536 - 542