Quality control of next-generation sequencing data without a reference

被引:51
|
作者
Trivedil, Urmi H. [1 ]
Cezard, Timothee [1 ]
Bridgett, Stephen [1 ]
Montazam, Anna [1 ]
Nichols, Jenna [1 ]
Blaxter, Mark [1 ,2 ]
Gharbi, Karim [1 ,2 ]
机构
[1] Univ Edinburgh, Edinburgh Genom, Ashworth Labs, Edinburgh EH9 3JT, Midlothian, Scotland
[2] Univ Edinburgh, Inst Evolutionary Biol, Ashworth Labs, Edinburgh EH9 3JT, Midlothian, Scotland
关键词
GENOME; ALIGNMENT;
D O I
10.3389/fgene.2014.00111
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Next-generation sequencing (NGS) technologies have dramatically expanded the breadth of genomics. Genome-scale data, once restricted to a small number of biomedical model organisms, can now be generated for virtually any species at remarkable speed and low cost. Yet non-model organisms often lack a suitable reference to map sequence reads against, making alignment-based quality control (QC) of NGS data more challenging than cases where a well-assembled genome is already available. Here we show that by generating a rapid, non-optimized draft assembly of raw reads, it is possible to obtain reliable and informative QC metrics, thus removing the need for a high quality reference. We use benchmark datasets generated from control samples across a range of genome sizes to illustrate that QC inferences made using draft assemblies are broadly equivalent to those made using a well-established reference, and describe QC tools routinely used in our production facility to assess the quality of NGS data from non-model organisms.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Pathway analysis with next-generation sequencing data
    Zhao, Jinying
    Zhu, Yun
    Boerwinkle, Eric
    Xiong, Momiao
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2015, 23 (04) : 507 - 515
  • [42] Genotyping microsatellites in next-generation sequencing data
    Dashnow, Harriet
    Tan, Susan
    Das, Debjani
    Easteal, Simon
    Oshlack, Alicia
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [43] Genotyping microsatellites in next-generation sequencing data
    Harriet Dashnow
    Susan Tan
    Debjani Das
    Simon Easteal
    Alicia Oshlack
    [J]. BMC Bioinformatics, 16
  • [44] Next-generation big data federation access control: A reference model
    Awaysheh, Feras M.
    Alazab, Mamoun
    Gupta, Maanak
    Pena, Tomas F.
    Cabaleiro, Jose C.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 108 : 726 - 741
  • [45] SAMStat 2: quality control for next generation sequencing data
    Lassmann, Timo
    [J]. BIOINFORMATICS, 2023, 39 (01)
  • [46] Next-generation sequencing for next-generation breeding, and more
    Tsai, Chung-Jui
    [J]. NEW PHYTOLOGIST, 2013, 198 (03) : 635 - 637
  • [47] ForestQC: Quality control on genetic variants from next-generation sequencing data using random forest
    Li, Jiajin
    Jew, Brandon
    Zhan, Lingyu
    Hwang, Sungoo
    Coppola, Giovanni
    Freimer, Nelson B.
    Sul, Jae Hoon
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (12)
  • [48] Next-Generation Sequencing Demands Next-Generation Phenotyping
    Hennekam, Raoul C. M.
    Biesecker, Leslie G.
    [J]. HUMAN MUTATION, 2012, 33 (05) : 884 - 886
  • [49] A Novel System Control for Quality Control of Diagnostic Tests Based on Next-Generation Sequencing
    Chan, Maurice
    Smirnov, Arseny
    Mulawadi, Fabianus
    Lim, Priscilla
    Lim, Wen-Huey
    Leong, See Ting
    Low, Hwee Meng
    Yee, Mei Qi
    Yeo, Yong Qiang
    Zhou, Xin
    Lee, Charlie
    Huang, Wen
    Welebob, Louis
    Wu, Mengchu
    [J]. JOURNAL OF APPLIED LABORATORY MEDICINE, 2016, 1 (01): : 25 - 35
  • [50] Next-Generation Sequencing
    Xiong, Momiao
    Zhao, Zhongming
    Arnold, Jonathan
    Yu, Fuli
    [J]. JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2010,