Targeted Assembly of Short Sequence Reads

被引:25
|
作者
Warren, Rene L. [1 ]
Holt, Robert A. [1 ,2 ]
机构
[1] British Columbia Canc Agcy, Genome Sci Ctr, Vancouver, BC V5Z 4E6, Canada
[2] Simon Fraser Univ, Dept Mol Biol & Biochem, Burnaby, BC V5A 1S6, Canada
来源
PLOS ONE | 2011年 / 6卷 / 05期
关键词
SHORT DNA-SEQUENCES; GENOME; MAP;
D O I
10.1371/journal.pone.0019816
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
As next-generation sequence (NGS) production continues to increase, analysis is becoming a significant bottleneck. However, in situations where information is required only for specific sequence variants, it is not necessary to assemble or align whole genome data sets in their entirety. Rather, NGS data sets can be mined for the presence of sequence variants of interest by localized assembly, which is a faster, easier, and more accurate approach. We present TASR, a streamlined assembler that interrogates very large NGS data sets for the presence of specific variants by only considering reads within the sequence space of input target sequences provided by the user. The NGS data set is searched for reads with an exact match to all possible short words within the target sequence, and these reads are then assembled stringently to generate a consensus of the target and flanking sequence. Typically, variants of a particular locus are provided as different target sequences, and the presence of the variant in the data set being interrogated is revealed by a successful assembly outcome. However, TASR can also be used to find unknown sequences that flank a given target. We demonstrate that TASR has utility in finding or confirming genomic mutations, polymorphisms, fusions and integration events. Targeted assembly is a powerful method for interrogating large data sets for the presence of sequence variants of interest. TASR is a fast, flexible and easy to use tool for targeted assembly.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] De novo assembly of short sequence reads
    Paszkiewicz, Konrad
    Studholme, David J.
    BRIEFINGS IN BIOINFORMATICS, 2010, 11 (05) : 457 - 472
  • [2] Fragment assembly with short reads
    Chaisson, M
    Pevzner, P
    Tang, HX
    BIOINFORMATICS, 2004, 20 (13) : 2067 - 2074
  • [3] Parallel, tag-directed assembly of locally derived short sequence reads
    Hiatt J.B.
    Patwardhan R.P.
    Turner E.H.
    Lee C.
    Shendure J.
    Nature Methods, 2010, 7 (2) : 119 - 122
  • [4] Parallel, tag-directed assembly of locally derived short sequence reads
    Hiatt, Joseph B.
    Patwardhan, Rupali P.
    Turner, Emily H.
    Lee, Choli
    Shendure, Jay
    NATURE METHODS, 2010, 7 (02) : 119 - U47
  • [5] A new strategy for genome assembly using short sequence reads and reduced representation libraries
    Young, Andrew L.
    Abaan, Hatice Ozel
    Zerbino, Daniel
    Mullikin, James C.
    Birney, Ewan
    Margulies, Elliott H.
    GENOME RESEARCH, 2010, 20 (02) : 249 - 256
  • [6] Sequence assembly from corrupted shotgun reads
    Ganguly, Shirshendu
    Mossel, Elchanan
    Racz, Miklos Z.
    2016 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2016, : 265 - 269
  • [7] New approaches for metagenome assembly with short reads
    Ayling, Martin
    Clark, Matthew D.
    Leggett, Richard M.
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (02) : 584 - 594
  • [8] MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads
    Namiki, Toshiaki
    Hachiya, Tsuyoshi
    Tanaka, Hideaki
    Sakakibara, Yasubumi
    NUCLEIC ACIDS RESEARCH, 2012, 40 (20)
  • [9] Optimal spliced alignments of short sequence reads
    Fabio De Bona
    Stephan Ossowski
    Korbinian Schneeberger
    Gunnar Rätsch
    BMC Bioinformatics, 9
  • [10] Optimal spliced alignments of short sequence reads
    De Bona, Fabio
    Ossowski, Stephan
    Schneeberger, Korbinian
    Raetsch, Gunnar
    BIOINFORMATICS, 2008, 24 (16) : I174 - I180