Sequence verification of synthetic DNA by assembly of sequencing reads

Cited by: 10
Authors
Wilson, Mandy L. [1]
Cai, Yizhi [1]
Hanlon, Regina [1]
Taylor, Samantha [1]
Chevreux, Bastien [2]
Setubal, Joao C. [1]
Tyler, Brett M. [3]
Peccoud, Jean [1,4]
Affiliations
[1] Virginia Tech, Virginia Bioinformat Inst, Blacksburg, VA 24061 USA
[2] DSM Nutr Prod Ltd, Dept Human Nutr & Hlth, CH-4002 Basel, Switzerland
[3] Oregon State Univ, Ctr Genome Res & Biocomput, Corvallis, OR 97331 USA
[4] MC 0193 Virginia Tech, ICTAS Ctr Syst Biol Engn Tissues, Blacksburg, VA 24061 USA
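Funding
US National Science Foundation; US National Institute of Food and Agriculture
Keywords
GENE SYNTHESIS; GENOME; BIOLOGY; BIOINFORMATICS; RETRIEVAL; ALIGNMENT
DOI
10.1093/nar/gks908
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract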
Gene synthesis attempts to assemble user-defined DNA sequences with base-level precision. Verifying the sequences of construction intermediates and the final product of a gene synthesis project is a critical part of the workflow, yet one that has received the least attention. Sequence validation is equally important for other kinds of curated clone collections. Ensuring that the physical sequence of a clone matches its published sequence is a common quality control step performed at least once over the course of a research project. GenoREAD is a web-based application that breaks the sequence verification process into two steps: the assembly of sequencing reads and the alignment of the resulting contig with a reference sequence. GenoREAD can determine if a clone matches its reference sequence. Its sophisticated reporting features help identify and troubleshoot problems that arise during the sequence verification process. GenoREAD has been experimentally validated on thousands of gene-sized constructs from an ORFeome project, and on longer sequences including whole plasmids and synthetic chromosomes. Comparing GenoREAD results with those from manual analysis of the sequencing data demonstrates that GenoREAD tends to be conservative in its diagnostic. GenoREAD is available at www.genoread.org.
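The second step named in the abstract, aligning an assembled contig with its reference sequence, can be illustrated with a minimal sketch. This is not GenoREAD's implementation, which the abstract does not describe. It is a generic Needleman-Wunsch global alignment in plain Python, and the function names (`global_align`, `differences`, `matches_reference`) and scoring values are assumptions chosen for illustration. The read assembly step is skipped: the contig is assumed to be already assembled.

```python
def global_align(ref, contig, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment (illustrative scoring values).

    Returns the two sequences padded with '-' so they have equal length.
    """
    n, m = len(ref), len(contig)

    def sub(i, j):
        # Score for pairing ref[i-1] with contig[j-1].
        return match if ref[i - 1] == contig[j - 1] else mismatch

    # Fill the dynamic programming table.
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # pair two bases
                score[i - 1][j] + gap,            # base missing from contig
                score[i][j - 1] + gap,            # extra base in contig
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    a, b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            a.append(ref[i - 1]); b.append(contig[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            a.append(ref[i - 1]); b.append("-"); i -= 1
        else:
            a.append("-"); b.append(contig[j - 1]); j -= 1
    return "".join(reversed(a)), "".join(reversed(b))


def differences(ref, contig):
    """List (reference_position, kind, ref_base, contig_base) for each discrepancy.

    Positions are 1-based reference coordinates; an insertion is reported at
    the position of the last reference base before it.
    """
    aligned_ref, aligned_contig = global_align(ref, contig)
    diffs = []
    pos = 0
    for x, y in zip(aligned_ref, aligned_contig):
        if x != "-":
            pos += 1
        if x == y:
            continue
        if x == "-":
            kind = "insertion"
        elif y == "-":
            kind = "deletion"
        else:
            kind = "substitution"
        diffs.append((pos, kind, x, y))
    return diffs


def matches_reference(ref, contig):
    """True when the contig is identical to the reference over its full length."""
    return not differences(ref, contig)
```

Calling `differences(reference, contig)` on a clone's contig returns an empty list when the clone matches its reference, and otherwise a list of discrepancies that can be checked against the trace data. A production tool would also need to handle base quality, coverage gaps, and reverse-complemented reads, none of which this sketch attempts.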
Pages: 11