Sequence verification of synthetic DNA by assembly of sequencing reads

Cited by: 10
Authors
Wilson, Mandy L. [1]
Cai, Yizhi [1]
Hanlon, Regina [1]
Taylor, Samantha [1]
Chevreux, Bastien [2]
Setubal, Joao C. [1]
Tyler, Brett M. [3]
Peccoud, Jean [1, 4]
Affiliations
[1] Virginia Tech, Virginia Bioinformat Inst, Blacksburg, VA 24061 USA
[2] DSM Nutr Prod Ltd, Dept Human Nutr & Hlth, CH-4002 Basel, Switzerland
[3] Oregon State Univ, Ctr Genome Res & Biocomput, Corvallis, OR 97331 USA
[4] MC 0193 Virginia Tech, ICTAS Ctr Syst Biol Engn Tissues, Blacksburg, VA 24061 USA
Funding
US National Science Foundation; US National Institute of Food and Agriculture
Keywords
GENE SYNTHESIS; GENOME; BIOLOGY; BIOINFORMATICS; RETRIEVAL; ALIGNMENT;
DOI
10.1093/nar/gks908
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification code
071010; 081704
Abstract
Gene synthesis attempts to assemble user-defined DNA sequences with base-level precision. Verifying the sequences of construction intermediates and the final product of a gene synthesis project is a critical part of the workflow, yet one that has received the least attention. Sequence validation is equally important for other kinds of curated clone collections. Ensuring that the physical sequence of a clone matches its published sequence is a common quality control step performed at least once over the course of a research project. GenoREAD is a web-based application that breaks the sequence verification process into two steps: the assembly of sequencing reads and the alignment of the resulting contig with a reference sequence. GenoREAD can determine if a clone matches its reference sequence. Its sophisticated reporting features help identify and troubleshoot problems that arise during the sequence verification process. GenoREAD has been experimentally validated on thousands of gene-sized constructs from an ORFeome project, and on longer sequences including whole plasmids and synthetic chromosomes. Comparing GenoREAD results with those from manual analysis of the sequencing data demonstrates that GenoREAD tends to be conservative in its diagnostic. GenoREAD is available at www.genoread.org.
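The two-step idea described in the abstract (assemble the reads into a contig, then compare the contig with the reference) can be illustrated with a toy sketch. This is not GenoREAD's implementation, which the abstract does not describe. The function names `overlap`, `assemble` and `verify` are hypothetical, the assembly is a naive greedy merge on exact suffix/prefix overlaps, and Python's `difflib` stands in for a real sequence aligner. Reverse-complement reads, base-quality scores and sequencing errors within overlaps are all ignored.

```python
from difflib import SequenceMatcher


def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of `a` equal to a prefix of `b` (0 if shorter than min_len)."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def assemble(reads, min_len: int = 3):
    """Step 1 (toy): greedily merge the pair of sequences with the largest exact overlap."""
    contigs = list(dict.fromkeys(reads))  # drop duplicate reads, keep order
    while len(contigs) > 1:
        best_n, best_i, best_j = 0, None, None
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_n:
                        best_n, best_i, best_j = n, i, j
        if best_n == 0:
            break  # no remaining overlaps: the reads do not form a single contig
        merged = contigs[best_i] + contigs[best_j][best_n:]
        contigs = [c for k, c in enumerate(contigs) if k not in (best_i, best_j)]
        contigs.append(merged)
    return contigs


def verify(contig: str, reference: str):
    """Step 2 (toy): list the differences between reference and contig; empty list means a match."""
    matcher = SequenceMatcher(None, reference, contig, autojunk=False)
    return [
        (tag, reference[i1:i2], contig[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


if __name__ == "__main__":
    reference = "ATGGCGTACGTTAGCCTGA"  # made-up reference sequence
    reads = ["ATGGCGTACG", "GTACGTTAGC", "TTAGCCTGA"]  # made-up overlapping reads
    contigs = assemble(reads)
    print("contigs:", contigs)
    if len(contigs) == 1:
        print("differences:", verify(contigs[0], reference))
```

Keeping the two steps separate means a failed verification can be traced either to the assembly (the reads do not collapse into one contig) or to the comparison (a single contig exists but differs from the reference).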
Pages: 11
Related papers
50 in total
  • [21] Decoding long nanopore sequencing reads of natural DNA
    Laszlo, Andrew H.
    Derrington, Ian M.
    Ross, Brian C.
    Brinkerhoff, Henry
    Adey, Andrew
    Nova, Ian C.
    Craig, Jonathan M.
    Langford, Kyle W.
    Samson, Jenny Mae
    Daza, Riza
    Doering, Kenji
    Shendure, Jay
    Gundlach, Jens H.
    NATURE BIOTECHNOLOGY, 2014, 32 (08) : 829 - 833
  • [22] Continuous Embeddings of DNA Sequencing Reads and Application to Metagenomics
    Menegaux, Romain
    Vert, Jean-Philippe
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2019, 26 (06) : 509 - 518
  • [23] Decoding long nanopore sequencing reads of natural DNA
    Andrew H Laszlo
    Ian M Derrington
    Brian C Ross
    Henry Brinkerhoff
    Andrew Adey
    Ian C Nova
    Jonathan M Craig
    Kyle W Langford
    Jenny Mae Samson
    Riza Daza
    Kenji Doering
    Jay Shendure
    Jens H Gundlach
    Nature Biotechnology, 2014, 32 : 829 - 833
  • [24] Erratum: Sense from sequence reads: methods for alignment and assembly
    Paul Flicek
    Ewan Birney
    Nature Methods, 2010, 7 : 479 - 479
  • [25] Evaluation of CircRNA Sequence Assembly Methods Using Long Reads
    Zhang, Jingjing
    Hossain, Md. Tofazzal
    Liu, Weiguo
    Peng, Yin
    Pan, Yi
    Wei, Yanjie
    FRONTIERS IN GENETICS, 2022, 13
  • [26] High-Quality Draft Genome Sequence of Kibdelosporangium philippinense, Generated by Hybrid Assembly of Short and Long Sequencing Reads
    Fedorov, Eugenia A.
    Omeragic, Medina
    Shalygina, Kristina F.
    Farwell, Ashlyn C.
    MacLea, Kyle S.
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2022, 11 (04):
  • [27] Ultra-accurate microbial amplicon sequencing with synthetic long reads
    Callahan, Benjamin J.
    Grinevich, Dmitry
    Thakur, Siddhartha
    Balamotis, Michael A.
    Ben Yehezkel, Tuval
    MICROBIOME, 2021, 9 (01)
  • [28] Iterative Learning for Reference-Guided DNA Sequence Assembly From Short Reads: Algorithms and Limits of Performance
    Shen, Xiaohu
    Shamaiah, Manohar
    Vikalo, Haris
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (17) : 4425 - 4435
  • [29] Nanopore sequencing and assembly of a human genome with ultra-long reads
    Miten Jain
    Sergey Koren
    Karen H Miga
    Josh Quick
    Arthur C Rand
    Thomas A Sasani
    John R Tyson
    Andrew D Beggs
    Alexander T Dilthey
    Ian T Fiddes
    Sunir Malla
    Hannah Marriott
    Tom Nieto
    Justin O'Grady
    Hugh E Olsen
    Brent S Pedersen
    Arang Rhie
    Hollian Richardson
    Aaron R Quinlan
    Terrance P Snutch
    Louise Tee
    Benedict Paten
    Adam M Phillippy
    Jared T Simpson
    Nicholas J Loman
    Matthew Loose
    Nature Biotechnology, 2018, 36 : 338 - 345
  • [30] WHATSHAP: Weighted Haplotype Assembly for Future-Generation Sequencing Reads
    Patterson, Murray
    Marschall, Tobias
    Pisanti, Nadia
    Van Iersel, Leo
    Stougie, Leen
    Klau, Gunnar W.
    Schonhuth, Alexander
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2015, 22 (06) : 498 - 509