Genome sequence assembly: Algorithms and issues

被引:44
|
作者
Pop, M [1 ]
Salzberg, SL [1 ]
Shumway, M [1 ]
机构
[1] Inst Genom Res, Rockville, MD USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
D O I
10.1109/MC.2002.1016901
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Ultimately, genome sequencing seeks to provide an organism's complete DNA sequence. Automation of DNA sequencing allowed scientists to decode entire genomes and gave birth to genomics, the analytic and comparative study of genomes. Although genomes can include billions of nucleotides, the chemical reactions researchers use to decode the DNA are accurate for only about 600 to 700 nucleotides at a time. The DNA reads that sequencing produces must then be assembled into a complete picture of the genome. Errors and certain DNA characteristics complicate assembly. Resolving these problems entails an additional and costly finishing phase that involves extensive human intervention. Assembly programs can dramatically reduce this cost by taking into account additional information obtained during finishing. Algorithms that can assemble millions of DNA fragments into gene sequences underlie the current revolution in biotechnology, helping researchers build the growing database of complete genomes.
引用
收藏
页码:47 / +
页数:9
相关论文
共 50 条
  • [41] Heterozygous genome assembly via binary classification of homologous sequence
    Bodily, Paul M.
    Fujimoto, M. Stanley
    Ortega, Cameron
    Okuda, Nozomu
    Price, Jared C.
    Clement, Mark J.
    Snell, Quinn
    BMC BIOINFORMATICS, 2015, 16
  • [42] PASQUAL: Parallel Techniques for Next Generation Genome Sequence Assembly
    Liu, Xing
    Pande, Pushkar R.
    Meyerhenke, Henning
    Bader, David A.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2013, 24 (05) : 977 - 986
  • [43] Heterozygous genome assembly via binary classification of homologous sequence
    Paul M Bodily
    M Stanley Fujimoto
    Cameron Ortega
    Nozomu Okuda
    Jared C Price
    Mark J Clement
    Quinn Snell
    BMC Bioinformatics, 16
  • [44] Assembly and annotation of whole-genome sequence of Fusarium equiseti
    Li, Xueping
    Xu, Shiyang
    Zhang, Jungao
    Li, Minquan
    GENOMICS, 2021, 113 (04) : 2870 - 2876
  • [45] The genome of Arachis hypogaea: Genetic linkage map will aid the whole genome sequence assembly
    Guo, B.
    Qin, H.
    Feng, S.
    Chen, C.
    Culbreath, A.
    Zhang, X.
    Holbrook, C.
    Ozias-Akins, P.
    Liang, X.
    PHYTOPATHOLOGY, 2011, 101 (06) : S66 - S67
  • [46] Assembly and Analysis of Unmapped Genome Sequence Reads Reveal Novel Sequence and Variation in Dogs
    Holden, Lindsay A.
    Arumilli, Meharji
    Hytonen, Marjo K.
    Hundi, Sruthi
    Salojarvi, Jarkko
    Brown, Kim H.
    Lohi, Hannes
    SCIENTIFIC REPORTS, 2018, 8
  • [47] Assembly and Analysis of Unmapped Genome Sequence Reads Reveal Novel Sequence and Variation in Dogs
    Lindsay A. Holden
    Meharji Arumilli
    Marjo K. Hytönen
    Sruthi Hundi
    Jarkko Salojärvi
    Kim H. Brown
    Hannes Lohi
    Scientific Reports, 8
  • [48] Comparative analysis of algorithms for whole-genome assembly of pyrosequencing data
    Finotello, Francesca
    Lavezzo, Enrico
    Fontana, Paolo
    Peruzzo, Denis
    Albiero, Alessandro
    Barzon, Luisa
    Falda, Marco
    Di Camillo, Barbara
    Toppo, Stefano
    BRIEFINGS IN BIOINFORMATICS, 2012, 13 (03) : 269 - 280
  • [49] Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence
    Joseph Cheung
    Xavier Estivill
    Razi Khaja
    Jeffrey R MacDonald
    Ken Lau
    Lap-Chee Tsui
    Stephen W Scherer
    Genome Biology, 4
  • [50] Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence
    Cheung, J
    Estivill, X
    Khaja, R
    MacDonald, JR
    Lau, K
    Tsui, LC
    Scherer, SW
    GENOME BIOLOGY, 2003, 4 (04)