Genome sequence assembly: Algorithms and issues

被引:44
|
作者
Pop, M [1 ]
Salzberg, SL [1 ]
Shumway, M [1 ]
机构
[1] Inst Genom Res, Rockville, MD USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
D O I
10.1109/MC.2002.1016901
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Ultimately, genome sequencing seeks to provide an organism's complete DNA sequence. Automation of DNA sequencing allowed scientists to decode entire genomes and gave birth to genomics, the analytic and comparative study of genomes. Although genomes can include billions of nucleotides, the chemical reactions researchers use to decode the DNA are accurate for only about 600 to 700 nucleotides at a time. The DNA reads that sequencing produces must then be assembled into a complete picture of the genome. Errors and certain DNA characteristics complicate assembly. Resolving these problems entails an additional and costly finishing phase that involves extensive human intervention. Assembly programs can dramatically reduce this cost by taking into account additional information obtained during finishing. Algorithms that can assemble millions of DNA fragments into gene sequences underlie the current revolution in biotechnology, helping researchers build the growing database of complete genomes.
引用
收藏
页码:47 / +
页数:9
相关论文
共 50 条
  • [21] DNA sequence assembly algorithms based on clustering approaches
    Elloumi, M
    METMBS'00: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, VOLS I AND II, 2000, : 717 - 723
  • [22] Flexible and Efficient Algorithms for Abelian Matching in Genome Sequence
    Faro, Simone
    Pavone, Arianna
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2019, PT I, 2019, 11465 : 307 - 318
  • [23] The sequence and de novo assembly of Oxygymnocypris stewartii genome
    Hai-Ping Liu
    Shi-Jun Xiao
    Nan Wu
    Di Wang
    Yan-Chao Liu
    Chao-Wei Zhou
    Qi-Yong Liu
    Rui-Bin Yang
    Wen-Kai Jiang
    Qi-Qi Liang
    Chi Wangjiu
    Jun-Hua Zhang
    Xiao-Hui Gong
    Zhen-Bo Yuan
    Scientific Data, 6
  • [24] Limitations of next-generation genome sequence assembly
    Alkan, Can
    Sajjadian, Saba
    Eichler, Evan E.
    NATURE METHODS, 2011, 8 (01) : 61 - 65
  • [25] Long-read sequence assembly of the gorilla genome
    Gordon, David
    Huddleston, John
    Chaisson, Mark J. P.
    Hill, Christopher M.
    Kronenberg, Zev N.
    Munson, Katherine M.
    Malig, Maika
    Raja, Archana
    Fiddes, Ian
    Hillier, LaDeana W.
    Dunn, Christopher
    Baker, Carl
    Armstrong, Joel
    Diekhans, Mark
    Paten, Benedict
    Shendure, Jay
    Wilson, Richard K.
    Haussler, David
    Chin, Chen-Shan
    Eichler, Evan E.
    SCIENCE, 2016, 352 (6281)
  • [26] The sequence and de novo assembly of hog deer genome
    Wang, Wei
    Yan, Hui-Juan
    Chen, Shi-Yi
    Li, Zhen-Zhen
    Yi, Jun
    Niu, Li-Li
    Deng, Jia-Po
    Chen, Wei-Gang
    Pu, Yang
    Jia, Xianbo
    Qu, Yu
    Chen, Ang
    Zhong, Yan
    Yu, Xin-Ming
    Pang, Shuai
    Huang, Wan-Long
    Han, Yue
    Liu, Guang-Jian
    Yu, Jian-Qiu
    SCIENTIFIC DATA, 2019, 6 (1)
  • [27] Application of Perl in Genome DNA Sequence Assembly and Annotation
    Zhang, Shengli
    Li, Dongfang
    Xu, Guifang
    Shan, Changjuan
    Wu, Yingxia
    Wang, Chunhu
    2011 SECOND ETP/IITA CONFERENCE ON TELECOMMUNICATION AND INFORMATION (TEIN 2011), VOL 2, 2011, : 153 - 155
  • [28] The sequence and de novo assembly of hog deer genome
    Wei Wang
    Hui-Juan Yan
    Shi-Yi Chen
    Zhen-Zhen Li
    Jun Yi
    Li-Li Niu
    Jia-Po Deng
    Wei-Gang Chen
    Yang Pu
    Xianbo Jia
    Yu Qu
    Ang Chen
    Yan Zhong
    Xin-Ming Yu
    Shuai Pang
    Wan-Long Huang
    Yue Han
    Guang-Jian Liu
    Jian-Qiu Yu
    Scientific Data, 6
  • [29] Limitations of next-generation genome sequence assembly
    Alkan C.
    Sajjadian S.
    Eichler E.E.
    Nature Methods, 2011, 8 (1) : 61 - 65
  • [30] The sequence and de novo assembly of the wild yak genome
    Liu, Yanbin
    Luo, Jiayu
    Dou, Jiajia
    Yan, Biyao
    Ren, Qingmiao
    Tang, Bolin
    Wang, Kun
    Qiu, Qiang
    SCIENTIFIC DATA, 2020, 7 (01)