Genome sequence assembly: Algorithms and issues

被引:44
|
作者
Pop, M [1 ]
Salzberg, SL [1 ]
Shumway, M [1 ]
机构
[1] Inst Genom Res, Rockville, MD USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
D O I
10.1109/MC.2002.1016901
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Ultimately, genome sequencing seeks to provide an organism's complete DNA sequence. Automation of DNA sequencing allowed scientists to decode entire genomes and gave birth to genomics, the analytic and comparative study of genomes. Although genomes can include billions of nucleotides, the chemical reactions researchers use to decode the DNA are accurate for only about 600 to 700 nucleotides at a time. The DNA reads that sequencing produces must then be assembled into a complete picture of the genome. Errors and certain DNA characteristics complicate assembly. Resolving these problems entails an additional and costly finishing phase that involves extensive human intervention. Assembly programs can dramatically reduce this cost by taking into account additional information obtained during finishing. Algorithms that can assemble millions of DNA fragments into gene sequences underlie the current revolution in biotechnology, helping researchers build the growing database of complete genomes.
引用
收藏
页码:47 / +
页数:9
相关论文
共 50 条
  • [31] A physical, genetic and functional sequence assembly of the barley genome
    Mayer, Klaus F. X.
    Waugh, Robbie
    Langridge, Peter
    Close, Timothy J.
    Wise, Roger P.
    Graner, Andreas
    Matsumoto, Takashi
    Sato, Kazuhiro
    Schulman, Alan
    Muehlbauer, Gary J.
    Stein, Nils
    Ariyadasa, Ruvini
    Schulte, Daniela
    Poursarebani, Naser
    Zhou, Ruonan
    Steuernagel, Burkhard
    Mascher, Martin
    Scholz, Uwe
    Shi, Bujun
    Langridge, Peter
    Madishetty, Kavitha
    Svensson, Jan T.
    Bhat, Prasanna
    Moscou, Matthew
    Resnik, Josh
    Close, Timothy J.
    Muehlbauer, Gary J.
    Hedley, Pete
    Liu, Hui
    Morris, Jenny
    Waugh, Robbie
    Frenkel, Zeev
    Korol, Avraham
    Berges, Helene
    Graner, Andreas
    Stein, Nils
    Steuernagel, Burkhard
    Taudien, Stefan
    Groth, Marco
    Felder, Marius
    Lonardi, Stefano
    Duma, Denisa
    Alpert, Matthew
    Cordero, Francesa
    Beccuti, Marco
    Ciardo, Gianfranco
    Ma, Yaqin
    Wanamaker, Steve
    Stein, Nils
    Close, Timothy J.
    NATURE, 2012, 491 (7426) : 711 - +
  • [32] The sequence and de novo assembly of the wild yak genome
    Yanbin Liu
    Jiayu Luo
    Jiajia Dou
    Biyao Yan
    Qingmiao Ren
    Bolin Tang
    Kun Wang
    Qiang Qiu
    Scientific Data, 7
  • [33] The sequence and de novo assembly of Oxygymnocypris stewartii genome
    Liu, Hai-Ping
    Xiao, Shi-Jun
    Wu, Nan
    Wang, Di
    Liu, Yan-Chao
    Zhou, Chao-Wei
    Liu, Qi-Yong
    Yang, Rui-Bin
    Jiang, Wen-Kai
    Liang, Qi-Qi
    Jiu, Wang
    Zhang, Chi
    Gong, Jun-Hua
    Yuan, Xiao-Hui
    Mou, Zhen-Bo
    SCIENTIFIC DATA, 2019, 6 (1)
  • [34] The sequence and de novo assembly of the giant panda genome
    Ruiqiang Li
    Wei Fan
    Geng Tian
    Hongmei Zhu
    Lin He
    Jing Cai
    Quanfei Huang
    Qingle Cai
    Bo Li
    Yinqi Bai
    Zhihe Zhang
    Yaping Zhang
    Wen Wang
    Jun Li
    Fuwen Wei
    Heng Li
    Min Jian
    Jianwen Li
    Zhaolei Zhang
    Rasmus Nielsen
    Dawei Li
    Wanjun Gu
    Zhentao Yang
    Zhaoling Xuan
    Oliver A. Ryder
    Frederick Chi-Ching Leung
    Yan Zhou
    Jianjun Cao
    Xiao Sun
    Yonggui Fu
    Xiaodong Fang
    Xiaosen Guo
    Bo Wang
    Rong Hou
    Fujun Shen
    Bo Mu
    Peixiang Ni
    Runmao Lin
    Wubin Qian
    Guodong Wang
    Chang Yu
    Wenhui Nie
    Jinhuan Wang
    Zhigang Wu
    Huiqing Liang
    Jiumeng Min
    Qi Wu
    Shifeng Cheng
    Jue Ruan
    Mingwei Wang
    Nature, 2010, 463 : 311 - 317
  • [35] The sequence and de novo assembly of the giant panda genome
    Li, Ruiqiang
    Fan, Wei
    Tian, Geng
    Zhu, Hongmei
    He, Lin
    Cai, Jing
    Huang, Quanfei
    Cai, Qingle
    Li, Bo
    Bai, Yinqi
    Zhang, Zhihe
    Zhang, Yaping
    Wang, Wen
    Li, Jun
    Wei, Fuwen
    Li, Heng
    Jian, Min
    Li, Jianwen
    Zhang, Zhaolei
    Nielsen, Rasmus
    Li, Dawei
    Gu, Wanjun
    Yang, Zhentao
    Xuan, Zhaoling
    Ryder, Oliver A.
    Leung, Frederick Chi-Ching
    Zhou, Yan
    Cao, Jianjun
    Sun, Xiao
    Fu, Yonggui
    Fang, Xiaodong
    Guo, Xiaosen
    Wang, Bo
    Hou, Rong
    Shen, Fujun
    Mu, Bo
    Ni, Peixiang
    Lin, Runmao
    Qian, Wubin
    Wang, Guodong
    Yu, Chang
    Nie, Wenhui
    Wang, Jinhuan
    Wu, Zhigang
    Liang, Huiqing
    Min, Jiumeng
    Wu, Qi
    Cheng, Shifeng
    Ruan, Jue
    Wang, Mingwei
    NATURE, 2010, 463 (7279) : 311 - 317
  • [36] Current Strategies of Polyploid Plant Genome Sequence Assembly
    Kyriakidou, Maria
    Tai, Helen H.
    Anglin, Noelle L.
    Ellis, David
    Stromvik, Martina V.
    FRONTIERS IN PLANT SCIENCE, 2018, 9
  • [38] Application of genetic algorithms to assembly sequence planning with limited resources
    Inst d'Organitzacio i Control de, Sistemes Industrials , Barcelona, Spain
    Proc IEEE Int Symp Assem Task Plan, (411-416):
  • [39] Issues in interpreting and using genome-wide sequence data
    Durbin, Richard
    JOURNAL OF MEDICAL GENETICS, 2010, 47 : S18 - S18
  • [40] Erratum: The sequence and de novo assembly of the giant panda genome
    Ruiqiang Li
    Wei Fan
    Geng Tian
    Hongmei Zhu
    Lin He
    Jing Cai
    Quanfei Huang
    Qingle Cai
    Bo Li
    Yinqi Bai
    Zhihe Zhang
    Yaping Zhang
    Wen Wang
    Jun Li
    Fuwen Wei
    Heng Li
    Min Jian
    Jianwen Li
    Zhaolei Zhang
    Rasmus Nielsen
    Dawei Li
    Wanjun Gu
    Zhentao Yang
    Zhaoling Xuan
    Oliver A. Ryder
    Frederick Chi-Ching Leung
    Yan Zhou
    Jianjun Cao
    Xiao Sun
    Yonggui Fu
    Xiaodong Fang
    Xiaosen Guo
    Bo Wang
    Rong Hou
    Fujun Shen
    Bo Mu
    Peixiang Ni
    Runmao Lin
    Wubin Qian
    Guodong Wang
    Chang Yu
    Wenhui Nie
    Jinhuan Wang
    Zhigang Wu
    Huiqing Liang
    Jiumeng Min
    Qi Wu
    Shifeng Cheng
    Jue Ruan
    Mingwei Wang
    Nature, 2010, 463 : 1106 - 1106