ARACHNE: A whole-genome shotgun assembler

被引:2
|
作者
Batzoglou, S
Jaffe, DB
Stanley, K
Butler, J
Gnerre, S
Mauceli, E
Berger, B
Mesirov, JP
Lander, ES [1 ]
机构
[1] MIT, Whitehead Inst Biomed Res, Ctr Genome Res, Cambridge, MA 02141 USA
[2] MIT, Comp Sci Lab, Cambridge, MA 02139 USA
[3] Univ Nebraska, Dept Math & Stat, Lincoln, NE 68588 USA
[4] MIT, Dept Math, Cambridge, MA 02141 USA
[5] MIT, Dept Biol, Cambridge, MA 02141 USA
关键词
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We describe a new computer system, called ARACHNE, for assembling genome sequence using paired-end whole-genome shotgun reads. ARACHNE has several key features, including an efficient and sensitive procedure for finding read overlaps, a procedure for scoring overlaps that achieves high accuracy by correcting errors before assembly, read merger based on forward-reverse links, and detection of repeat contigs by forward-reverse link inconsistency. To test ARACHNE, we created simulated reads providing similar to10-fold coverage of the genomes of H. influenzae, S. cerevisiae, and A melanogaster, as well as human chromosomes 21 and 22. The assemblies of these simulated reads yielded nearly complete coverage of the respective genomes, with a small number of contigs joined into a smaller number of supercontigs (or scaffolds). For example, analysis of the A melanogaster genome yielded similar to98% coverage with an N50 contig length of 324 kb and an N50 supercontig length of 5143 kb. The assembly accuracy was high, although not perfect: small errors occurred at a frequency of roughly 1 per 1 Mb (typically, deletion of similar to1 kb in size), with a very small number of other misassemblies. The assembly was rapid: the Drosophila assembly required only 21 hours on a single 667 MHz processor and used 8.4 Gb of memory.
引用
收藏
页码:177 / 189
页数:13
相关论文
共 50 条
  • [1] Against a whole-genome shotgun
    Green, P
    [J]. GENOME RESEARCH, 1997, 7 (05): : 410 - 417
  • [2] Human whole-genome shotgun sequencing
    Weber, JL
    Myers, EW
    [J]. GENOME RESEARCH, 1997, 7 (05) : 401 - 409
  • [3] Whole-genome sequence assembly for mammalian genomes: Arachne 2
    Jaffe, DB
    Butler, J
    Gnerre, S
    Mauceli, E
    Lindblad-Toh, K
    Mesirov, JP
    Zody, MC
    Lander, ES
    [J]. GENOME RESEARCH, 2003, 13 (01) : 91 - 96
  • [4] Whole-genome shotgun optical mapping of Rhodobacter sphaeroides strain 2.4.1 and its use for whole-genome shotgun sequence assembly
    Zhou, SG
    Kvikstad, E
    Kile, A
    Severin, J
    Forrest, D
    Runnheim, R
    Churas, C
    Hickman, JW
    Mackenzie, C
    Choudhary, M
    Donohue, T
    Kaplan, S
    Schwartz, DC
    [J]. GENOME RESEARCH, 2003, 13 (09) : 2142 - 2151
  • [5] A whole-genome shotgun approach to human reference genome sequencing
    Morishita, Shinichi
    [J]. NATURE REVIEWS GENETICS, 2024, 25 (04) : 236 - 236
  • [6] A whole-genome shotgun approach to human reference genome sequencing
    Shinichi Morishita
    [J]. Nature Reviews Genetics, 2024, 25 : 236 - 236
  • [7] Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes
    Aparicio, S
    Chapman, J
    Stupka, E
    Putnam, N
    Chia, J
    Dehal, P
    Christoffels, A
    Rash, S
    Hoon, S
    Smit, A
    Gelpke, MDS
    Roach, J
    Oh, T
    Ho, IY
    Wong, M
    Detter, C
    Verhoef, F
    Predki, P
    Tay, A
    Lucas, S
    Richardson, P
    Smith, SF
    Clark, MS
    Edwards, YJK
    Doggett, N
    Zharkikh, A
    Tavtigian, SV
    Pruss, D
    Barnstead, M
    Evans, C
    Baden, H
    Powell, J
    Glusman, G
    Rowen, L
    Hood, L
    Tan, YH
    Elgar, G
    Hawkins, T
    Venkatesh, B
    Rokhsar, D
    Brenner, S
    [J]. SCIENCE, 2002, 297 (5585) : 1301 - 1310
  • [8] Whole-genome shotgun assembly and comparison of human genome assemblies
    Istrail, S
    Sutton, GG
    Florea, L
    Halpern, AL
    Mobarry, CM
    Lippert, R
    Walenz, B
    Shatkay, H
    Dew, I
    Miller, JR
    Flanigan, MJ
    Edwards, NJ
    Bolanos, R
    Fasulo, D
    Halldorsson, BV
    Hannenhalli, S
    Turner, R
    Yooseph, S
    Lu, F
    Nusskern, DR
    Shue, BC
    Zheng, XQH
    Zhong, F
    Delcher, AL
    Huson, DH
    Kravitz, SA
    Mouchard, L
    Reinert, K
    Remington, KA
    Clark, AG
    Waterman, MS
    Eichler, EE
    Adams, MD
    Hunkapiller, MW
    Myers, EW
    Venter, JC
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (07) : 1916 - 1921
  • [9] Analysis of the breadwheat genome using whole-genome shotgun sequencing
    Brenchley, Rachel
    Spannagl, Manuel
    Pfeifer, Matthias
    Barker, Gary L. A.
    D'Amore, Rosalinda
    Allen, Alexandra M.
    McKenzie, Neil
    Kramer, Melissa
    Kerhornou, Arnaud
    Bolser, Dan
    Kay, Suzanne
    Waite, Darren
    Trick, Martin
    Bancroft, Ian
    Gu, Yong
    Huo, Naxin
    Luo, Ming-Cheng
    Sehgal, Sunish
    Gill, Bikram
    Kianian, Sharyar
    Anderson, Olin
    Kersey, Paul
    Dvorak, Jan
    McCombie, W. Richard
    Hall, Anthony
    Mayer, Klaus F. X.
    Edwards, Keith J.
    Bevan, Michael W.
    Hall, Neil
    [J]. NATURE, 2012, 491 (7426) : 705 - 710
  • [10] Comparing vertebrate whole-genome shotgun reads to the human genome
    Chen, R
    Bouck, JB
    Weinstock, GM
    Gibbs, RA
    [J]. GENOME RESEARCH, 2001, 11 (11) : 1807 - 1816