PCAP: A whole-genome assembly program

被引:186
|
作者
Huang, XQ [1 ]
Wang, JM
Aluru, S
Yang, SP
Hillier, L
机构
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
[2] Iowa State Univ, Dept Elect & Comp Engn, Ames, IA 50011 USA
[3] Washington Univ, Sch Med, Genome Sequencing Ctr, St Louis, MO 63108 USA
关键词
D O I
10.1101/gr.1390403
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We describe a whole-genome assembly program named PCAP for processing tens of millions of reads. The PCAP program has several features to address efficiency and accuracy issues in assembly. Multiple processors are used to perform most time-consuming computations in assembly. A more sensitive method is used to avoid missing overlaps caused by sequencing errors. Repetitive regions of reads are detected oil the basis of many overlaps with other reads, instead of many shorter word matches with other reads. Contaminated end regions of reads are identified and removed. Generation of a consensus sequence for a contig is based on an alignment of reads in the contig, in which both base quality values and coverage information are used to determine every consensus base. The PCAP program was tested on a mouse whole-genome data set of 30 million reads and a human Chromosome 20 data set of 1.7 million reads. The program is freely available for academic use.
引用
收藏
页码:2164 / 2170
页数:7
相关论文
共 50 条
  • [1] A whole-genome assembly of Drosophila
    Myers, EW
    Sutton, GG
    Delcher, AL
    Dew, IM
    Fasulo, DP
    Flanigan, MJ
    Kravitz, SA
    Mobarry, CM
    Reinert, KHJ
    Remington, KA
    Anson, EL
    Bolanos, RA
    Chou, HH
    Jordan, CM
    Halpern, AL
    Lonardi, S
    Beasley, EM
    Brandon, RC
    Chen, L
    Dunn, PJ
    Lai, ZW
    Liang, Y
    Nusskern, DR
    Zhan, M
    Zhang, Q
    Zheng, XQ
    Rubin, GM
    Adams, MD
    Venter, JC
    [J]. SCIENCE, 2000, 287 (5461) : 2196 - 2204
  • [2] Whole-genome assembly of Culex tarsalis
    Main, Bradley J.
    Marcantonio, Matteo
    Johnston, J. Spencer
    Rasgon, Jason L.
    Brown, C. Titus
    Barker, Christopher M.
    [J]. G3-GENES GENOMES GENETICS, 2021, 11 (02):
  • [3] Whole-genome shotgun assembly and comparison of human genome assemblies
    Istrail, S
    Sutton, GG
    Florea, L
    Halpern, AL
    Mobarry, CM
    Lippert, R
    Walenz, B
    Shatkay, H
    Dew, I
    Miller, JR
    Flanigan, MJ
    Edwards, NJ
    Bolanos, R
    Fasulo, D
    Halldorsson, BV
    Hannenhalli, S
    Turner, R
    Yooseph, S
    Lu, F
    Nusskern, DR
    Shue, BC
    Zheng, XQH
    Zhong, F
    Delcher, AL
    Huson, DH
    Kravitz, SA
    Mouchard, L
    Reinert, K
    Remington, KA
    Clark, AG
    Waterman, MS
    Eichler, EE
    Adams, MD
    Hunkapiller, MW
    Myers, EW
    Venter, JC
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (07) : 1916 - 1921
  • [4] Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes
    Aparicio, S
    Chapman, J
    Stupka, E
    Putnam, N
    Chia, J
    Dehal, P
    Christoffels, A
    Rash, S
    Hoon, S
    Smit, A
    Gelpke, MDS
    Roach, J
    Oh, T
    Ho, IY
    Wong, M
    Detter, C
    Verhoef, F
    Predki, P
    Tay, A
    Lucas, S
    Richardson, P
    Smith, SF
    Clark, MS
    Edwards, YJK
    Doggett, N
    Zharkikh, A
    Tavtigian, SV
    Pruss, D
    Barnstead, M
    Evans, C
    Baden, H
    Powell, J
    Glusman, G
    Rowen, L
    Hood, L
    Tan, YH
    Elgar, G
    Hawkins, T
    Venkatesh, B
    Rokhsar, D
    Brenner, S
    [J]. SCIENCE, 2002, 297 (5585) : 1301 - 1310
  • [5] GWAsimulator: a rapid whole-genome simulation program
    Li, Chun
    Li, Mingyao
    [J]. BIOINFORMATICS, 2008, 24 (01) : 140 - 142
  • [6] The present and future of de novo whole-genome assembly
    Sohn, Jang-il
    Nam, Jin-Wu
    [J]. BRIEFINGS IN BIOINFORMATICS, 2018, 19 (01) : 23 - 40
  • [7] A whole-genome assembly of the domestic cow, Bos taurus
    Zimin, Aleksey V.
    Delcher, Arthur L.
    Florea, Liliana
    Kelley, David R.
    Schatz, Michael C.
    Puiu, Daniela
    Hanrahan, Finnian
    Pertea, Geo
    Van Tassell, Curtis P.
    Sonstegard, Tad S.
    Marcais, Guillaume
    Roberts, Michael
    Subramanian, Poorani
    Yorke, James A.
    Salzberg, Steven L.
    [J]. GENOME BIOLOGY, 2009, 10 (04):
  • [8] A field guide to whole-genome sequencing, assembly and annotation
    Ekblom, Robert
    Wolf, Jochen B. W.
    [J]. EVOLUTIONARY APPLICATIONS, 2014, 7 (09): : 1026 - 1042
  • [9] A whole-genome assembly of the domestic cow, Bos taurus
    Aleksey V Zimin
    Arthur L Delcher
    Liliana Florea
    David R Kelley
    Michael C Schatz
    Daniela Puiu
    Finnian Hanrahan
    Geo Pertea
    Curtis P Van Tassell
    Tad S Sonstegard
    Guillaume Marçais
    Michael Roberts
    Poorani Subramanian
    James A Yorke
    Steven L Salzberg
    [J]. Genome Biology, 10
  • [10] Assembly and annotation of whole-genome sequence of Fusarium equiseti
    Li, Xueping
    Xu, Shiyang
    Zhang, Jungao
    Li, Minquan
    [J]. GENOMICS, 2021, 113 (04) : 2870 - 2876