PCAP: A whole-genome assembly program

Cited by: 187
Authors
Huang, XQ [1]
Wang, JM
Aluru, S
Yang, SP
Hillier, L
Affiliations
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
[2] Iowa State Univ, Dept Elect & Comp Engn, Ames, IA 50011 USA
[3] Washington Univ, Sch Med, Genome Sequencing Ctr, St Louis, MO 63108 USA
DOI
10.1101/gr.1390403
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
We describe a whole-genome assembly program named PCAP for processing tens of millions of reads. The PCAP program has several features to address efficiency and accuracy issues in assembly. Multiple processors are used to perform the most time-consuming computations in assembly. A more sensitive method is used to avoid missing overlaps caused by sequencing errors. Repetitive regions of reads are detected on the basis of many overlaps with other reads, instead of many shorter word matches with other reads. Contaminated end regions of reads are identified and removed. Generation of a consensus sequence for a contig is based on an alignment of reads in the contig, in which both base quality values and coverage information are used to determine every consensus base. The PCAP program was tested on a mouse whole-genome data set of 30 million reads and a human Chromosome 20 data set of 1.7 million reads. The program is freely available for academic use.
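The abstract does not spell out how quality values and coverage are combined, so the following is only an illustrative sketch of the general idea, not PCAP's actual procedure. It assumes each alignment column is a list of (base, quality) pairs, picks the base with the highest summed quality, and emits "N" when coverage falls below a threshold. The function names and the `min_coverage` parameter are hypothetical.

```python
from collections import defaultdict


def consensus_base(column, min_coverage=2):
    """Call one consensus base from an alignment column.

    column: list of (base, quality) pairs, one per read covering this position.
    Assumption (not from the paper): the base with the highest summed quality
    wins, and positions with too few reads are reported as "N".
    """
    if len(column) < min_coverage:
        return "N"
    score = defaultdict(int)
    for base, quality in column:
        score[base] += quality
    # Iterate in sorted order so ties resolve to the alphabetically first base.
    return max(sorted(score), key=lambda b: score[b])


def consensus(columns, min_coverage=2):
    """Concatenate per-column calls into a consensus string."""
    return "".join(consensus_base(col, min_coverage) for col in columns)
```

For example, a column holding two low-quality "A" calls and one high-quality "C" call can still resolve to "A" when the summed quality of the "A" reads is larger, which is one simple way coverage and quality can interact.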
Pages: 2164-2170
Number of pages: 7