A de novo metagenomic assembly program for shotgun DNA reads

Cited by: 30
Authors
Lai, Binbin [1, 2, 3, 4]
Ding, Ruogu [1, 2, 3]
Li, Yang [1, 2, 3]
Duan, Liping [5]
Zhu, Huaiqiu [1, 2, 3, 4]
Affiliations
[1] Peking Univ, Coll Engn, State Key Lab Turbulence & Complex Syst, Beijing 100871, Peoples R China
[2] Peking Univ, Coll Engn, Dept Biomed Engn, Beijing 100871, Peoples R China
[3] Peking Univ, Ctr Theoret Biol, Beijing 100871, Peoples R China
[4] Peking Univ, Ctr Prot Sci, Beijing 100871, Peoples R China
[5] Peking Univ Third Hosp, Dept Gastroenterol, Beijing 100191, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
GENOMES;
DOI
10.1093/bioinformatics/bts162
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: A high-quality assembly of reads generated from shotgun sequencing is a substantial step in metagenome projects. Although traditional assemblers have been employed in the initial analysis of metagenomes, they cannot overcome the challenges created by the features of metagenomic data. Results: We present a de novo assembly approach and its implementation, named MAP (metagenomic assembly program). Based on an improved overlap/layout/consensus (OLC) strategy combined with several special algorithms, MAP uses mate-pair information, which makes it more applicable to the shotgun DNA reads (recommended length > 200 bp) currently widely used in metagenome projects. Extensive tests on simulated data show that MAP can be superior to both Celera and Phrap for typical longer reads from Sanger sequencing, and that it has an evident advantage over Celera, Newbler and the newest Genovo for typical shorter reads from 454 sequencing.
Pages: 1455-1462
Page count: 8
Related papers
50 in total
  • [1] De novo assembly of short sequence reads
    Paszkiewicz, Konrad
    Studholme, David J.
    BRIEFINGS IN BIOINFORMATICS, 2010, 11 (05) : 457 - 472
  • [2] Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads
    Kajitani, Rei
    Toshimoto, Kouta
    Noguchi, Hideki
    Toyoda, Atsushi
    Ogura, Yoshitoshi
    Okuno, Miki
    Yabana, Mitsuru
    Harada, Masayuki
    Nagayasu, Eiji
    Maruyama, Haruhiko
    Kohara, Yuji
    Fujiyama, Asao
    Hayashi, Tetsuya
    Itoh, Takehiko
    GENOME RESEARCH, 2014, 24 (08) : 1384 - 1395
  • [3] GPU acceleration of Darwin read overlapper for de novo assembly of long DNA reads
    Nauman Ahmed
    Tong Dong Qiu
    Koen Bertels
    Zaid Al-Ars
    BMC Bioinformatics, 21
  • [4] GPU acceleration of Darwin read overlapper for de novo assembly of long DNA reads
    Ahmed, Nauman
    Qiu, Tong Dong
    Bertels, Koen
    Al-Ars, Zaid
    BMC BIOINFORMATICS, 2020, 21 (Suppl 13)
  • [5] Error Correction in Nanopore Reads for de novo Genomic Assembly
    Aldridge-Aguila, Jacqueline
    Alvarez-Saravia, Diego
    Navarrete, Marcelo
    Uribe-Paredes, Roberto
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2020, PT V, 2020, 12253 : 754 - 762
  • [6] Genometa - A Fast and Accurate Classifier for Short Metagenomic Shotgun Reads
    Davenport, Colin F.
    Neugebauer, Jens
    Beckmann, Nils
    Friedrich, Benedikt
    Kameri, Burim
    Kokott, Svea
    Paetow, Malte
    Siekmann, Bjoern
    Wieding-Drewes, Matthias
    Wienhoefer, Markus
    Wolf, Stefan
    Tuemmler, Burkhard
    Ahlers, Volker
    Sprengel, Frauke
    PLOS ONE, 2012, 7 (08):
  • [7] Sequence assembly from corrupted shotgun reads
    Ganguly, Shirshendu
    Mossel, Elchanan
    Racz, Miklos Z.
    2016 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2016, : 265 - 269
  • [8] New algorithms for accurate and efficient de novo genome assembly from long DNA sequencing reads
    Gonzalez-Garcia, Laura
    Guevara-Barrientos, David
    Lozano-Arce, Daniela
    Gil, Juanita
    Diaz-Riano, Jorge
    Duarte, Erick
    Andrade, German
    Camilo Bojaca, Juan
    Camila Hoyos-Sanchez, Maria
    Chavarro, Christian
    Guayazan, Natalia
    Chica, Luis Alberto
    Buitrago Acosta, Maria Camila
    Bautista, Edwin
    Trujillo, Miller
    Duitama, Jorge
    LIFE SCIENCE ALLIANCE, 2023, 6 (05)
  • [9] Linking De Novo Assembly Results with Long DNA Reads Using the dnaasm-link Application
    Kusmirek, Wiktor
    Franus, Wiktor
    Nowak, Robert
    BIOMED RESEARCH INTERNATIONAL, 2019, 2019
  • [10] DIME: A Novel Framework for De Novo Metagenomic Sequence Assembly
    Guo, Xuan
    Yu, Ning
    Ding, Xiaojun
    Wang, Jianxin
    Pan, Yi
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2015, 22 (02) : 159 - 177