Readjoiner: a fast and memory efficient string graph-based sequence assembler

被引:37
|
作者
Gonnella, Giorgio [1 ]
Kurtz, Stefan [1 ]
机构
[1] Univ Hamburg, Ctr Bioinformat, D-20146 Hamburg, Germany
来源
BMC BIOINFORMATICS | 2012年 / 13卷
关键词
LARGE GENOMES;
D O I
10.1186/1471-2105-13-82
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Ongoing improvements in throughput of the next-generation sequencing technologies challenge the current generation of de novo sequence assemblers. Most recent sequence assemblers are based on the construction of a de Bruijn graph. An alternative framework of growing interest is the assembly string graph, not necessitating a division of the reads into k-mers, but requiring fast algorithms for the computation of suffix-prefix matches among all pairs of reads. Results: Here we present efficient methods for the construction of a string graph from a set of sequencing reads. Our approach employs suffix sorting and scanning methods to compute suffix-prefix matches. Transitive edges are recognized and eliminated early in the process and the graph is efficiently constructed including irreducible edges only. Conclusions: Our suffix-prefix match determination and string graph construction algorithms have been implemented in the software package Readjoiner. Comparison with existing string graph-based assemblers shows that Readjoiner is faster and more space efficient. Readjoiner is available at http://www.zbh.uni-hamburg.de/readjoiner.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Readjoiner: a fast and memory efficient string graph-based sequence assembler
    Giorgio Gonnella
    Stefan Kurtz
    BMC Bioinformatics, 13
  • [2] A de novo next generation genomic sequence assembler based on string graph and MapReduce cloud computing framework
    Chang, Yu-Jung
    Chen, Chien-Chih
    Chen, Chuen-Liang
    Ho, Jan-Ming
    BMC GENOMICS, 2012, 13
  • [3] A de novo next generation genomic sequence assembler based on string graph and MapReduce cloud computing framework
    Yu-Jung Chang
    Chien-Chih Chen
    Chuen-Liang Chen
    Jan-Ming Ho
    BMC Genomics, 13
  • [4] Sample Efficient Reinforcement Learning Using Graph-Based Memory Reconstruction
    Kang Y.
    Zhao E.
    Zang Y.
    Li L.
    Li K.
    Tao P.
    Xing J.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (02): : 751 - 762
  • [5] SEQUENCE TRANSDUCTION WITH GRAPH-BASED SUPERVISION
    Moritz, Niko
    Hori, Takaaki
    Watanabe, Shinji
    Le Roux, Jonathan
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7212 - 7216
  • [6] Graph-based fast image segmentation
    Han, Dongfeng
    Li, Wenhui
    Lu, Xiaosuo
    Li, Lin
    Wang, Yi
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2006, 4109 : 468 - 474
  • [7] Graph-based fast texture synthesis
    Zou, Kun
    Li, Wensheng
    Deng, Chunjian
    Journal of Computational Information Systems, 2011, 7 (13): : 4880 - 4889
  • [8] Learning Efficient Hash Codes for Fast Graph-Based Data Similarity Retrieval
    Wang, Jinbao
    Xu, Shuo
    Zheng, Feng
    Lu, Ke
    Song, Jingkuan
    Shao, Ling
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 6321 - 6334
  • [9] Efficient Graph-Based Document Similarity
    Paul, Christian
    Rettinger, Achim
    Mogadala, Aditya
    Knoblock, Craig A.
    Szekely, Pedro
    SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, 2016, 9678 : 334 - 349
  • [10] Efficient Graph-Based Image Segmentation
    Pedro F. Felzenszwalb
    Daniel P. Huttenlocher
    International Journal of Computer Vision, 2004, 59 : 167 - 181