Using the Sadakane Compressed Suffix Tree to Solve the All-Pairs Suffix-Prefix Problem

Authors
Rachid, Maan Haj [1]
Malluhi, Qutaibah [1]
Abouelhoda, Mohamed [2, 3]
Affiliations
[1] Qatar Univ, KINDI Lab Comp Res, Doha, Qatar
[2] Cairo Univ, Fac Engn, Giza, Egypt
[3] Nile Univ, Ctr Informat Sci, Giza, Egypt
DOI
10.1155/2014/745298
Chinese Library Classification
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
The all-pairs suffix-prefix matching problem is a basic problem in string processing. It has an application in de novo genome assembly, one of the major problems in bioinformatics. Because the input data are large, fast and space-efficient solutions are crucial. In this paper, we present a space-economical solution to this problem using the generalized Sadakane compressed suffix tree. Furthermore, we present a parallel algorithm that provides additional speed on shared-memory computers. Our sequential and parallel algorithms are optimized by exploiting features of the Sadakane compressed index data structure. Experimental results show that our solution based on Sadakane's compressed index consumes significantly less space than solutions based on noncompressed data structures such as the suffix tree and the enhanced suffix array. They also show that our parallel algorithm is efficient and scales well with an increasing number of processors.
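For readers unfamiliar with the problem statement, the following is a minimal brute-force sketch of what "all-pairs suffix-prefix" asks for: for every ordered pair of input strings, the length of the longest suffix of the first that is also a prefix of the second. It is not the paper's method; it uses no suffix tree or compressed index and takes quadratic time per pair in the worst case. The function names and the choice to report 0 on the diagonal (a string paired with itself) are assumptions made for illustration.

```python
from typing import List


def longest_suffix_prefix(a: str, b: str) -> int:
    """Length of the longest suffix of `a` that equals a prefix of `b`.

    Brute force: try candidate overlap lengths from longest to shortest.
    """
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def all_pairs_suffix_prefix(strings: List[str]) -> List[List[int]]:
    """Overlap matrix: entry [i][j] is the longest overlap between a
    suffix of strings[i] and a prefix of strings[j].

    Assumption for this sketch: self-pairs (i == j) are reported as 0.
    """
    n = len(strings)
    return [
        [0 if i == j else longest_suffix_prefix(strings[i], strings[j])
         for j in range(n)]
        for i in range(n)
    ]


if __name__ == "__main__":
    reads = ["ACGTAC", "TACGGA", "GGACGT"]  # hypothetical short reads
    for row in all_pairs_suffix_prefix(reads):
        print(row)
```

In genome assembly the input is a large set of sequencing reads, so this baseline is useful only for defining the expected output; index-based solutions such as the one described in the abstract exist to avoid comparing every pair directly.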
Pages: 11