Computing Matching Statistics and Maximal Exact Matches on Compressed Full-Text Indexes

被引:0
|
作者
Ohlebusch, Enno [1 ]
Gog, Simon [1 ]
Kuegel, Adrian [1 ]
机构
[1] Univ Ulm, Inst Theoret Comp Sci, D-89069 Ulm, Germany
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Exact string matching is a problem that computer programmers face on a regular basis, and full-text indexes like the suffix tree or the suffix array provide fast string search over large texts. In the last decade, research on compressed indexes has flourished because the main problem in large-scale applications is the space consumption of the index. Nowadays, the most successful compressed indexes are able to obtain almost optimal space and search time simultaneously. It is known that a myriad of sequence analysis and comparison problems can be solved efficiently with established data structures like the suffix tree or the suffix array, but algorithms on compressed indexes that solve these problem are still lacking at present. Here, we show that matching statistics and maximal exact matches between two strings S-1 and S-2 can be computed efficiently by matching S-2 backwards against a compressed index of S-1.
引用
收藏
页码:347 / 358
页数:12
相关论文
共 24 条