Fast String Matching with Space-efficient Word Graphs

Cited by: 0
Authors
Yata, Susumu [1]
Morita, Kazuhiro [1]
Fuketa, Masao [1]
Aoe, Jun-ichi [1]
Affiliations
[1] Univ Tokushima, Inst Sci & Technol, Tokushima, Japan
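Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract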
String matching is fundamental to text-processing applications such as text mining and content filtering systems. This paper describes a fast string matching algorithm that uses a compact pattern matching machine, the directed acyclic word graph (DAWG). A DAWG is traditionally implemented with a 2-dimensional linked list or matrix. Both structures have drawbacks: the lookup time of the linked-list-based DAWG is slow, and the space requirement of the matrix-based DAWG is large. This paper therefore proposes a novel DAWG based on a compacted double-array, which overcomes the drawbacks of the traditional ones. Experimental results show that the novel DAWG is more efficient than the traditional ones.
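The trade-off described in the abstract can be illustrated with a minimal sketch. It is not the paper's compacted double-array: it uses plain first-fit row displacement with three arrays (base, check, target), a simpler relative of the double-array that also tolerates the shared states of a DAWG. The example automaton, the alphabet, and every name below are assumptions made for illustration only.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical automaton: the minimal DAWG for {"tap", "taps", "top", "tops"}.
# State 2 is shared by the prefixes "ta" and "to".
TRANSITIONS: Dict[int, Dict[str, int]] = {
    0: {"t": 1},
    1: {"a": 2, "o": 2},
    2: {"p": 3},
    3: {"s": 4},
    4: {},
}
ACCEPTING = {3, 4}
ALPHABET = "aopst"
CODE = {ch: i for i, ch in enumerate(ALPHABET)}  # symbol -> column offset


def pack(
    transitions: Dict[int, Dict[str, int]]
) -> Tuple[Dict[int, int], List[Optional[int]], List[Optional[int]]]:
    """Overlay the sparse rows of a transition matrix into shared 1-D arrays."""
    base: Dict[int, int] = {}
    check: List[Optional[int]] = []   # which state owns each slot
    target: List[Optional[int]] = []  # destination state stored in each slot
    for state, edges in transitions.items():
        offsets = [CODE[ch] for ch in edges]
        b = 0
        # First fit: smallest base at which all slots this row needs are free.
        while any(b + o < len(check) and check[b + o] is not None for o in offsets):
            b += 1
        # Grow the arrays so that base + code is always a valid index.
        grow = b + len(ALPHABET) - len(check)
        if grow > 0:
            check.extend([None] * grow)
            target.extend([None] * grow)
        for ch, dest in edges.items():
            check[b + CODE[ch]] = state
            target[b + CODE[ch]] = dest
        base[state] = b
    return base, check, target


BASE, CHECK, TARGET = pack(TRANSITIONS)


def step(state: int, ch: str) -> Optional[int]:
    """One transition in constant time: an addition and an ownership check."""
    if ch not in CODE:
        return None
    slot = BASE[state] + CODE[ch]
    return TARGET[slot] if CHECK[slot] == state else None


def accepts(word: str) -> bool:
    """Walk the packed automaton from state 0 and test for an accepting state."""
    state: Optional[int] = 0
    for ch in word:
        state = step(state, ch)
        if state is None:
            return False
    return state in ACCEPTING


if __name__ == "__main__":
    matrix_cells = len(TRANSITIONS) * len(ALPHABET)
    print("packed slots:", len(CHECK), "full matrix cells:", matrix_cells)
    print(accepts("tops"), accepts("to"))
```

Each lookup costs one addition and one comparison, as with a matrix, while rows that are mostly empty share the same storage. A linked list, by contrast, would scan the outgoing edges of a state one by one. A true double-array goes further by using slot indices as state identifiers, so no separate target array is needed.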
Pages: 484-488
Page count: 5