Design of Fast Multiple String Searching Based on Improved Prefix Tree

被引:1
|
作者
Cheng, Yu [1 ]
Zhang, Tao [2 ]
机构
[1] Tsinghua Univ, Dept Biomed Engn, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
关键词
Multi-string matching; prefix tree; string pattern;
D O I
10.1109/WKDD.2010.138
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-string matching is one of the most important components in data mining task. New applications in many technology fields require high performance string matching algorithms. This paper first presents a new string searching approach based on a data structure called prefix tree. The innovative algorithm eliminates the functional overlap of the table HASH and Prefix Function. Then we make a little improvement on the prefix tree and present a second algorithm that is faster and more space-saving. It is demonstrated analytically that the two algorithms inherit the optimality and are very competitive in practice. On tests of both real life and synthetic data, our algorithms are also efficient and especially effective for various string pattern and large alphabet sets.
引用
收藏
页码:111 / 114
页数:4
相关论文
共 50 条
  • [41] OFDM Synchronization Improved Algorithm Based on Cyclic Prefix
    Zhang, T. Y.
    Chen, G. F.
    Zhang, X. C.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL APPLICATIONS (CISIA 2015), 2015, 18 : 18 - 20
  • [42] String tension and thermodynamics with tree level and tadpole improved actions
    Fakultät für Physik, Universität Bielefeld, D-33615 Bielefeld, Germany
    Eur. Phys. J. C, 1 (133-140):
  • [43] String pattern searching algorithm based on characters indices
    Markic, Ivan
    Stula, Maja
    Zoric, Marija
    2019 4TH INTERNATIONAL CONFERENCE ON SMART AND SUSTAINABLE TECHNOLOGIES (SPLITECH), 2019, : 100 - 103
  • [44] String tension and thermodynamics with tree level and tadpole improved actions
    Beinlich, B
    Karsch, F
    Laermann, E
    Peikert, A
    EUROPEAN PHYSICAL JOURNAL C, 1999, 6 (01): : 133 - 140
  • [45] String tension and thermodynamics with tree level and tadpole improved actions
    Beinlich B.
    Karsch F.
    Laermann E.
    Peikert A.
    The European Physical Journal C - Particles and Fields, 1999, 6 (1): : 133 - 140
  • [46] A phonetic string searching algorithm based on syllable alignment
    Gong, RB
    Tony, CKY
    PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON INTERNET AND MULTIMEDIA SYSTEMS AND APPLICATIONS, 2004, : 108 - 113
  • [47] On the design of fast prefix-preserving IP address anonymization scheme
    Zhang, Qianli
    Wang, Jilong
    Li, Xing
    INFORMATION AND COMMUNICATIONS SECURITY, PROCEEDINGS, 2007, 4681 : 177 - 188
  • [48] Design and implementation of shortest travel path searching based on improved Dijkstra algorithm
    Mo, Taiping
    Zhao, Huihuang
    Mo, Wei
    MECHATRONICS AND APPLIED MECHANICS, PTS 1 AND 2, 2012, 157-158 : 390 - +
  • [49] IMPROVED STRING DESIGN CUTS ROD BREAKS
    WEST, PT
    WORLD OIL, 1973, 176 (06) : 64 - 65
  • [50] Fast multiple pattern algorithm for Chinese string matching
    Shen, Zhou
    Wang, Yong-Cheng
    Xu, Yi-Zhen
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2001, 35 (09): : 1285 - 1289