A novel look-ahead optimization strategy for trie-based approximate string matching

Cited by: 0
Authors
Ghada Badr
B. John Oommen
Affiliation
[1] Carleton University, School of Computer Science
Keywords
Trie-based syntactic pattern recognition; Approximate string matching; Noisy syntactic recognition using tries; Branch and bound techniques; Pruning;
DOI
Not available
Abstract
This paper deals with the problem of estimating a transmitted string X* by processing the corresponding string Y, which is a noisy version of X*. We assume that Y contains substitution, insertion, and deletion errors, and that X* is an element of a finite (but possibly large) dictionary, H. The best estimate X+ of X* is defined as the element of H that minimizes the generalized Levenshtein distance D(X, Y) between X and Y over all X ∈ H, subject to the total number of errors being at most K. The trie is a data structure that offers search costs that are independent of the document size. Tries also merge common prefixes, so using a trie for approximate string matching lets us reuse the information obtained while evaluating any one D(Xi, Y) to compute any other D(Xj, Y), where Xi and Xj share a common prefix. In the artificial intelligence (AI) domain, branch and bound (BB) schemes are used to prune paths whose costs exceed a certain threshold. These techniques have been applied to prune, for example, game trees. In this paper, we present a new BB pruning strategy that can be applied to dictionary-based approximate string matching when the dictionary is stored as a trie. The new strategy attempts to look ahead at each node, c, before moving further, by merely evaluating a certain local criterion at c. A search algorithm that follows this pruning strategy will not traverse subtrie(c) unless there is a “hope” of finding a suitable string in it. In other words, as opposed to the reported trie-based methods (Kashyap and Oommen in Inf Sci 23(2):123–142, 1981; Shang and Merrett in IEEE Trans Knowledge Data Eng 8(4):540–547, 1996), the pruning is done a priori, before even embarking on the edit distance computations. The new strategy depends highly on the variance of the lengths of the strings in H. It combines the advantages of partitioning the dictionary according to the string lengths with the advantages gleaned by representing H using the trie data structure. The results demonstrate a marked improvement (up to 30% when costs are of a 0/1 form, and up to 47% when costs are general) with respect to the number of operations needed on three benchmark dictionaries.
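The abstract does not spell out the local criterion evaluated at each node, so the following is only an illustrative sketch of how a look-ahead prune might be combined with a trie search. It assumes unit (0/1) costs and assumes that each node stores the shortest and longest word length found in its subtrie. Under those assumptions, a subtrie can be skipped before any edit-distance row is computed whenever none of its word lengths can fall within K of the length of Y. The names `Node`, `build_trie`, and `search` are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    children: dict = field(default_factory=dict)
    is_word: bool = False
    # Assumed bookkeeping: shortest and longest word length in this subtrie.
    min_len: int = 10**9
    max_len: int = 0


def build_trie(words):
    """Build a trie over non-empty words, recording length bounds per node."""
    root = Node()
    for w in words:
        node = root
        path = [root]
        for ch in w:
            node = node.children.setdefault(ch, Node())
            path.append(node)
        node.is_word = True
        for n in path:
            n.min_len = min(n.min_len, len(w))
            n.max_len = max(n.max_len, len(w))
    return root


def search(root, y, k):
    """Return {word: distance} for words within k unit-cost edits of y."""
    results = {}
    lo, hi = len(y) - k, len(y) + k

    def visit(node, prefix, prev_row):
        for ch, child in node.children.items():
            # Look-ahead (assumed criterion): with unit costs the distance is
            # at least the length difference, so skip the subtrie up front if
            # none of its word lengths can lie in [len(y) - k, len(y) + k].
            if child.max_len < lo or child.min_len > hi:
                continue
            # One row of the standard edit-distance table for this trie edge.
            row = [prev_row[0] + 1]
            for j in range(1, len(y) + 1):
                row.append(min(
                    row[j - 1] + 1,                      # extra symbol in y
                    prev_row[j] + 1,                     # symbol missing from y
                    prev_row[j - 1] + (y[j - 1] != ch),  # substitution or match
                ))
            # Conventional cutoff, applied only after the row has been computed.
            if min(row) > k:
                continue
            word = prefix + ch
            if child.is_word and row[-1] <= k:
                results[word] = row[-1]
            visit(child, word, row)

    visit(root, "", list(range(len(y) + 1)))
    return results


if __name__ == "__main__":
    trie = build_trie(["cat", "cart", "car", "dog", "category", "do"])
    print(search(trie, "cat", 1))
```

In this sketch the length test is conservative: it never discards a word that is within K edits, it only avoids computing rows for subtries whose words are all too short or too long. With general (non-unit) costs the bound would have to be restated in terms of the insertion and deletion costs, which this sketch does not attempt.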
Pages: 177-187
Page count: 10