A DICTIONARY RETRIEVAL ALGORITHM USING 2 TRIE STRUCTURES

被引:1
|
作者
MORIMOTO, K
IRIGUCHI, H
AOE, J
机构
[1] Faculty of Engineering, Tokushima University, Tokushima
关键词
NATURAL LANGUAGE PROCESSING; TRIE STRUCTURE; SEARCHING TECHNIQUE; DICTIONARY SEARCHING;
D O I
10.1002/scj.4690260209
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The trie has the feature that the retrieval can be executed with the character symbols composing the key as the unit, and a high-speed retrieval is realized independently of the total number of keys. Consequently, it used frequently in the search of the natural language dictionary and in other problems. A problem, however, is that the number of trie states increases with the enlargement of the key set, which necessitates a larger memory capacity. To remedy this point, DAWG (Directed Acyclic Word-Graph) is proposed, where the common suffix of the tries is compressed. Then, a new problem arises in that the record information cannot be determined uniquely for the key. For this problem, this paper introduces a new structure, where the number of states is reduced by merging the common suffixes of the tries, while determining uniquely the record information for the key. The algorithm for retrieval, insertion and deletion of the key is proposed for the structure. In the proposed method, the set of keys is represented using two tries. One of the tries memorizes the prefix of the minimum length that can discriminate uniquely between the key from other keys. The other trie stores the suffixes of the remaining keys in order to merge the common suffix. A simulation is executed for various sets of keys such as Chinese character called Kanji, alphabets and Japanese Katakana characters, and it is seen that the number of states is reduced by approximately 30 to 65 percent for the key set of 50,000 words, compared to the ordinary trie.
引用
收藏
页码:85 / 97
页数:13
相关论文
共 50 条
  • [1] An efficient retrieval algorithm for case structures using TRIE
    Koyama, M
    Morta, K
    Mizobuchi, S
    Aoe, J
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 4572 - 4577
  • [2] Fast retrieving algorithm of hierarchical relationships using trie structures
    Univ of Tokushima, Tokushima-Shi, Japan
    Inf Process Manage, 6 (761-773):
  • [3] A fast retrieving algorithm of hierarchical relationships using trie structures
    Koyama, M
    Morita, K
    Fuketa, M
    Aoe, JI
    INFORMATION PROCESSING & MANAGEMENT, 1998, 34 (06) : 761 - 773
  • [4] LZ trie and dictionary compression
    Ristov, S
    SOFTWARE-PRACTICE & EXPERIENCE, 2005, 35 (05): : 445 - 465
  • [5] A PARALLEL ALGORITHM FOR PHASE RETRIEVAL WITH DICTIONARY LEARNING
    Liu, Tianyi
    Tillmann, Andreas M.
    Yang, Yang
    Eldar, Yonina C.
    Pesavento, Marius
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5619 - 5623
  • [6] Sequence Learning using the Adaptive Suffix Trie Algorithm
    Gunasinghe, Upuli
    Alahakoon, Damminda
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [7] Scientific document retrieval using structure encoded string with trie indexing
    Dhar, Sourish
    Roy, Sudipta
    Paul, Arnab
    Information Services and Use, 2022, 42 (02): : 241 - 259
  • [8] A key search algorithm using the compact Patricia Trie
    Shishibori, M
    Ando, K
    Okada, M
    Aoe, J
    1997 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT PROCESSING SYSTEMS, VOLS 1 & 2, 1997, : 1581 - 1584
  • [9] AN EFFICIENT IMPLEMENTATION OF TRIE STRUCTURES
    AOE, JI
    MORIMOTO, K
    SATO, T
    SOFTWARE-PRACTICE & EXPERIENCE, 1992, 22 (09): : 695 - 721
  • [10] An Efficient IP Address Lookup Algorithm Using a Priority Trie
    Lim, Hyesook
    Mun, Ju Hyoung
    GLOBECOM 2006 - 2006 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2006,