A DICTIONARY RETRIEVAL ALGORITHM USING 2 TRIE STRUCTURES

Cited by: 1
Authors
MORIMOTO, K
IRIGUCHI, H
AOE, J
Affiliation
[1] Faculty of Engineering, Tokushima University, Tokushima
Keywords
NATURAL LANGUAGE PROCESSING; TRIE STRUCTURE; SEARCHING TECHNIQUE; DICTIONARY SEARCHING
DOI
10.1002/scj.4690260209
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
A trie is searched one character of the key at a time, so retrieval is fast and its cost is independent of the total number of keys. Consequently, it is frequently used for searching natural language dictionaries and in other problems. A problem, however, is that the number of trie states grows as the key set is enlarged, which requires more memory. To remedy this, the DAWG (Directed Acyclic Word Graph) has been proposed, in which the common suffixes of the trie are compressed. This raises a new problem: the record information can no longer be determined uniquely for a key. To address it, this paper introduces a new structure in which the number of states is reduced by merging the common suffixes of the trie, while the record information for each key is still determined uniquely. Algorithms for retrieval, insertion, and deletion of keys are proposed for the structure. In the proposed method, the set of keys is represented using two tries. One trie stores, for each key, the shortest prefix that distinguishes it uniquely from the other keys. The other trie stores the remaining suffixes of the keys so that common suffixes can be merged. A simulation was run on various key sets, including Chinese characters (Kanji), alphabetic words, and Japanese Katakana, and showed that the number of states is reduced by approximately 30 to 65 percent for key sets of 50,000 words, compared with an ordinary trie.
Pages: 85-97
Page count: 13
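Illustrative sketch
The two-trie idea described in the abstract can be illustrated with a small static example. This is a minimal sketch under stated assumptions, not the paper's algorithm: the names `RearTrie`, `build_two_trie`, `lookup`, and `count_states`, the `#` end marker, and the choice to store suffixes reversed with parent links are all hypothetical, and insertion and deletion after construction are not covered.

```python
END = "#"  # assumed end marker (must not occur in keys) so no key is a prefix of another


class RearTrie:
    """Holds the remaining suffixes, inserted in reverse so common endings share states."""

    def __init__(self):
        self.parent = [None]  # parent state of each state; state 0 is the root
        self.label = [None]   # character on the edge leading into each state
        self.children = [{}]

    def insert_reversed(self, suffix):
        state = 0
        for ch in reversed(suffix):
            nxt = self.children[state].get(ch)
            if nxt is None:
                nxt = len(self.parent)
                self.parent.append(state)
                self.label.append(ch)
                self.children.append({})
                self.children[state][ch] = nxt
            state = nxt
        return state  # state where the suffix starts when read forwards

    def matches(self, state, text):
        # Follow parent links towards the root, which reads the suffix forwards.
        for ch in text:
            if state == 0 or self.label[state] != ch:
                return False
            state = self.parent[state]
        return state == 0


def build_two_trie(records):
    keys = {key + END: rec for key, rec in records.items()}
    counts = {}  # number of keys passing through each non-empty prefix
    for key in keys:
        for i in range(1, len(key) + 1):
            counts[key[:i]] = counts.get(key[:i], 0) + 1
    front, rear = {}, RearTrie()
    for key, rec in keys.items():
        # Shortest prefix that belongs to this key only.
        cut = next(i for i in range(1, len(key) + 1) if counts[key[:i]] == 1)
        node = front
        for ch in key[:cut - 1]:
            node = node.setdefault(ch, {})
        # A front-trie leaf keeps the link into the rear trie and the record,
        # so the record stays unique per key even though suffixes are shared.
        node[key[cut - 1]] = (rear.insert_reversed(key[cut:]), rec)
    return front, rear, len(counts)  # len(counts) = non-root states of an ordinary trie


def lookup(front, rear, key):
    text = key + END
    node = front
    for i, ch in enumerate(text):
        nxt = node.get(ch)
        if nxt is None:
            return None
        if isinstance(nxt, tuple):  # reached a front-trie leaf
            rear_state, rec = nxt
            return rec if rear.matches(rear_state, text[i + 1:]) else None
        node = nxt
    return None


def count_states(node):
    # Non-root states of the front trie (leaves included).
    return sum(1 + (count_states(c) if isinstance(c, dict) else 0) for c in node.values())


front, rear, ordinary = build_two_trie({"station": 1, "nation": 2, "ration": 3, "rate": 4})
print(lookup(front, rear, "ration"), lookup(front, rear, "rat"))
print(ordinary, count_states(front) + len(rear.parent) - 1)
```

In this toy key set the shared ending "ation" is stored once in the rear trie, so the combined state count falls below that of the ordinary trie, while each record remains attached to exactly one front-trie leaf.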