Trie Compact Representation using Double-array Structures with String Labels

被引:0
|
作者
Kanda, Shunsuke [1 ]
Fuketa, Masao [1 ]
Morita, Kazuhiro [1 ]
Aoe, Jun-ichi [1 ]
机构
[1] Univ Tokushima, Dept Informat Sci & Intelligent Syst, Tokushima, Japan
关键词
Trie; Double-array; String Label; Information retrieval; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A trie is an ordered tree structure with a character on each edge. The trie provides efficient storing and retrieval of a keyword set in processing strings. The double-array has been proposed to provide fast retrieval in the trie. As its space usage depends on the number of trie nodes, the space usage decreases by reducing nodes. To reduce the number of trie nodes, an Minimal-Prefix (MP) trie and a double-trie have been proposed, and the double-array can represent these data structures efficiently. On the other hand, the data structures include many nodes that can be reduced by giving a string label to each edge. However, the double-array with string labels has not been proposed. This paper proposes a new double-array with string labels by using multiple arrays depending on label sizes. Moreover, we show its effectiveness by experiments.
引用
收藏
页码:3 / 8
页数:6
相关论文
共 29 条
  • [1] A Fast and Compact Language Model Implementation Using Double-Array Structures
    Norimatsu, Jun-Ya
    Yasuhara, Makoto
    Tanaka, Toru
    Yamamoto, Mikio
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2016, 15 (04)
  • [2] Research of Chinese word segmentation based on Double-Array Trie
    School of Computer and Communication, Hunan Univ., Changsha 410082, China
    [J]. Hunan Daxue Xuebao, 2009, 5 (77-80):
  • [3] Study for the Double-array Trie Tree Based Algorithm in Word Segmentation
    Yang, Wenchuan
    Fang, Zeyang
    Li, Pengfei
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENVIRONMENTAL ENGINEERING (CSEE 2015), 2015, : 440 - 446
  • [4] A compression method of double-array structures using linear functions
    Kanda, Shunsuke
    Fuketa, Masao
    Morita, Kazuhiro
    Aoe, Jun-ichi
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 48 (01) : 55 - 80
  • [5] A compression method of double-array structures using linear functions
    Shunsuke Kanda
    Masao Fuketa
    Kazuhiro Morita
    Jun-ichi Aoe
    [J]. Knowledge and Information Systems, 2016, 48 : 55 - 80
  • [6] A compact static double-array keeping character codes
    Yata, Susumu
    Oono, Masaki
    Morita, Kazuhiro
    Fuketa, Masao
    Sumitomo, Toru
    Aoe, Jun-ichi
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (01) : 237 - 247
  • [7] Fast and compact updating algorithms of a double-array structure
    Morita, K
    Atlam, ES
    Fuketa, M
    Tsuda, K
    Aoe, J
    [J]. INFORMATION SCIENCES, 2004, 159 (1-2) : 53 - 67
  • [8] Research of an Improved Algorithm for Chinese Word Segmentation Dictionary Based on Double-Array Trie Tree
    Yang, Wenchuan
    Liu, Jian
    Yu, Miao
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 355 - 362
  • [9] Compressed double-array tries for string dictionaries supporting fast lookup
    Shunsuke Kanda
    Kazuhiro Morita
    Masao Fuketa
    [J]. Knowledge and Information Systems, 2017, 51 : 1023 - 1042
  • [10] Compressed double-array tries for string dictionaries supporting fast lookup
    Kanda, Shunsuke
    Morita, Kazuhiro
    Fuketa, Masao
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (03) : 1023 - 1042