Survey of Grammar-Based Data Structure Compression

被引:0
|
作者
Kieffer, John C. [1 ]
Yang, En-Hui [2 ]
机构
[1] University of Minnesota, Minneapolis,MN,55455, United States
[2] University of Waterloo, Waterloo,ON,N2L 3G1, Canada
来源
关键词
Compressors - Context free grammars - Data structures;
D O I
10.1109/MBITS.2022.3210891
中图分类号
学科分类号
摘要
A data string can be represented with the help of context-free grammar such that the string is the unique string belonging to the language of the grammar. One can then losslessly compress the string indirectly by encoding the grammar into a unique binary codeword. This approach to data compression, called grammar-based data compression, can also be employed to losslessly compress graphical data structures, which are graphs in which every vertex carries a data label. Under mild restrictions, grammar-based data compression schemes are universal compressors, meaning that they perform at least as well as any finite-state compression scheme. Some of the theory of universal grammar-based compressors is surveyed. Applications of grammar-based compressors to various areas, such as bioinformatics and data networks, are discussed. Future directions for grammar-based compression research are outlined, including compression issues arising in highly repetitive databases and issues concerning the compression of sparse graphical data. © 2021 IEEE.
引用
下载
收藏
页码:19 / 35
相关论文
共 50 条
  • [1] Grammar-Based Tree Compression
    Lohrey, Markus
    [J]. DEVELOPMENTS IN LANGUAGE THEORY (DLT 2015), 2015, 9168 : 46 - 57
  • [2] Grammar-based graph compression
    Maneth, Sebastian
    Peternek, Fabian
    [J]. INFORMATION SYSTEMS, 2018, 76 : 19 - 45
  • [3] Grammar-Based Compression of Unranked Trees
    Gascon, Adria
    Lohrey, Markus
    Maneth, Sebastian
    Reh, Carl Philipp
    Siebert, Kurt
    [J]. COMPUTER SCIENCE - THEORY AND APPLICATIONS, CSR 2018, 2018, 10846 : 118 - 131
  • [4] An effective grammar-based compression algorithm for tree structured data
    Yamagata, K
    Uchida, T
    Shoudai, T
    Nakamura, Y
    [J]. INDUCTIVE LOGIC PROGRAMMING, PROCEEDINGS, 2003, 2835 : 383 - 400
  • [5] Grammar-Based Compression of Unranked Trees
    Gascon, Adria
    Lohrey, Markus
    Maneth, Sebastian
    Reh, Carl Philipp
    Sieber, Kurt
    [J]. THEORY OF COMPUTING SYSTEMS, 2020, 64 (01) : 141 - 176
  • [6] Grammar-Based Compression of Unranked Trees
    Adrià Gascón
    Markus Lohrey
    Sebastian Maneth
    Carl Philipp Reh
    Kurt Sieber
    [J]. Theory of Computing Systems, 2020, 64 : 141 - 176
  • [7] Grammar-based compression of interpreted code
    Evans, WS
    Fraser, CW
    [J]. COMMUNICATIONS OF THE ACM, 2003, 46 (08) : 61 - 66
  • [8] On the complexity of optimal grammar-based compression
    Arpe, Jan
    Reischuk, R. diger
    [J]. DCC 2006: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2006, : 173 - +
  • [9] Grammar-Based Compression in a Streaming Model
    Gagie, Travis
    Gawrychowski, Pawel
    [J]. LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS, 2010, 6031 : 273 - +
  • [10] Approximation algorithms for grammar-based compression
    Lehman, E
    Shelat, A
    [J]. PROCEEDINGS OF THE THIRTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2002, : 205 - 212