Survey of Grammar-Based Data Structure Compression

Cited by: 0
Authors
Kieffer, John C. [1]
Yang, En-Hui [2]
Affiliations
[1] University of Minnesota, Minneapolis, MN 55455, United States
[2] University of Waterloo, Waterloo, ON N2L 3G1, Canada
Source
Keywords
Compressors; Context free grammars; Data structures
DOI
10.1109/MBITS.2022.3210891
Chinese Library Classification number
Subject classification number
Abstract
A data string can be represented with the help of a context-free grammar such that the string is the unique string belonging to the language of the grammar. One can then losslessly compress the string indirectly by encoding the grammar into a unique binary codeword. This approach to data compression, called grammar-based data compression, can also be employed to losslessly compress graphical data structures, which are graphs in which every vertex carries a data label. Under mild restrictions, grammar-based data compression schemes are universal compressors, meaning that they perform at least as well as any finite-state compression scheme. Some of the theory of universal grammar-based compressors is surveyed. Applications of grammar-based compressors to various areas, such as bioinformatics and data networks, are discussed. Future directions for grammar-based compression research are outlined, including compression issues arising in highly repetitive databases and issues concerning the compression of sparse graphical data. © 2021 IEEE.
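The core idea in the abstract, a grammar whose language contains exactly one string, can be illustrated with a minimal sketch. The code below is not the scheme surveyed in the paper; it assumes a simplified greedy pair-replacement heuristic in the spirit of Re-Pair, and the function names `build_grammar`, `expand`, and `decode` are hypothetical. It stops at the grammar itself and does not perform the binary encoding step.

```python
from collections import Counter


def build_grammar(text):
    """Greedy pair replacement (a simplified, Re-Pair-like heuristic).

    Terminals are single-character strings; nonterminals are integers.
    Returns (start_sequence, rules), where rules maps each nonterminal
    to the pair of symbols it derives.
    """
    seq = list(text)
    rules = {}
    next_id = 0
    while len(seq) >= 2:
        # Count adjacent symbol pairs (overlapping occurrences included).
        pairs = Counter(zip(seq, seq[1:]))
        pair, count = pairs.most_common(1)[0]
        if count < 2:
            break  # no pair repeats, so a new rule would not help
        rules[next_id] = pair
        # Rewrite the sequence left to right, replacing non-overlapping
        # occurrences of the chosen pair with the new nonterminal.
        out, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(next_id)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
        next_id += 1
    return seq, rules


def expand(symbol, rules):
    """Derive the terminal string generated by a single symbol."""
    if isinstance(symbol, str):
        return symbol
    left, right = rules[symbol]
    return expand(left, rules) + expand(right, rules)


def decode(seq, rules):
    """Recover the unique string in the language of the grammar."""
    return "".join(expand(s, rules) for s in seq)


if __name__ == "__main__":
    start, rules = build_grammar("abracadabra abracadabra")
    print("start sequence:", start)
    print("rules:", rules)
    print("round trip ok:", decode(start, rules) == "abracadabra abracadabra")
```

Because every nonterminal has exactly one rule and the rules are acyclic, the grammar derives a single string, which is what makes the representation lossless. A full compressor would additionally encode `start` and `rules` into a binary codeword.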
Pages: 19-35