Survey of Grammar-Based Data Structure Compression

被引:0
|
作者
Kieffer, John C. [1 ]
Yang, En-Hui [2 ]
机构
[1] University of Minnesota, Minneapolis,MN,55455, United States
[2] University of Waterloo, Waterloo,ON,N2L 3G1, Canada
来源
关键词
Compressors - Context free grammars - Data structures;
D O I
10.1109/MBITS.2022.3210891
中图分类号
学科分类号
摘要
A data string can be represented with the help of context-free grammar such that the string is the unique string belonging to the language of the grammar. One can then losslessly compress the string indirectly by encoding the grammar into a unique binary codeword. This approach to data compression, called grammar-based data compression, can also be employed to losslessly compress graphical data structures, which are graphs in which every vertex carries a data label. Under mild restrictions, grammar-based data compression schemes are universal compressors, meaning that they perform at least as well as any finite-state compression scheme. Some of the theory of universal grammar-based compressors is surveyed. Applications of grammar-based compressors to various areas, such as bioinformatics and data networks, are discussed. Future directions for grammar-based compression research are outlined, including compression issues arising in highly repetitive databases and issues concerning the compression of sparse graphical data. © 2021 IEEE.
引用
下载
收藏
页码:19 / 35
相关论文
共 50 条
  • [21] RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure
    Qi Liu
    Yu Yang
    Chun Chen
    Jiajun Bu
    Yin Zhang
    Xiuzi Ye
    BMC Bioinformatics, 9
  • [22] RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure
    Liu, Qi
    Yang, Yu
    Chen, Chun
    Bu, Jiajun
    Zhang, Yin
    Ye, Xiuzi
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [24] Scalable Detection of Frequent Substrings by Grammar-Based Compression
    Nakahara, Masaya
    Maruyama, Shirou
    Kuboyama, Tetsuji
    Sakamoto, Hiroshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (03): : 457 - 464
  • [25] Hypergraph Grammar-Based Model of Adaptive Bitmap Compression
    Solinski, Grzegorz
    Wozniak, Maciej
    Ryzner, Jakub
    Mosialek, Albert
    Paszynska, Anna
    COMPUTATIONAL SCIENCE - ICCS 2020, PT III, 2020, 12139 : 118 - 131
  • [26] Scalable Detection of Frequent Substrings by Grammar-Based Compression
    Nakahara, Masaya
    Maruyama, Shirou
    Kuboyama, Tetsuji
    Sakamoto, Hiroshi
    DISCOVERY SCIENCE, 2011, 6926 : 236 - +
  • [27] THE DESIGN AND IMPLEMENTATION OF A GRAMMAR-BASED DATA GENERATOR
    MAURER, PM
    SOFTWARE-PRACTICE & EXPERIENCE, 1992, 22 (03): : 223 - 244
  • [28] A Space-Saving Approximation Algorithm for Grammar-Based Compression
    Sakamoto, Hiroshi
    Maruyama, Shirou
    Kida, Takuya
    Shimozono, Shinichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (02): : 158 - 165
  • [29] A Universal Grammar-Based Code for Lossless Compression of Binary Trees
    Zhang, Jie L.
    Yang, En-Hui
    Kieffer, John C.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2014, 60 (03) : 1373 - 1386
  • [30] Application of grammar-based codes for lossless compression of digital mammograms
    Li, Xiaoli
    Krishnan, Sridhar
    Ma, Ngok-Wah
    JOURNAL OF ELECTRONIC IMAGING, 2006, 15 (01)