Approximation algorithms for grammar-based compression

被引:0
|
作者
Lehman, E [1 ]
Shelat, A [1 ]
机构
[1] MIT, Comp Sci Lab, Cambridge, MA 02141 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Several recently-proposed data compression algorithms are based on the idea of representing a string by a context-free grammar. Most of these algorithms are known to be asymptotically optimal with respect to a stationary ergodic source and to achieve a low redundancy rate. However, such results do not reveal how effectively these algorithms exploit the grammar-model itself; that is, are the compressed strings produced as small as possible? We address this issue by analyzing the approximation ratio of several algorithms, that is, the maximum ratio between the size of the generated grammar and the smallest possible grammar over all inputs. On the negative side, we show that every polynomial-time grammar-compression algorithm has approximation ratio at least 8569/8568 unless P = NP. Moreover, achieving an approximation ratio of o(log n/ log log n) would require progress on an algebraic problem in a well-studied area. We then upper and lower bound approximation ratios for the following four previously-proposed gram-mar-based compression algorithms: SEQUENTIAL, BISECTION, GREEDY, and LZ78, each of which employs a distinct approach to compression. These results seem to indicate that there is much room to improve grammar-based compression algorithms.
引用
收藏
页码:205 / 212
页数:8
相关论文
共 50 条
  • [21] A Quadsection Algorithm for Grammar-Based Image Compression
    Hayashida, Morihiro
    Ruan, Peiying
    Akutsu, Tatsuya
    [J]. FUTURE GENERATION INFORMATION TECHNOLOGY, 2010, 6485 : 234 - 248
  • [23] Scalable Detection of Frequent Substrings by Grammar-Based Compression
    Nakahara, Masaya
    Maruyama, Shirou
    Kuboyama, Tetsuji
    Sakamoto, Hiroshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (03): : 457 - 464
  • [24] Hypergraph Grammar-Based Model of Adaptive Bitmap Compression
    Solinski, Grzegorz
    Wozniak, Maciej
    Ryzner, Jakub
    Mosialek, Albert
    Paszynska, Anna
    [J]. COMPUTATIONAL SCIENCE - ICCS 2020, PT III, 2020, 12139 : 118 - 131
  • [25] Scalable Detection of Frequent Substrings by Grammar-Based Compression
    Nakahara, Masaya
    Maruyama, Shirou
    Kuboyama, Tetsuji
    Sakamoto, Hiroshi
    [J]. DISCOVERY SCIENCE, 2011, 6926 : 236 - +
  • [26] A Universal Grammar-Based Code for Lossless Compression of Binary Trees
    Zhang, Jie L.
    Yang, En-Hui
    Kieffer, John C.
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2014, 60 (03) : 1373 - 1386
  • [27] Application of grammar-based codes for lossless compression of digital mammograms
    Li, Xiaoli
    Krishnan, Sridhar
    Ma, Ngok-Wah
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2006, 15 (01)
  • [28] Constrained Level Generation Through Grammar-Based Evolutionary Algorithms
    Font, Jose M.
    Izquierdo, Roberto
    Manrique, Daniel
    Togelius, Julian
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2016, PT I, 2016, 9597 : 558 - 573
  • [29] Universal Tree Source Coding Using Grammar-Based Compression
    Hucke, Danny
    Lohrey, Markus
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 1753 - 1757
  • [30] An effective grammar-based compression algorithm for tree structured data
    Yamagata, K
    Uchida, T
    Shoudai, T
    Nakamura, Y
    [J]. INDUCTIVE LOGIC PROGRAMMING, PROCEEDINGS, 2003, 2835 : 383 - 400