Approximation algorithms for grammar-based compression

Authors
Lehman, E [1]
Shelat, A [1]
Affiliations
[1] MIT, Comp Sci Lab, Cambridge, MA 02141 USA
DOI
Not available
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Several recently proposed data compression algorithms are based on the idea of representing a string by a context-free grammar. Most of these algorithms are known to be asymptotically optimal with respect to a stationary ergodic source and to achieve a low redundancy rate. However, such results do not reveal how effectively these algorithms exploit the grammar model itself; that is, are the compressed strings produced as small as possible? We address this issue by analyzing the approximation ratio of several algorithms, that is, the maximum ratio between the size of the generated grammar and the size of the smallest possible grammar, taken over all inputs. On the negative side, we show that every polynomial-time grammar-compression algorithm has approximation ratio at least 8569/8568 unless P = NP. Moreover, achieving an approximation ratio of o(log n / log log n) would require progress on an algebraic problem in a well-studied area. We then give upper and lower bounds on the approximation ratios of the following four previously proposed grammar-based compression algorithms: SEQUENTIAL, BISECTION, GREEDY, and LZ78, each of which employs a distinct approach to compression. These results seem to indicate that there is much room to improve grammar-based compression algorithms.
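To make the idea of "representing a string by a context-free grammar" concrete, the following is a minimal Python sketch in the spirit of a bisection-style compressor. It is an illustration under stated assumptions, not the authors' implementation: the splitting rule (cut at the largest power of two strictly smaller than the length), the size measure (total number of symbols on all right-hand sides), and the names `bisection_grammar`, `expand`, and `grammar_size` are all assumptions introduced here.

```python
def bisection_grammar(s):
    """Build a grammar that derives exactly the string s by recursive halving.

    Assumption: each substring is split at the largest power of two strictly
    smaller than its length, and identical substrings share one nonterminal.
    Terminals are single characters; nonterminals are named "N0", "N1", ...
    """
    if not s:
        raise ValueError("input string must be non-empty")
    rules = {}   # nonterminal name -> tuple of right-hand-side symbols
    names = {}   # substring -> nonterminal that already derives it

    def symbol_for(sub):
        if len(sub) == 1:
            return sub              # a single character is a terminal
        if sub in names:
            return names[sub]       # reuse the rule for a repeated substring
        k = 1
        while k * 2 < len(sub):     # largest power of two < len(sub)
            k *= 2
        left = symbol_for(sub[:k])
        right = symbol_for(sub[k:])
        name = f"N{len(rules)}"     # fresh name, chosen after the children exist
        rules[name] = (left, right)
        names[sub] = name
        return name

    return symbol_for(s), rules


def expand(symbol, rules):
    """Derive the string produced by a symbol (the decompression step)."""
    if symbol in rules:
        return "".join(expand(x, rules) for x in rules[symbol])
    return symbol


def grammar_size(rules):
    """Assumed size measure: total number of symbols on all right-hand sides."""
    return sum(len(rhs) for rhs in rules.values())


start, rules = bisection_grammar("abababab")
print(rules)                 # rules N0, N1, N2 deriving "ab", "abab", "abababab"
print(grammar_size(rules))   # 6 symbols for an input of length 8
```

The approximation ratio discussed in the abstract compares a quantity like `grammar_size(rules)` against the size of the smallest grammar for the same input, maximized over all inputs; computing that smallest grammar is the hard part and is not attempted here.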
Pages: 205-212
Page count: 8
Related papers
50 in total
  • [1] Approximation of Grammar-Based Compression via Recompression
    Jez, Artur
    [J]. COMBINATORIAL PATTERN MATCHING, 2013, 7922 : 165 - 176
  • [2] Approximation of grammar-based compression via recompression
    Jez, Artur
    [J]. THEORETICAL COMPUTER SCIENCE, 2015, 592 : 115 - 134
  • [3] A Space-Saving Approximation Algorithm for Grammar-Based Compression
    Sakamoto, Hiroshi
    Maruyama, Shirou
    Kida, Takuya
    Shimozono, Shinichi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (02): : 158 - 165
  • [4] Grammar-Based Tree Compression
    Lohrey, Markus
    [J]. DEVELOPMENTS IN LANGUAGE THEORY (DLT 2015), 2015, 9168 : 46 - 57
  • [5] Grammar-based graph compression
    Maneth, Sebastian
    Peternek, Fabian
    [J]. INFORMATION SYSTEMS, 2018, 76 : 19 - 45
  • [6] Application of Lempel-Ziv factorization to the approximation of grammar-based compression
    Rytter, W
    [J]. COMBINATORIAL PATTERN MATCHING, 2002, 2373 : 20 - 31
  • [7] Application of Lempel-Ziv factorization to the approximation of grammar-based compression
    Rytter, W
    [J]. THEORETICAL COMPUTER SCIENCE, 2003, 302 (1-3) : 211 - 222
  • [8] A fully linear-time approximation algorithm for grammar-based compression
    Sakamoto, Hiroshi
    [J]. JOURNAL OF DISCRETE ALGORITHMS, 2005, 3 (2-4) : 416 - 430
  • [9] A fully linear-time approximation algorithm for grammar-based compression
    Sakamoto, H
    [J]. COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2003, 2676 : 348 - 360
  • [10] Grammar-Based Compression of Unranked Trees
    Gascon, Adria
    Lohrey, Markus
    Maneth, Sebastian
    Reh, Carl Philipp
    Siebert, Kurt
    [J]. COMPUTER SCIENCE - THEORY AND APPLICATIONS, CSR 2018, 2018, 10846 : 118 - 131