On the complexity of optimal grammar-based compression

被引:0
|
作者
Arpe, Jan [1 ]
Reischuk, R. diger [1 ]
机构
[1] Univ Lubeck, Inst Theoret Informat, Ratzeburger Allee 160, D-23538 Lubeck, Germany
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a string, the task of grammar-based compression is to find a small context-free grammar that generates exactly that string. We investigate the relationship between grammar-based compression of strings over unbounded and bounded alphabets. Specifically, we show how to transform a grammar for a string over an unbounded alphabet into a grammar for a block coding of that string over a fixed bounded alphabet and vice versa. From these constructions, we obtain asymptotically tight relationships between the minimum grammar sizes for strings and their block codings. Furthermore, we exploit an improved bound of our construction for overlap-free block codings to show that a polynomial time algorithm for approximating the minimum grammar for binary strings within a factor of c yields a polynomial time algorithm for approximating the minimum grammar for strings over arbitrary alphabets within a factor of 24c + epsilon (for arbitrary epsilon > 0). Currently, the latter problem is known to be NP-hard to approximate within a factor of 8569/8568. Since there is some hope to prove a nonconstant lower bound, our results may provide a first step towards solving the long standing open question whether minimum grammar-based compression of binary strings is NP-complete.
引用
收藏
页码:173 / +
页数:2
相关论文
共 50 条
  • [21] Scalable Detection of Frequent Substrings by Grammar-Based Compression
    Nakahara, Masaya
    Maruyama, Shirou
    Kuboyama, Tetsuji
    Sakamoto, Hiroshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (03): : 457 - 464
  • [22] Hypergraph Grammar-Based Model of Adaptive Bitmap Compression
    Solinski, Grzegorz
    Wozniak, Maciej
    Ryzner, Jakub
    Mosialek, Albert
    Paszynska, Anna
    COMPUTATIONAL SCIENCE - ICCS 2020, PT III, 2020, 12139 : 118 - 131
  • [23] Scalable Detection of Frequent Substrings by Grammar-Based Compression
    Nakahara, Masaya
    Maruyama, Shirou
    Kuboyama, Tetsuji
    Sakamoto, Hiroshi
    DISCOVERY SCIENCE, 2011, 6926 : 236 - +
  • [24] A Space-Saving Approximation Algorithm for Grammar-Based Compression
    Sakamoto, Hiroshi
    Maruyama, Shirou
    Kida, Takuya
    Shimozono, Shinichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (02): : 158 - 165
  • [25] A Universal Grammar-Based Code for Lossless Compression of Binary Trees
    Zhang, Jie L.
    Yang, En-Hui
    Kieffer, John C.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2014, 60 (03) : 1373 - 1386
  • [26] Application of grammar-based codes for lossless compression of digital mammograms
    Li, Xiaoli
    Krishnan, Sridhar
    Ma, Ngok-Wah
    JOURNAL OF ELECTRONIC IMAGING, 2006, 15 (01)
  • [27] Universal Tree Source Coding Using Grammar-Based Compression
    Hucke, Danny
    Lohrey, Markus
    2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 1753 - 1757
  • [28] An effective grammar-based compression algorithm for tree structured data
    Yamagata, K
    Uchida, T
    Shoudai, T
    Nakamura, Y
    INDUCTIVE LOGIC PROGRAMMING, PROCEEDINGS, 2003, 2835 : 383 - 400
  • [29] Grammar-based compression and its use in symbolic music analysis
    Mondol, Tiasa
    Brown, Daniel G.
    JOURNAL OF MATHEMATICS AND MUSIC, 2023, 17 (01) : 133 - 150
  • [30] Universal Tree Source Coding Using Grammar-Based Compression
    Ganardi, Moses
    Hucke, Danny
    Lohrey, Markus
    Benkner, Louisa Seelbach
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (10) : 6399 - 6413