Online Grammar Transformation based on Re-Pair Algorithm

Cited by: 7
Authors
Masaki, Takuya [1]
Kida, Takuya [1]
Affiliation
[1] Hokkaido Univ, Kita Ku, Kita 14 Jo,Nishi 9 Chome, Sapporo, Hokkaido 0600814, Japan
Source
2016 DATA COMPRESSION CONFERENCE (DCC) | 2016
DOI
10.1109/DCC.2016.69
Chinese Library Classification number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
The Re-Pair algorithm (Re-Pair), proposed by Larsson and Moffat, is a simple grammar-based compression method that achieves a good compression ratio. Although Re-Pair runs in $O(n)$ time and space for an input of length $n$, it cannot be used with a large input, because it runs offline and consumes substantial memory space. In this paper, we propose an online grammar transformation algorithm based on a modified Re-Pair along with a compression method using the algorithm. The proposed algorithm runs in $O(n \log \hat{h})$ time using $O(g)$ space, where $g$ and $\hat{h}$ are the number of production rules in a grammar and the maximum height of syntax trees generated by the rules, respectively. We implemented our method and demonstrated that it significantly reduces memory usage with little sacrifice of compression ratio in comparison with the original Re-Pair.
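For orientation, the basic offline Re-Pair idea that the abstract builds on (repeatedly replace the most frequent pair of adjacent symbols with a new nonterminal until no pair occurs twice) can be sketched as follows. This is a naive illustration only: it is neither the linear-time implementation of Larsson and Moffat nor the online algorithm proposed in the paper, and the names `repair`, `expand`, and `height` are hypothetical helpers introduced here.

```python
from collections import Counter


def repair(text):
    """Naive offline Re-Pair sketch (quadratic time, illustration only)."""
    seq = list(text)
    rules = {}  # nonterminal -> (left symbol, right symbol)
    next_id = 0
    while True:
        # Count adjacent pairs. Overlapping occurrences (e.g. in "aaa")
        # are counted too, which is a simplification of the real method.
        pairs = Counter(zip(seq, seq[1:]))
        if not pairs:
            break
        pair, count = pairs.most_common(1)[0]
        if count < 2:
            break
        # Nonterminals are tuples so they cannot collide with characters.
        nonterminal = ("R", next_id)
        next_id += 1
        rules[nonterminal] = pair
        # Replace occurrences of the pair, scanning left to right.
        out = []
        i = 0
        while i < len(seq):
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(nonterminal)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
    return seq, rules


def expand(symbol, rules):
    """Expand a symbol back into the string it derives."""
    if symbol in rules:
        left, right = rules[symbol]
        return expand(left, rules) + expand(right, rules)
    return symbol


def height(symbol, rules):
    """Height of the syntax tree rooted at a symbol (terminals have height 0)."""
    if symbol in rules:
        left, right = rules[symbol]
        return 1 + max(height(left, rules), height(right, rules))
    return 0
```

Calling `repair("abracadabra")` returns a shorter symbol sequence plus a rule table, and joining `expand(s, rules)` over that sequence restores the input. The largest `height` over all rules corresponds to the quantity written $\hat{h}$ in the abstract, and `len(rules)` to $g$.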
Pages: 349-358
Page count: 10
Related papers
50 in total
  • [1] Improvements on Re-Pair grammar compressor
    Ganczorz, Michal
    Jez, Artur
    2017 DATA COMPRESSION CONFERENCE (DCC), 2017, : 181 - 190
  • [2] Adaptive Dictionary Sharing Method for Re-Pair Algorithm
    Sekine, Kei
    Sasakawa, Hirohito
    Yoshida, Satoshi
    Kida, Takuya
    2014 DATA COMPRESSION CONFERENCE (DCC 2014), 2014, : 425 - 425
  • [3] Re-Pair in Small Space
    Koppl, Dominik
    Tomohiro, I
    Furuya, Isamu
    Takabatake, Yoshimasa
    Sakai, Kensuke
    Goto, Keisuke
    ALGORITHMS, 2021, 14 (01) : 1 - 20
  • [4] Re-Pair in Small Space
    Koppl, Dominik
    Tomohiro, I
    Furuya, Isamu
    Takabatake, Yoshimasa
    Sakai, Kensuke
    Goto, Keisuke
    2020 DATA COMPRESSION CONFERENCE (DCC 2020), 2020, : 377 - 377
  • [5] soft matter Re-pair
    Whitehead, Simon
    PERFORMANCE RESEARCH, 2022, 27 (02) : 46 - 47
  • [6] Space-Efficient Re-Pair Compression
    Bille, Philip
    Gortz, Inge Li
    Prezza, Nicola
    2017 DATA COMPRESSION CONFERENCE (DCC), 2017, : 171 - 180
  • [7] Effective Variable-Length-to-Fixed-Length Coding via a Re-Pair Algorithm
    Yoshida, Satoshi
    Kida, Takuya
    2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 532 - 532
  • [8] Re-pair achieves high-order entropy
    Navarro, Gonzalo
    Russo, Luis
    DCC: 2008 DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2008, : 537 - 537
  • [9] Variable-to-Fixed-Length Encoding for Large Texts Using Re-Pair Algorithm with Shared Dictionaries
    Sekine, Kei
    Sasakawa, Hirohito
    Yoshida, Satoshi
    Kida, Takuya
    2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 518 - 518
  • [10] ADMINISTRATION OF OMNI BIOTIC STRESS RE-PAIR IN CHILDREN'S AGE
    Barzashka, Evgenia
    Valova, Teodora
    JOURNAL OF IMAB, 2022, 28 (03): : 4562 - 4565