Revisiting dictionary-based compression

被引:26
|
作者
Skibinski, P
Grabowski, S
Deorowicz, S
机构
[1] Tech Univ Lodz, Dept Comp Engn, PL-90924 Lodz, Poland
[2] Univ Wroclaw, Inst Comp Sci, PL-51151 Wroclaw, Poland
[3] Silesian Tech Univ, Inst Comp Sci, PL-44100 Gliwice, Poland
来源
SOFTWARE-PRACTICE & EXPERIENCE | 2005年 / 35卷 / 15期
关键词
lossless data compression; preprocessing; text compression; dictionary compression;
D O I
10.1002/spe.678
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
An attractive way to increase text compression is to replace words with references to a text dictionary given in advance. Although there exist a few works in this area, they do not fully exploit the compression possibilities or consider alternative preprocessing variants for various compressors in the latter phase. In this paper, we discuss several aspects of dictionary-based compression, including compact dictionary representation, and present a PPM/BWCA-oriented scheme, word replacing transformation, achieving compression ratios higher by 2-6% than the state-of-the-art StarNT (2003) text preprocessor, working at a greater speed. We also present an alternative scheme designed for LZ77 compressors, with the advantage over StarNT of reaching up to 14% in combination with gzip. Copyright (c) 2005 John Wiley & Sons, Ltd.
引用
收藏
页码:1455 / 1476
页数:22
相关论文
共 50 条
  • [21] Template vertical dictionary-based program compression scheme on the TTA
    Lai, Mingche
    Wang, Zhiying
    Guo, JianJun
    Kui, Dai
    Li, Shen
    [J]. INTEGRATED CIRCUIT AND SYSTEM DESIGN: POWER AND TIMING MODELING, OPTIMIZATION AND SIMULATION, 2007, 4644 : 43 - +
  • [22] Two-Level Dictionary-Based Text Compression Scheme
    Zia, Md. Ziaul Karim
    Rahman, Dewan Md. Fayzur
    Rahman, Chowdhury Mofizur
    [J]. 2008 11TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY: ICCIT 2008, VOLS 1 AND 2, 2008, : 569 - 574
  • [23] ETAOSD: Static Dictionary-Based Transformation Method for Text Compression
    Baloul, Fadlelmoula Mohamed
    Abdullah, Mohsin Hassan
    Babikir, Elsadig Ahmed
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONICS ENGINEERING (ICCEEE), 2013, : 384 - 389
  • [24] Dictionary-based English text compression using word endings
    Yang, Jeehong
    Savari, Serap A.
    [J]. DCC 2007: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2007, : 410 - 410
  • [25] A fast decoding algorithm for dictionary-based text compression system
    Wong, CH
    Cheng, LM
    Ng, KS
    [J]. INTERNATIONAL SOCIETY FOR COMPUTERS AND THEIR APPLICATIONS 11TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 1998, : 63 - 66
  • [26] A Self-Learning and Lossless Dictionary-Based Compression Algorithm
    Rose, J. Dafni
    Dhanushkkar, H.
    Jagadishan, M.
    [J]. 2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [27] Low Capture Power Dictionary-based Test Data Compression
    Sismanoglou, Panagiotis
    Nikolos, Dimitris
    [J]. PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN ISQED 2016, 2016, : 289 - 294
  • [28] Note on the greedy parsing optimality for dictionary-based text compression
    Crochemore, Maxime
    Langiu, Alessio
    Mignosi, Filippo
    [J]. Theoretical Computer Science, 2014, 525 : 55 - 59
  • [29] Note on the greedy parsing optimality for dictionary-based text compression
    [J]. Langiu, A. (Alessio.Langiu@kcl.ac.uk), 1600, Elsevier (525):
  • [30] Dictionary-based program compression on TTAs: Effects on area and power consumption
    Heikkinen, J
    Takala, J
    Corporaal, H
    [J]. 2005 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS - DESIGN AND IMPLEMENTATION (SIPS), 2005, : 479 - 484