An analysis of the Burrows-Wheeler Transform

被引:208
|
作者
Manzini, G [1 ]
机构
[1] Univ Piemonte Orientale, Dipartimento Sci & Tecnol Avanzate, I-15100 Alessandria, Italy
关键词
algorithms; performance; block sorting; Burrows-Wheeler Transform; move-to-front encoding; worst-case analysis of compression;
D O I
10.1145/382780.382782
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The Burrows-Wheeler Transform (also known as Block-Sorting) is at the base of compression algorithms that are the state of the art in lossless data compression. In this paper, we analyze two algorithms that use this technique. The first one is the original algorithm described by Burrows and Wheeler, which, despite its simplicity. outperforms the Gzip compressor. The second one uses an additional run-length encoding step to improve compression. We prove that the compression ratio of both algorithms can be bounded in terms of the kth order empirical entropy of the input string for any k greater than or equal to 0. We make no assumptions on the input and we obtain bounds which hold in the worst case, that is, for every possible input string. All previous results for Block-Sorting algorithms were concerned with the average compression ratio and have been established assuming that the input comes from a finite-order Markov source.
引用
收藏
页码:407 / 430
页数:24
相关论文
共 50 条
  • [41] Balanced Words Having Simple Burrows-Wheeler Transform
    Restivo, Antonio
    Rosone, Giovanna
    DEVELOPMENTS IN LANGUAGE THEORY, PROCEEDINGS, 2009, 5583 : 431 - 442
  • [42] Constructing and indexing the bijective and extended Burrows-Wheeler transform
    Bannai, Hideo
    Karkkainen, Juha
    Koppl, Dominik
    Piatkowski, Marcin
    INFORMATION AND COMPUTATION, 2024, 297
  • [43] Multi-allelic positional Burrows-Wheeler transform
    Ardalan Naseri
    Degui Zhi
    Shaojie Zhang
    BMC Bioinformatics, 20
  • [44] On Lossless Image Compression using the Burrows-Wheeler Transform
    Adjeroh, Don
    Bhupathiraju, Kalyan V.
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [45] Parallel and Memory-efficient Burrows-Wheeler Transform
    Hayashi, Shinya
    Taura, Kenjiro
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [46] Efficient Parameterized Matching Using Burrows-Wheeler Transform
    Goel, Anjali
    Prasad, Rajesh
    Agarwal, Suneeta
    Sangal, Amit
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (01) : 44 - 49
  • [47] Space-Efficient Computation of the Burrows-Wheeler Transform
    Fuentes-Sepulveda, Jose
    Navarro, Gonzalo
    Nekrich, Yakov
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 132 - 141
  • [48] Approximate pattern matching using the Burrows-Wheeler Transform
    Nan, Z
    Mukherjee, A
    Adjeroh, D
    Bell, T
    DCC 2003: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2003, : 458 - 458
  • [50] A Graph Extension of the Positional Burrows-Wheeler Transform and Its Applications
    Novak, Adam M.
    Garrison, Erik
    Paten, Benedict
    ALGORITHMS IN BIOINFORMATICS, 2016, 9838 : 246 - 256