Bit Catastrophes for the Burrows-Wheeler Transform

被引:2
|
作者
Giuliani, Sara [1 ]
Inenaga, Shunsuke [2 ]
Liptak, Zsuzsanna [1 ]
Romana, Giuseppe [3 ]
Sciortino, Marinella [3 ]
Urbina, Cristian [4 ]
机构
[1] Univ Verona, Verona, Italy
[2] Kyushu Univ, Fukuoka, Japan
[3] Univ Palermo, Palermo, Italy
[4] Univ Chile, Santiago, Chile
来源
关键词
Burrows-Wheeler transform; Equal-letter run; Repetitiveness measure; Sensitivity;
D O I
10.1007/978-3-031-33264-7_8
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A bit catastrophe, loosely defined, is when a change in just one character of a string causes a significant change in the size of the compressed string. We study this phenomenon for the Burrows-Wheeler Transform (BWT), a string transform at the heart of several of the most popular compressors and aligners today. The parameter determining the size of the compressed data is the number of equal-letter runs of the BWT, commonly denoted r. We exhibit infinite families of strings in which insertion, deletion, resp. substitution of one character increases r from constant to Theta(log n), where n is the length of the string. These strings can be interpreted both as examples for an increase by a multiplicative or an additive Theta(log n)-factor. As regards multiplicative factor, they attain the upper bound given by Akagi, Funakoshi, and Inenaga [Inf & Comput. 2023] of O( log n log r), since here r = O(1). We then give examples of strings in which insertion, deletion, resp. substitution of a character increases r by a Theta(root n) additive factor. These strings significantly improve the best known lower bound for an additive factor of Omega(log n) [Giuliani et al., SOFSEM 2021].
引用
收藏
页码:86 / 99
页数:14
相关论文
共 50 条
  • [41] Balanced Words Having Simple Burrows-Wheeler Transform
    Restivo, Antonio
    Rosone, Giovanna
    DEVELOPMENTS IN LANGUAGE THEORY, PROCEEDINGS, 2009, 5583 : 431 - 442
  • [42] Constructing and indexing the bijective and extended Burrows-Wheeler transform
    Bannai, Hideo
    Karkkainen, Juha
    Koppl, Dominik
    Piatkowski, Marcin
    INFORMATION AND COMPUTATION, 2024, 297
  • [43] Multi-allelic positional Burrows-Wheeler transform
    Ardalan Naseri
    Degui Zhi
    Shaojie Zhang
    BMC Bioinformatics, 20
  • [44] On Lossless Image Compression using the Burrows-Wheeler Transform
    Adjeroh, Don
    Bhupathiraju, Kalyan V.
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [45] Parallel and Memory-efficient Burrows-Wheeler Transform
    Hayashi, Shinya
    Taura, Kenjiro
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [46] Efficient Parameterized Matching Using Burrows-Wheeler Transform
    Goel, Anjali
    Prasad, Rajesh
    Agarwal, Suneeta
    Sangal, Amit
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (01) : 44 - 49
  • [47] Space-Efficient Computation of the Burrows-Wheeler Transform
    Fuentes-Sepulveda, Jose
    Navarro, Gonzalo
    Nekrich, Yakov
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 132 - 141
  • [48] Approximate pattern matching using the Burrows-Wheeler Transform
    Nan, Z
    Mukherjee, A
    Adjeroh, D
    Bell, T
    DCC 2003: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2003, : 458 - 458
  • [50] A Graph Extension of the Positional Burrows-Wheeler Transform and Its Applications
    Novak, Adam M.
    Garrison, Erik
    Paten, Benedict
    ALGORITHMS IN BIOINFORMATICS, 2016, 9838 : 246 - 256