Bit Catastrophes for the Burrows-Wheeler Transform

被引:2
|
作者
Giuliani, Sara [1 ]
Inenaga, Shunsuke [2 ]
Liptak, Zsuzsanna [1 ]
Romana, Giuseppe [3 ]
Sciortino, Marinella [3 ]
Urbina, Cristian [4 ]
机构
[1] Univ Verona, Verona, Italy
[2] Kyushu Univ, Fukuoka, Japan
[3] Univ Palermo, Palermo, Italy
[4] Univ Chile, Santiago, Chile
来源
关键词
Burrows-Wheeler transform; Equal-letter run; Repetitiveness measure; Sensitivity;
D O I
10.1007/978-3-031-33264-7_8
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A bit catastrophe, loosely defined, is when a change in just one character of a string causes a significant change in the size of the compressed string. We study this phenomenon for the Burrows-Wheeler Transform (BWT), a string transform at the heart of several of the most popular compressors and aligners today. The parameter determining the size of the compressed data is the number of equal-letter runs of the BWT, commonly denoted r. We exhibit infinite families of strings in which insertion, deletion, resp. substitution of one character increases r from constant to Theta(log n), where n is the length of the string. These strings can be interpreted both as examples for an increase by a multiplicative or an additive Theta(log n)-factor. As regards multiplicative factor, they attain the upper bound given by Akagi, Funakoshi, and Inenaga [Inf & Comput. 2023] of O( log n log r), since here r = O(1). We then give examples of strings in which insertion, deletion, resp. substitution of a character increases r by a Theta(root n) additive factor. These strings significantly improve the best known lower bound for an additive factor of Omega(log n) [Giuliani et al., SOFSEM 2021].
引用
收藏
页码:86 / 99
页数:14
相关论文
共 50 条
  • [21] Burrows-Wheeler Transform Acceleration based on CUDA
    Sheng, Chang
    Dai, Fengzhi
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB 2021), 2021, : P68 - P68
  • [22] Burrows-Wheeler Transform of Words Defined by Morphisms
    Brlek, Srecko
    Frosini, Andrea
    Mancini, Ilaria
    Pergola, Elisa
    Rinaldi, Simone
    COMBINATORIAL ALGORITHMS, IWOCA 2019, 2019, 11638 : 393 - 404
  • [23] Balancing and clustering of words in the Burrows-Wheeler transform
    Restivo, Antonio
    Rosone, Giovanna
    THEORETICAL COMPUTER SCIENCE, 2011, 412 (27) : 3019 - 3032
  • [24] Burrows-Wheeler transform acceleration based on CUDA
    Sheng, Chang
    Dai, Fengzhi
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB 2021), 2021, : 596 - 599
  • [25] Multi-allelic Positional Burrows-Wheeler Transform
    Naseri, Ardalan
    Zhi, Degui
    Zhang, Shaojie
    2017 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2017,
  • [26] DNA sequence compression using the Burrows-Wheeler Transform
    Adjeroh, D
    Zhang, Y
    Mukherjee, A
    Powell, M
    Bell, T
    CSB2002: IEEE COMPUTER SOCIETY BIOINFORMATICS CONFERENCE, 2002, : 303 - 313
  • [27] Metagenomic analysis through the extended Burrows-Wheeler transform
    Veronica Guerrini
    Felipe A. Louza
    Giovanna Rosone
    BMC Bioinformatics, 21
  • [28] A survey of string orderings and their application to the Burrows-Wheeler transform
    Daykin, Jacqueline W.
    Groult, Richard
    Guesnet, Yannick
    Lecroq, Thierry
    Lefebvre, Arnaud
    Leonard, Martine
    Prieur-Gaston, Elise
    THEORETICAL COMPUTER SCIENCE, 2018, 710 : 52 - 65
  • [29] Searching for unique DNA sequences with the Burrows-Wheeler Transform
    Pokrzywa, Rafal
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2008, 28 (01) : 95 - 104
  • [30] Full-Tex searching & the Burrows-Wheeler transform
    Willets, K
    DR DOBBS JOURNAL, 2003, 28 (12): : 48 - +