DNA sequence compression using the Burrows-Wheeler Transform

被引:28
|
作者
Adjeroh, D [1 ]
Zhang, Y [1 ]
Mukherjee, A [1 ]
Powell, M [1 ]
Bell, T [1 ]
机构
[1] W Virginia Univ, Lane Dept Comp Sci & Elect Engn, Morgantown, WV 26506 USA
关键词
DNA sequence compression; repetition structures; Burrows-Wheeler Transform; BWT;
D O I
10.1109/CSB.2002.1039352
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We investigate. off-line dictionary oriented approaches to DNA sequence compression, based on the Burrows-Wheeler Transform (BWT). The preponderance of short repeating patterns is an important phenomenon in biological sequences. Here, we propose off-line methods to compress DNA sequences that exploit the different repetition structures inherent in such sequences. Repetition analysis is performed based on the relationship between the BWT and important pattern matching data structures, such as the suffix tree and suffix array. We discuss how the proposed approach can be incorporated in the BWT compression pipeline.
引用
下载
收藏
页码:303 / 313
页数:11
相关论文
共 50 条
  • [1] QLFC - a compression algorithm using the Burrows-Wheeler transform
    Ghido, F
    DCC 2005: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2005, : 459 - 459
  • [2] On Lossless Image Compression using the Burrows-Wheeler Transform
    Adjeroh, Don
    Bhupathiraju, Kalyan V.
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [3] Parallel lossless data compression using the Burrows-Wheeler Transform
    Gilchrist, Jeff
    Cuhadar, Aysegul
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2008, 4 (01) : 117 - 135
  • [4] Parallel architecture for DNA sequence inexact matching with Burrows-Wheeler Transform
    Xin, Yao
    Liu, Benben
    Min, Biao
    Li, Will X. Y.
    Cheung, Ray C. C.
    Fong, Anthony S.
    Chan, Ting Fung
    MICROELECTRONICS JOURNAL, 2013, 44 (08) : 670 - 682
  • [5] Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform
    Cox, Anthony J.
    Bauer, Markus J.
    Jakobi, Tobias
    Rosone, Giovanna
    BIOINFORMATICS, 2012, 28 (11) : 1415 - 1419
  • [6] Improving text compression ratios with the Burrows-Wheeler Transform
    Kruse, H
    Mukherjee, A
    DCC '99 - DATA COMPRESSION CONFERENCE, PROCEEDINGS, 1999, : 536 - 536
  • [7] An analysis of the Burrows-Wheeler Transform
    Manzini, G
    JOURNAL OF THE ACM, 2001, 48 (03) : 407 - 430
  • [8] Burrows-Wheeler transform for terabases
    Siren, Jouni
    2016 DATA COMPRESSION CONFERENCE (DCC), 2016, : 211 - 220
  • [9] Word-based text compression using the Burrows-Wheeler transform
    Moffat, A
    Isal, RYK
    INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (05) : 1175 - 1192
  • [10] Dynamic Burrows-Wheeler Transform
    Salson, Mikael
    Lecroq, Thierry
    Leonard, Martine
    Mouchard, Laurent
    PROCEEDINGS OF THE PRAGUE STRINGOLOGY CONFERENCE 2008, 2008, : 13 - 25