Refining the r-index

Cited by: 21
Authors
Bannai, Hideo [1, 2]
Gagie, Travis [3, 4, 5]
I, Tomohiro [6]
Affiliations
[1] Kyushu Univ, Dept Informat, Fukuoka, Japan
[2] RIKEN, Ctr Adv Intelligence Project, Tokyo, Japan
[3] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada
[4] Diego Portales Univ, Sch Comp Sci & Telecommun, Santiago, Chile
[5] Ctr Biotechnol & Bioengn, Santiago, Chile
[6] Kyushu Inst Technol, Dept Artificial Intelligence, Kitakyushu, Fukuoka, Japan
Keywords
Burrows-Wheeler transform; FM-index; r-index; Dynamic indexing; LZ77; parsing; Matching statistics; Read alignment
DOI
10.1016/j.tcs.2019.08.005
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Gagie, Navarro and Prezza's r-index (SODA, 2018) promises to speed up DNA alignment and variation calling by allowing us to index entire genomic databases, provided certain obstacles can be overcome. In this paper we first strengthen and simplify Policriti and Prezza's Toehold Lemma (DCC '16; Algorithmica, 2017), which inspired the r-index and plays an important role in its implementation. We then show how to update the r-index efficiently after adding a new genome to the database, which is likely to be vital in practice. As a by-product of this result, we obtain an online version of Policriti and Prezza's algorithm for constructing the LZ77 parse from a run-length compressed Burrows-Wheeler Transform. Our experiments demonstrate the practicality of all three of these results. Finally, we show how to augment the r-index such that, given a new genome and fast random access to the database, we can quickly compute the matching statistics and maximal exact matches of the new genome with respect to the database. (C) 2019 Elsevier B.V. All rights reserved.
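As a rough illustration of the quantities the abstract mentions, the sketch below computes a Burrows-Wheeler Transform naively, counts its equal-letter runs (the r that gives the r-index its name), and computes matching statistics by brute force. It is not the paper's method: the function names `bwt`, `run_count` and `matching_statistics` are hypothetical, the `$` sentinel is assumed not to occur in the input, and both algorithms are quadratic or worse, whereas the r-index targets space proportional to r.

```python
def bwt(text: str) -> str:
    """Naive BWT via sorted rotations (assumes '$' does not occur in text)."""
    s = text + "$"  # sentinel, lexicographically smaller than any letter
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    return "".join(rot[-1] for rot in rotations)


def run_count(s: str) -> int:
    """Number r of maximal equal-letter runs in s."""
    return sum(1 for i, c in enumerate(s) if i == 0 or c != s[i - 1])


def matching_statistics(pattern: str, text: str) -> list[int]:
    """ms[i] = length of the longest prefix of pattern[i:] occurring in text.

    Brute force for illustration only; an index would answer this
    without rescanning the text.
    """
    ms = []
    for i in range(len(pattern)):
        length = 0
        while i + length < len(pattern) and pattern[i:i + length + 1] in text:
            length += 1
        ms.append(length)
    return ms


if __name__ == "__main__":
    transformed = bwt("abababab")
    print(transformed, run_count(transformed))
    print(matching_statistics("nab", "banana"))
```

Repetitive inputs such as a collection of similar genomes tend to give few runs in the BWT, which is why an index whose size depends on r rather than on the text length is attractive for genomic databases.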
Pages: 96-108
Number of pages: 13