CoLoRd: compressing long reads

被引:0
|
作者
Marek Kokot
Adam Gudyś
Heng Li
Sebastian Deorowicz
机构
[1] Silesian University of Technology,Faculty of Automatic Control, Electronics and Computer Science
[2] Dana-Farber Cancer Institute,Department of Data Sciences
[3] Harvard Medical School,Department of Biomedical Informatics
来源
Nature Methods | 2022年 / 19卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The cost of maintaining exabytes of data produced by sequencing experiments every year has become a major issue in today’s genomic research. In spite of the increasing popularity of third-generation sequencing, the existing algorithms for compressing long reads exhibit a minor advantage over the general-purpose gzip. We present CoLoRd, an algorithm able to reduce the size of third-generation sequencing data by an order of magnitude without affecting the accuracy of downstream analyses.
引用
收藏
页码:441 / 444
页数:3
相关论文
共 50 条
  • [31] A survey of mapping algorithms in the long-reads era
    Sahlin, Kristoffer
    Baudeau, Thomas
    Cazaux, Bastien
    Marchet, Camille
    GENOME BIOLOGY, 2023, 24 (01)
  • [32] Streamlining Quantitative Analysis of Long RNA Sequencing Reads
    Oeck, Sebastian
    Tuns, Alicia I.
    Hurst, Sebastian
    Schramm, Alexander
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (19) : 1 - 8
  • [33] Jabba: hybrid error correction for long sequencing reads
    Giles Miclotte
    Mahdi Heydari
    Piet Demeester
    Stephane Rombauts
    Yves Van de Peer
    Pieter Audenaert
    Jan Fostier
    Algorithms for Molecular Biology, 11
  • [34] Short paired-end reads trump long single-end reads for expression analysis
    Freedman, Adam H.
    Gaspar, John M.
    Sackton, Timothy B.
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [35] Accurate spliced alignment of long RNA sequencing reads
    Sahlin, Kristoffer
    Makinen, Veli
    BIOINFORMATICS, 2021, 37 (24) : 4643 - 4651
  • [36] Benchmarking the MinION: Evaluating long reads for microbial profiling
    Robert Maximilian Leidenfrost
    Dierk-Christoph Pöther
    Udo Jäckel
    Röbbe Wünschiers
    Scientific Reports, 10
  • [37] Jabba: hybrid error correction for long sequencing reads
    Miclotte, Giles
    Heydari, Mahdi
    Demeester, Piet
    Rombauts, Stephane
    Van de Peer, Yves
    Audenaert, Pieter
    Fostier, Jan
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2016, 11
  • [38] Probably Correct: Rescuing Repeats with Short and Long Reads
    Cechova, Monika
    GENES, 2021, 12 (01) : 1 - 13
  • [39] HYBRIDSPADES: an algorithm for hybrid assembly of short and long reads
    Antipov, Dmitry
    Korobeynikov, Anton
    McLean, Jeffrey S.
    Pevzner, Pavel A.
    BIOINFORMATICS, 2016, 32 (07) : 1009 - 1015
  • [40] Benchmarking the MinION : Evaluating long reads for microbial profiling
    Leidenfrost, Robert Maximilian
    Pother, Dierk-Christoph
    Jackel, Udo
    Wunschiers, Robbe
    SCIENTIFIC REPORTS, 2020, 10 (01)