CoLoRd: compressing long reads

被引:0
|
作者
Marek Kokot
Adam Gudyś
Heng Li
Sebastian Deorowicz
机构
[1] Silesian University of Technology,Faculty of Automatic Control, Electronics and Computer Science
[2] Dana-Farber Cancer Institute,Department of Data Sciences
[3] Harvard Medical School,Department of Biomedical Informatics
来源
Nature Methods | 2022年 / 19卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The cost of maintaining exabytes of data produced by sequencing experiments every year has become a major issue in today’s genomic research. In spite of the increasing popularity of third-generation sequencing, the existing algorithms for compressing long reads exhibit a minor advantage over the general-purpose gzip. We present CoLoRd, an algorithm able to reduce the size of third-generation sequencing data by an order of magnitude without affecting the accuracy of downstream analyses.
引用
收藏
页码:441 / 444
页数:3
相关论文
共 50 条
  • [1] CoLoRd: compressing long reads
    Kokot, Marek
    Gudys, Adam
    Li, Heng
    Deorowicz, Sebastian
    NATURE METHODS, 2022, 19 (04) : 441 - +
  • [2] CoLoRMap: Correcting Long Reads by Mapping short reads
    Haghshenas, Ehsan
    Hach, Faraz
    Sahinalp, S. Cenk
    Chauve, Cedric
    BIOINFORMATICS, 2016, 32 (17) : 545 - 551
  • [3] Long reads for a short plant
    Kellogg, Elizabeth A.
    NATURE PLANTS, 2015, 1 (12)
  • [4] Long reads: their purpose and place
    Pollard, Martin O.
    Gurdasani, Deepti
    Mentzer, Alexander J.
    Porter, Tarryn
    Sandhu, Manjinder S.
    HUMAN MOLECULAR GENETICS, 2018, 27 (R2) : R234 - R241
  • [5] AS LONG AS MY CHILD READS
    ZABAWSKI, I
    READING TEACHER, 1970, 23 (07): : 631 - 632
  • [6] Informatics for PacBio Long Reads
    Suzuki, Yuta
    SINGLE MOLECULE AND SINGLE CELL SEQUENCING, 2019, 1129 : 119 - 129
  • [7] Circular consensus sequencing with long reads
    Tang, Lei
    NATURE METHODS, 2019, 16 (10) : 958 - 958
  • [8] Finding long tandem repeats in long noisy reads
    Morishita, Shinichi
    Ichikawa, Kazuki
    Myers, Eugene W.
    BIOINFORMATICS, 2021, 37 (05) : 612 - 621
  • [9] Circular consensus sequencing with long reads
    Lei Tang
    Nature Methods, 2019, 16 : 958 - 958
  • [10] Evaluation of synthetic long reads technology
    Gerber, Zuzana
    Sandron, Florian
    Daviaud, Christian
    Lenoir, Naell
    Gras, Margaux
    Meyer, Vincent
    Olaso, Robert
    Deleuze, Jean-Francois
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1779 - 1779