long-read-tools.org: an interactive catalogue of analysis methods for long-read sequencing data

被引:24
|
作者
Amarasinghe, Shanika L. [1 ,2 ]
Ritchie, Matthew E. [1 ,2 ,3 ]
Gouil, Quentin [1 ,2 ]
机构
[1] Walter & Eliza Hall Inst Med Res, Epigenet & Dev Div, 1G Royal Parade, Parkville, Vic 3052, Australia
[2] Univ Melbourne, Dept Med Biol, 1G Royal Parade, Parkville, Vic 3052, Australia
[3] Univ Melbourne, Sch Math & Stat, 813 Swanston St, Parkville, Vic 3010, Australia
来源
GIGASCIENCE | 2021年 / 10卷 / 02期
基金
英国医学研究理事会; 澳大利亚国家健康与医学研究理事会;
关键词
database; long-read sequencing; data analysis; nanopore; PacBio; ALIGNMENT; RNA;
D O I
10.1093/gigascience/giab003
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The data produced by long-read third-generation sequencers have unique characteristics compared to short-read sequencing data, often requiring tailored analysis tools for tasks ranging from quality control to downstream processing. The rapid growth in software that addresses these challenges for different genomics applications is difficult to keep track of, which makes it hard for users to choose the most appropriate tool for their analysis goal and for developers to identify areas of need and existing solutions to benchmark against. Findings: We describe the implementation of long-read-tools.org, an open-source database that organizes the rapidly expanding collection of long-read data analysis tools and allows its exploration through interactive browsing and filtering. The current database release contains 478 tools across 32 categories. Most tools are developed in Python, and the most frequent analysis tasks include base calling, de novo assembly, error correction, quality checking/filtering, and isoform detection, while long-read single-cell data analysis and transcriptomics are areas with the fewest tools available. Conclusion: Continued growth in the application of long-read sequencing in genomics research positions the long-read-tools.org database as an essential resource that allows researchers to keep abreast of both established and emerging software to help guide the selection of the most relevant tool for their analysis needs.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] The Application of Long-Read Sequencing to Cancer
    Ermini, Luca
    Driguez, Patrick
    CANCERS, 2024, 16 (07)
  • [22] Nanopore long-read sequencing of circRNAs
    Rahimi, Karim
    Nielsen, Anne Faerch
    Veno, Morten T.
    Kjems, Jorgen
    METHODS, 2021, 196 : 23 - 29
  • [23] Comprehensive assessment of mRNA isoform detection methods for long-read sequencing data
    Su, Yaqi
    Yu, Zhejian
    Jin, Siqian
    Ai, Zhipeng
    Yuan, Ruihong
    Chen, Xinyi
    Xue, Ziwei
    Guo, Yixin
    Chen, Di
    Liang, Hongqing
    Liu, Zuozhu
    Liu, Wanlu
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [24] A graphical, interactive and GPU-enabled workflow to process long-read sequencing data
    Shishir Reddy
    Ling-Hong Hung
    Olga Sala-Torra
    Jerald P. Radich
    Cecilia CS Yeung
    Ka Yee Yeung
    BMC Genomics, 22
  • [25] Long-Read Annotation: Automated Eukaryotic Genome Annotation Based on Long-Read cDNA Sequencing
    Cook, David E.
    Valle-Inclan, Jose Espejo
    Pajoro, Alice
    Rovenich, Hanna
    Thomma, Bart P. H. J.
    Faino, Luigi
    PLANT PHYSIOLOGY, 2019, 179 (01) : 38 - 54
  • [26] A graphical, interactive and GPU-enabled workflow to process long-read sequencing data
    Reddy, Shishir
    Hung, Ling-Hong
    Sala-Torra, Olga
    Radich, Jerald P.
    Yeung, Cecilia C. S.
    Yeung, Ka Yee
    BMC GENOMICS, 2021, 22 (01)
  • [27] Comparison of long-read methods for sequencing and assembly of a plant genome
    Murigneux, Valentine
    Rai, Subash Kumar
    Furtado, Agnelo
    Bruxner, Timothy J. C.
    Tian, Wei
    Harliwong, Ivon
    Wei, Hanmin
    Yang, Bicheng
    Ye, Qianyu
    Anderson, Ellis
    Mao, Qing
    Drmanac, Radoje
    Wang, Ou
    Peters, Brock A.
    Xu, Mengyang
    Wu, Pei
    Topp, Bruce
    Coin, Lachlan J. M.
    Henry, Robert J.
    GIGASCIENCE, 2020, 9 (12):
  • [28] Startups use short-read data to expand long-read sequencing market
    Eisenstein, Michael
    NATURE BIOTECHNOLOGY, 2015, 33 (05) : 433 - 435
  • [29] Startups use short-read data to expand long-read sequencing market
    Michael Eisenstein
    Nature Biotechnology, 2015, 33 : 433 - 435
  • [30] Detecting Phase Effects Using Long-Read Sequencing Data
    He, Gengming
    Mastromatteo, Scott
    Keenan, Katherine
    Strug, Lisa
    GENETIC EPIDEMIOLOGY, 2024, 48 (07) : 360 - 360