Evaluation of tools for long read RNA-seq splice-aware alignment

被引:44
|
作者
Krizanovic, Kresimir [1 ]
Echchiki, Amina [2 ,3 ]
Roux, Julien [2 ,3 ,5 ]
Sikic, Mile [1 ,4 ]
机构
[1] Univ Zagreb, Fac Elect Engn & Comp, Dept Elect Syst & Informat Proc, Zagreb 10000, Croatia
[2] Univ Lausanne, Dept Ecol & Evolut, CH-1015 Lausanne, Switzerland
[3] Swiss Inst Bioinformat, CH-1015 Lausanne, Switzerland
[4] Bioinformat Inst, Singapore 138671, Singapore
[5] Univ Hosp Basel, Dept Biomed, CH-4031 Basel, Switzerland
关键词
TRANSCRIPTOME; ALIGNER; HYBRID;
D O I
10.1093/bioinformatics/btx668
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput sequencing has transformed the study of gene expression levels through RNA-seq, a technique that is now routinely used by various fields, such as genetic research or diagnostics. The advent of third generation sequencing technologies providing significantly longer reads opens up new possibilities. However, the high error rates common to these technologies set new bioinformatics challenges for the gapped alignment of reads to their genomic origin. In this study, we have explored how currently available RNA-seq splice-aware alignment tools cope with increased read lengths and error rates. All tested tools were initially developed for short NGS reads, but some have claimed support for long Pacific Biosciences (PacBio) or even Oxford Nanopore Technologies (ONT) MinION reads. The tools were tested on synthetic and real datasets from two technologies (PacBio and ONT MinION). Alignment quality and resource usage were compared across different aligners. The effect of error correction of long reads was explored, both using self-correction and correction with an external short reads dataset. A tool was developed for evaluating RNA-seq alignment results. This tool can be used to compare the alignment of simulated reads to their genomic origin, or to compare the alignment of real reads to a set of annotated transcripts. Our tests show that while some RNA-seq aligners were unable to cope with long error-prone reads, others produced overall good results. We further show that alignment accuracy can be improved using error-corrected reads. https://figshare.com/projects/RNAseq_benchmark/24391
引用
收藏
页码:748 / 754
页数:7
相关论文
共 50 条
  • [21] Discerning novel splice junctions derived from RNA-seq alignment: a deep learning approach
    Yi Zhang
    Xinan Liu
    James MacLeod
    Jinze Liu
    BMC Genomics, 19
  • [22] Discerning novel splice junctions derived from RNA-seq alignment: a deep learning approach
    Zhang, Yi
    Liu, Xinan
    MacLeod, James
    Liu, Jinze
    BMC GENOMICS, 2018, 19
  • [23] Evaluation of Alignment Algorithms for Discovery and Identification of Pathogens Using RNA-Seq
    Borozan, Ivan
    Watt, Stuart N.
    Ferretti, Vincent
    PLOS ONE, 2013, 8 (10):
  • [24] Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment
    Du, Yuheng
    Huang, Qianhui
    Arisdakessian, Cedric
    Garmire, Lana X.
    G3-GENES GENOMES GENETICS, 2020, 10 (05): : 1775 - 1783
  • [25] POMP: a powerful splice mapper for RNA-seq reads
    Saha, Subrata
    Rajasekaran, Sanguthevar
    PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2016, : 414 - 421
  • [26] L-GIREMI uncovers RNA editing sites in long-read RNA-seq
    Zhiheng Liu
    Giovanni Quinones-Valdez
    Ting Fu
    Elaine Huang
    Mudra Choudhury
    Fairlie Reese
    Ali Mortazavi
    Xinshu Xiao
    Genome Biology, 24
  • [27] L-GIREMI uncovers RNA editing sites in long-read RNA-seq
    Liu, Zhiheng
    Quinones-Valdez, Giovanni
    Fu, Ting
    Huang, Elaine
    Choudhury, Mudra
    Reese, Fairlie
    Mortazavi, Ali
    Xiao, Xinshu
    GENOME BIOLOGY, 2023, 24 (01)
  • [28] Supersplat-spliced RNA-seq alignment
    Bryant, Douglas W., Jr.
    Shen, Rongkun
    Priest, Henry D.
    Wong, Weng-Keen
    Mockler, Todd C.
    BIOINFORMATICS, 2010, 26 (12) : 1500 - 1505
  • [29] Long-read RNA-Seq for the discovery of long noncoding and antisense RNAs in plant organelles
    Lima, Matheus Sanita
    Domingues, Douglas Silva
    Paschoal, Alexandre Rossi
    Smith, David Roy
    PHYSIOLOGIA PLANTARUM, 2024, 176 (04)
  • [30] voom: precision weights unlock linear model analysis tools for RNA-seq read counts
    Law, Charity W.
    Chen, Yunshun
    Shi, Wei
    Smyth, Gordon K.
    GENOME BIOLOGY, 2014, 15 (02):