Contrasting and combining transcriptome complexity captured by short and long RNA sequencing reads

Citations: 2
Authors
Han, Seong Woo [1]
Jewell, San [2]
Thomas-Tikhonenko, Andrei [3, 4]
Barash, Yoseph [1, 2]
Affiliations
[1] Univ Penn, Sch Engn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
[2] Univ Penn, Perelman Sch Med, Dept Genet, Philadelphia, PA 19104 USA
[3] Univ Penn, Perelman Sch Med, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
[4] Childrens Hosp Philadelphia, Div Canc Pathobiol, Philadelphia, PA 19104 USA
Funding
National Institutes of Health (USA)
DOI
10.1101/gr.278659.123
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Mapping transcriptomic variations using either short- or long-read RNA sequencing is a staple of genomic research. Long reads are able to capture entire isoforms and overcome repetitive regions, whereas short reads still provide improved coverage and error rates. Yet, open questions remain, such as how to quantitatively compare the technologies, can we combine them, and what is the benefit of such a combined view? We tackle these questions by first creating a pipeline to assess matched long- and short-read data using a variety of transcriptome statistics. We find that across data sets, algorithms, and technologies, matched short-read data detects ~30% more splice junctions, such that ~10%-30% of the splice junctions included at ≥20% by short reads are missed by long reads. In contrast, long reads detect many more intron-retention events and can detect full isoforms, pointing to the benefit of combining the technologies. We introduce MAJIQ-L, an extension of the MAJIQ software, to enable a unified view of transcriptome variations from both technologies and demonstrate its benefits. Our software can be used to assess any future long-read technology or algorithm and can be combined with short-read data for improved transcriptome analysis.
Pages: 1624-1635
Number of pages: 12