Piecing the puzzle together: a revisit to transcript reconstruction problem in RNA-seq

被引:0
|
作者
Yan Huang
Yin Hu
Jinze Liu
机构
[1] University of Kentucky,Department of Computer Science
来源
关键词
Transcript reconstruction; Transcript quantification; Transcriptome; RNA-seq;
D O I
暂无
中图分类号
学科分类号
摘要
The advancement of RNA sequencing (RNA-seq) has provided an unprecedented opportunity to assess both the diversity and quantity of transcript isoforms in an mRNA transcriptome. In this paper, we revisit the computational problem of transcript reconstruction and quantification. Unlike existing methods which focus on how to explain the exons and splice variants detected by the reads with a set of isoforms, we aim at reconstructing transcripts by piecing the reads into individual effective transcript copies. Simultaneously, the quantity of each isoform is explicitly measured by the number of assembled effective copies, instead of estimated solely based on the collective read count. We have developed a novel method named Astroid that solves the problem of effective copy reconstruction on the basis of a flow network. The RNA-seq reads are represented as vertices in the flow network and are connected by weighted edges that evaluate the likelihood of two reads originating from the same effective copy. A maximum likelihood set of transcript copies is then reconstructed by solving a minimum-cost flow problem on the flow network. Simulation studies on the human transcriptome have demonstrated the superior sensitivity and specificity of Astroid in transcript reconstruction as well as improved accuracy in transcript quantification over several existing approaches. The application of Astroid on two real RNA-seq datasets has further demonstrated its accuracy through high correlation between the estimated isoform abundance and the qRT-PCR validations.
引用
收藏
相关论文
共 50 条
  • [21] Fusion Transcript Detection from RNA-Seq using Jaccard Distance
    Mohebbi, Hamidreza
    Haspel, Nurit
    Simovici, Dan
    Quach, Joyce
    ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
  • [22] De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis
    Haas, Brian J.
    Papanicolaou, Alexie
    Yassour, Moran
    Grabherr, Manfred
    Blood, Philip D.
    Bowden, Joshua
    Couger, Matthew Brian
    Eccles, David
    Li, Bo
    Lieber, Matthias
    MacManes, Matthew D.
    Ott, Michael
    Orvis, Joshua
    Pochet, Nathalie
    Strozzi, Francesco
    Weeks, Nathan
    Westerman, Rick
    William, Thomas
    Dewey, Colin N.
    Henschel, Robert
    Leduc, Richard D.
    Friedman, Nir
    Regev, Aviv
    NATURE PROTOCOLS, 2013, 8 (08) : 1494 - 1512
  • [23] Bayesian estimation of differential transcript usage from RNA-seq data
    Papastamoulis, Panagiotis
    Rattray, Magnus
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2017, 16 (5-6) : 387 - 405
  • [24] Deep-learning augmented RNA-seq analysis of transcript splicing
    Zhang, Zijun
    Pan, Zhicheng
    Ying, Yi
    Xie, Zhijie
    Adhikari, Samir
    Phillips, John
    Carstens, Russ P.
    Black, Douglas L.
    Wu, Yingnian
    Xing, Yi
    NATURE METHODS, 2019, 16 (04) : 307 - +
  • [25] Deep-learning augmented RNA-seq analysis of transcript splicing
    Zijun Zhang
    Zhicheng Pan
    Yi Ying
    Zhijie Xie
    Samir Adhikari
    John Phillips
    Russ P. Carstens
    Douglas L. Black
    Yingnian Wu
    Yi Xing
    Nature Methods, 2019, 16 : 307 - 310
  • [26] RNA-seq for comparative transcript profiling of kenaf under salinity stress
    Li, Hui
    Li, Defang
    Chen, Anguo
    Tang, Huijuan
    Li, Jianjun
    Huang, Siqi
    JOURNAL OF PLANT RESEARCH, 2017, 130 (02) : 365 - 372
  • [27] RNA-seq for comparative transcript profiling of kenaf under salinity stress
    Hui Li
    Defang Li
    Anguo Chen
    Huijuan Tang
    Jianjun Li
    Siqi Huang
    Journal of Plant Research, 2017, 130 : 365 - 372
  • [28] Characterization and improvement of RNA-Seq precision in quantitative transcript expression profiling
    Labaj, Pawel P.
    Leparc, German G.
    Linggi, Bryan E.
    Markillie, Lye Meng
    Wiley, H. Steven
    Kreil, David P.
    BIOINFORMATICS, 2011, 27 (13) : I383 - I391
  • [29] Transcript length bias in RNA-seq data confounds systems biology
    Alicia Oshlack
    Matthew J Wakefield
    Biology Direct, 4
  • [30] Transcript length bias in RNA-seq data confounds systems biology
    Oshlack, Alicia
    Wakefield, Matthew J.
    BIOLOGY DIRECT, 2009, 4