Piecing the puzzle together: a revisit to transcript reconstruction problem in RNA-seq

被引:0
|
作者
Yan Huang
Yin Hu
Jinze Liu
机构
[1] University of Kentucky,Department of Computer Science
来源
关键词
Transcript reconstruction; Transcript quantification; Transcriptome; RNA-seq;
D O I
暂无
中图分类号
学科分类号
摘要
The advancement of RNA sequencing (RNA-seq) has provided an unprecedented opportunity to assess both the diversity and quantity of transcript isoforms in an mRNA transcriptome. In this paper, we revisit the computational problem of transcript reconstruction and quantification. Unlike existing methods which focus on how to explain the exons and splice variants detected by the reads with a set of isoforms, we aim at reconstructing transcripts by piecing the reads into individual effective transcript copies. Simultaneously, the quantity of each isoform is explicitly measured by the number of assembled effective copies, instead of estimated solely based on the collective read count. We have developed a novel method named Astroid that solves the problem of effective copy reconstruction on the basis of a flow network. The RNA-seq reads are represented as vertices in the flow network and are connected by weighted edges that evaluate the likelihood of two reads originating from the same effective copy. A maximum likelihood set of transcript copies is then reconstructed by solving a minimum-cost flow problem on the flow network. Simulation studies on the human transcriptome have demonstrated the superior sensitivity and specificity of Astroid in transcript reconstruction as well as improved accuracy in transcript quantification over several existing approaches. The application of Astroid on two real RNA-seq datasets has further demonstrated its accuracy through high correlation between the estimated isoform abundance and the qRT-PCR validations.
引用
收藏
相关论文
共 50 条
  • [11] A Robust Method for Transcript Quantification with RNA-Seq Data
    Huang, Yan
    Hu, Yin
    Jones, Corbin D.
    MacLeod, James N.
    Chiang, Derek Y.
    Liu, Yufeng
    Prins, Jan F.
    Liu, Jinze
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2013, 20 (03) : 167 - 187
  • [12] CLASS: constrained transcript assembly of RNA-seq reads
    Song, Li
    Florea, Liliana
    BMC BIOINFORMATICS, 2013, 14
  • [13] Measure transcript integrity using RNA-seq data
    Wang, Liguo
    Nie, Jinfu
    Sicotte, Hugues
    Li, Ying
    Eckel-Passow, Jeanette E.
    Dasari, Surendra
    Vedell, Peter T.
    Barman, Poulami
    Wang, Liewei
    Weinshiboum, Richard
    Jen, Jin
    Huang, Haojie
    Kohli, Manish
    Kocher, Jean-Pierre A.
    BMC BIOINFORMATICS, 2016, 17
  • [14] TRIP: a method for novel transcript reconstruction from paired-end RNA-seq reads
    Serghei Mangul
    Adrian Caciula
    Dumitru Brinza
    Ion I Mandoiu
    Alex Zelikovsky
    BMC Bioinformatics, 13 (Suppl 18)
  • [15] Strawberry: Fast and accurate genome-guided transcript reconstruction and quantification from RNA-Seq
    Liu, Ruolin
    Dickerson, Julie
    PLOS COMPUTATIONAL BIOLOGY, 2017, 13 (11)
  • [16] Differential analysis of gene regulation at transcript resolution with RNA-seq
    Cole Trapnell
    David G Hendrickson
    Martin Sauvageau
    Loyal Goff
    John L Rinn
    Lior Pachter
    Nature Biotechnology, 2013, 31 : 46 - 53
  • [17] Polyester: simulating RNA-seq datasets with differential transcript expression
    Frazee, Alyssa C.
    Jaffe, Andrew E.
    Langmead, Ben
    Leek, Jeffrey T.
    BIOINFORMATICS, 2015, 31 (17) : 2778 - 2784
  • [18] Differential analysis of gene regulation at transcript resolution with RNA-seq
    Trapnell, Cole
    Hendrickson, David G.
    Sauvageau, Martin
    Goff, Loyal
    Rinn, John L.
    Pachter, Lior
    NATURE BIOTECHNOLOGY, 2013, 31 (01) : 46 - +
  • [19] RNA-Skim: a rapid method for RNA-Seq quantification at transcript level
    Zhang, Zhaojun
    Wang, Wei
    BIOINFORMATICS, 2014, 30 (12) : 283 - 292
  • [20] RNA-eXpress annotates novel transcript features in RNA-seq data
    Forster, Samuel C.
    Finkel, Alexander M.
    Gould, Jodee A.
    Hertzog, Paul J.
    BIOINFORMATICS, 2013, 29 (06) : 810 - 812