Efficient RNA isoform identification and quantification from RNA-Seq data with network flows

被引:46
|
作者
Bernard, Elsa [1 ,2 ,3 ]
Jacob, Laurent [4 ]
Mairal, Julien [5 ]
Vert, Jean-Philippe [1 ,2 ,3 ]
机构
[1] Mines ParisTech, Ctr Computat Biol, F-77300 Fontainebleau, France
[2] Inst Curie, F-75248 Paris, France
[3] INSERM, U900, F-75248 Paris, France
[4] Univ Lyon 1, INRA, CNRS, Lab Biometrie & Biol Evolut,UMR5558, Villeurbanne, France
[5] INRIA Grenoble Rhone Alpes, LEAR Project Team, F-38330 Montbonnot St Martin, France
基金
美国国家科学基金会; 欧洲研究理事会;
关键词
ABUNDANCE ESTIMATION; TRANSCRIPTOME; EXPRESSION; SELECTION; ALGORITHM; DISCOVERY; GENOME; GRAPHS; LASSO;
D O I
10.1093/bioinformatics/btu317
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Several state-of-the-art methods for isoform identification and quantification are based on l(1)-regularized regression, such as the Lasso. However, explicitly listing the-possibly exponentially-large set of candidate transcripts is intractable for genes with many exons. For this reason, existing approaches using the l(1)-penalty are either restricted to genes with few exons or only run the regression algorithm on a small set of preselected isoforms. Results: We introduce a new technique called FlipFlop, which can efficiently tackle the sparse estimation problem on the full set of candidate isoforms by using network flow optimization. Our technique removes the need of a preselection step, leading to better isoform identification while keeping a low computational cost. Experiments with synthetic and real RNA-Seq data confirm that our approach is more accurate than alternative methods and one of the fastest available.
引用
下载
收藏
页码:2447 / 2455
页数:9
相关论文
共 50 条
  • [1] Simultaneous Isoform Discovery and Quantification from RNA-Seq
    Hiller D.
    Wong W.H.
    Statistics in Biosciences, 2013, 5 (1) : 100 - 118
  • [2] Towards Reliable Isoform Quantification Using RNA-Seq Data
    Howard, Brian E.
    Heber, Steffen
    2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 130 - 135
  • [3] Towards reliable isoform quantification using RNA-SEQ data
    Brian E Howard
    Steffen Heber
    BMC Bioinformatics, 11
  • [4] Towards reliable isoform quantification using RNA-SEQ data
    Howard, Brian E.
    Heber, Steffen
    BMC BIOINFORMATICS, 2010, 11
  • [5] A novel robust statistical method for isoform quantification from RNA-seq data
    Mondal, Pronoy K.
    Chatterjee, Raghunath
    Mukhopadhyay, Indranil
    GENETIC EPIDEMIOLOGY, 2018, 42 (07) : 719 - 719
  • [6] Network-Based Isoform Quantification with RNA-Seq Data for Cancer Transcriptome Analysis
    Zhang, Wei
    Chang, Jae-Woong
    Lin, Lilong
    Minn, Kay
    Wu, Baolin
    Chien, Jeremy
    Yong, Jeongsik
    Zheng, Hui
    Kuang, Rui
    PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (12)
  • [7] Acfs: accurate circRNA identification and quantification from RNA-Seq data
    You, Xintian
    Conrad, Tim O. F.
    SCIENTIFIC REPORTS, 2016, 6
  • [8] WemIQ: an accurate and robust isoform quantification method for RNA-seq data
    Zhang, Jing
    Kuo, C. -C. Jay
    Chen, Liang
    BIOINFORMATICS, 2015, 31 (06) : 878 - 885
  • [9] Acfs: accurate circRNA identification and quantification from RNA-Seq data
    Xintian You
    Tim OF Conrad
    Scientific Reports, 6
  • [10] RISQ: A novel robust statistical approach for isoform quantification from RNA-seq data
    Mondal, Pronoy Kanti
    Chatterjee, Raghunath
    Mukhopadhyay, Indranil
    HUMAN GENOMICS, 2018, 12