Accurate Estimation of Expression Levels of Homologous Genes in RNA-seq Experiments

Cited by: 18
Authors
Pasaniuc, Bogdan [1,2]
Zaitlen, Noah [1,2]
Halperin, Eran [3,4,5]
Affiliations
[1] Harvard Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
[2] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[3] Int Comp Sci Inst, Berkeley, CA 94704 USA
[4] Tel Aviv Univ, Mol Microbiol & Biotechnol Dept, IL-69978 Tel Aviv, Israel
[5] Tel Aviv Univ, Blavatnik Sch Comp Sci, IL-69978 Tel Aviv, Israel
Funding
Israel Science Foundation; US National Science Foundation
Keywords
algorithms; gene searching; genetic mapping; genetic variation; TRANSCRIPTOMES; REVEALS; GENOME; MOUSE;
DOI
10.1089/cmb.2010.0259
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
Next-generation high-throughput sequencing (NGS) is poised to replace array-based technologies as the experiment of choice for measuring RNA expression levels. Several groups have demonstrated the power of this new approach (RNA-seq), making significant and novel contributions and simultaneously proposing methodologies for the analysis of RNA-seq data. In a typical experiment, millions of short sequences (reads) are sampled from RNA extracts and mapped back to a reference genome. The number of reads mapping to each gene is used as a proxy for its corresponding RNA concentration. A significant challenge in analyzing RNA expression of homologous genes is the large fraction of reads that map to multiple locations in the reference genome. Currently, these reads are either dropped from the analysis, or a naive algorithm is used to estimate their underlying distribution. In this work, we present a rigorous alternative for handling the reads generated in an RNA-seq experiment within a probabilistic model for RNA-seq data; we develop maximum likelihood-based methods for estimating the model parameters. In contrast to previous methods, our model takes into account the fact that the DNA of the sequenced individual is not a perfect copy of the reference sequence. We show with both simulated and real RNA-seq data that our new method improves the accuracy and power of RNA-seq experiments.
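The abstract gives no details of the model, so the following is only a generic illustration of how maximum likelihood estimation can allocate multi-mapped reads, not the authors' method. It is a minimal expectation-maximization sketch that assumes each read is equally compatible with every gene it maps to, and it ignores gene length, sequencing error, and differences between the sequenced individual and the reference. The function name `em_multiread_abundance` and the `read_hits` input layout are hypothetical.

```python
from collections import defaultdict


def em_multiread_abundance(read_hits, n_iter=200, tol=1e-10):
    """Estimate the fraction of reads originating from each gene.

    read_hits: list of lists; read_hits[i] holds the distinct genes that
    read i maps to (hypothetical input layout, for illustration only).
    Returns a dict mapping gene -> estimated read fraction.
    """
    reads = [hits for hits in read_hits if hits]  # skip unmapped reads
    genes = sorted({g for hits in reads for g in hits})
    theta = {g: 1.0 / len(genes) for g in genes}  # uniform starting point
    for _ in range(n_iter):
        counts = defaultdict(float)
        # E-step: split each read across its candidate genes in
        # proportion to the current abundance estimates.
        for hits in reads:
            total = sum(theta[g] for g in hits)
            for g in hits:
                counts[g] += theta[g] / total
        # M-step: new abundance is the expected share of all reads.
        new_theta = {g: counts[g] / len(reads) for g in genes}
        delta = max(abs(new_theta[g] - theta[g]) for g in genes)
        theta = new_theta
        if delta < tol:
            break
    return theta


# Toy data: 6 reads unique to gene A, 2 unique to gene B, 4 ambiguous.
toy_reads = [["A"]] * 6 + [["B"]] * 2 + [["A", "B"]] * 4
estimates = em_multiread_abundance(toy_reads)
```

On this toy input the ambiguous reads are shared according to the evidence from the unique reads rather than being dropped or split evenly, which is the general idea behind likelihood-based handling of multi-mapped reads.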
Pages: 459-468
Page count: 10