A bi-Poisson model for clustering gene expression profiles by RNA-seq

被引:6
|
作者
Wang, Ningtao [1 ]
Wang, Yaqun [1 ]
Hao, Han [1 ]
Wang, Luojun [1 ]
Wang, Zhong
Wang, Jianxin [2 ]
Wu, Rongling [1 ,3 ,4 ]
机构
[1] Penn State Univ, Hershey, PA 17033 USA
[2] Beijing Forestry Univ, Beijing, Peoples R China
[3] Penn State Univ, Ctr Stat Genet, Hershey, PA 17033 USA
[4] Beijing Forestry Univ, Ctr Computat Biol, Beijing, Peoples R China
关键词
RNA-seq; Poisson distribution; EM algorithm; breast cancer cell lines; DIFFERENTIAL EXPRESSION; TRANSCRIPTION FACTORS; DYNAMICS;
D O I
10.1093/bib/bbt029
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With the availability of gene expression data by RNA-seq, powerful statistical approaches for grouping similar gene expression profiles across different environments have become increasingly important. We describe and assess a computational model for clustering genes into distinct groups based on the pattern of gene expression in response to changing environment. The model capitalizes on the Poisson distribution to capture the count property of RNA-seq data. A two-stage hierarchical expectation-maximization (EM) algorithm is implemented to estimate an optimal number of groups and mean expression amounts of each group across two environments. A procedure is formulated to test whether and how a given group shows a plastic response to environmental changes. The impact of gene-environment interactions on the phenotypic plasticity of the organism can also be visualized and characterized. The model was used to analyse an RNA-seq dataset measured from two cell lines of breast cancer that respond differently to an anti-cancer drug, from which genes associated with the resistance and sensitivity of the cell lines are identified. We performed simulation studies to validate the statistical behaviour of the model. The model provides a useful tool for clustering gene expression data by RNA-seq, facilitating our understanding of gene functions and networks.
引用
收藏
页码:534 / 541
页数:8
相关论文
共 50 条
  • [21] Deep Learning to Analyze RNA-Seq Gene Expression Data
    Urda, D.
    Montes-Torres, J.
    Moreno, F.
    Franco, L.
    Jerez, J. M.
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2017, PT II, 2017, 10306 : 50 - 59
  • [22] The NBP Negative Binomial Model for Assessing Differential Gene Expression from RNA-Seq
    Di, Yanming
    Schafer, Daniel W.
    Cumbie, Jason S.
    Chang, Jeff H.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)
  • [23] A Unified Model for Joint Normalization and Differential Gene Expression Detection in RNA-Seq Data
    Liu, Kefei
    Ye, Jieping
    Yang, Yang
    Shen, Li
    Jiang, Hui
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (02) : 442 - 454
  • [24] RNA-seq and microarray gene expression vie for toxicogenomics superiority
    Tong, W.
    TOXICOLOGY LETTERS, 2015, 238 (02) : S226 - S227
  • [25] An RNA-Seq based gene expression atlas of the common bean
    Jamie A O’Rourke
    Luis P Iniguez
    Fengli Fu
    Bruna Bucciarelli
    Susan S Miller
    Scott A Jackson
    Philip E McClean
    Jun Li
    Xinbin Dai
    Patrick X Zhao
    Georgina Hernandez
    Carroll P Vance
    BMC Genomics, 15
  • [26] RNA-Seq gene expression estimation with read mapping uncertainty
    Li, Bo
    Ruotti, Victor
    Stewart, Ron M.
    Thomson, James A.
    Dewey, Colin N.
    BIOINFORMATICS, 2010, 26 (04) : 493 - 500
  • [27] An RNA-Seq based gene expression atlas of the common bean
    O'Rourke, Jamie A.
    Iniguez, Luis P.
    Fu, Fengli
    Bucciarelli, Bruna
    Miller, Susan S.
    Jackson, Scott A.
    McClean, Philip E.
    Li, Jun
    Dai, Xinbin
    Zhao, Patrick X.
    Hernandez, Georgina
    Vance, Carroll P.
    BMC GENOMICS, 2014, 15
  • [28] RNA-seq analyses of gene expression in the microsclerotia of Verticillium dahliae
    Dechassa Duressa
    Amy Anchieta
    Dongquan Chen
    Anna Klimes
    Maria D Garcia-Pedrajas
    Katherine F Dobinson
    Steven J Klosterman
    BMC Genomics, 14
  • [29] Comparison of gene expression platforms: RNA-Seq, Fluidigm, and Nanostring
    Schleifman, Erica B.
    Motlhabi, Maipelo
    Cummings, Craig
    Nakamura, Rin
    Bosch, Linda
    Patel, Rajesh
    Do, An
    Watson, Andrew
    Sandmann, Thomas
    Darbonne, Walter
    McCaffery, Ian
    Peters, Eric
    Raja, Rajiv
    CANCER RESEARCH, 2015, 75
  • [30] Investigation of Factors Affecting RNA-Seq Gene Expression Calls
    Harati, Sahar
    Phan, John H.
    Wang, May D.
    2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2014, : 5232 - 5235