BAYESIAN SPARSE GRAPHICAL MODELS FOR CLASSIFICATION WITH APPLICATION TO PROTEIN EXPRESSION DATA

被引:18
|
作者
Baladandayuthapani, Veerabhadran [1 ]
Talluri, Rajesh [1 ]
Ji, Yuan [2 ]
Coombes, Kevin R. [3 ]
Lu, Yiling [4 ]
Hennessy, Bryan T. [5 ]
Davies, Michael A. [6 ]
Mallick, Bani K. [7 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[2] Northshore Univ HealthSyst, Evanston, IL 60201 USA
[3] Ohio State Univ, Wexner Med Ctr, Dept Biomed Informat, Columbus, OH USA
[4] Univ Texas MD Anderson Canc Ctr, Dept Syst Biol, Houston, TX 77030 USA
[5] Beaumont Hosp, Dublin 9, Ireland
[6] Univ Texas MD Anderson Canc Ctr, Dept Melanoma Med Oncol, Houston, TX 77030 USA
[7] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
来源
ANNALS OF APPLIED STATISTICS | 2014年 / 8卷 / 03期
基金
爱尔兰科学基金会;
关键词
Bayesian methods; protein signaling pathways; graphical models; mixture models; PROTEOMIC ANALYSIS; GENE-EXPRESSION; COVARIANCE ESTIMATION; PI3K PATHWAY; CANCER; MUTATIONS; SELECTION; CELLS; ARRAY; ACTIVATION;
D O I
10.1214/14-AOAS722
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Reverse-phase protein array (RPPA) analysis is a powerful, relatively new platform that allows for high-throughput, quantitative analysis of protein networks. One of the challenges that currently limit the potential of this technology is the lack of methods that allow for accurate data modeling and identification of related networks and samples. Such models may improve the accuracy of biological sample classification based on patterns of protein network activation and provide insight into the distinct biological relationships underlying different types of cancer. Motivated by RPPA data, we propose a Bayesian sparse graphical modeling approach that uses selection priors on the conditional relationships in the presence of class information. The novelty of our Bayesian model lies in the ability to draw information from the network data as well as from the associated categorical outcome in a unified hierarchical model for classification. In addition, our method allows for intuitive integration of a priori network information directly in the model and allows for posterior inference on the network topologies both within and between classes. Applying our methodology to an RPPA data set generated from panels of human breast cancer and ovarian cancer cell lines, we demonstrate that the model is able to distinguish the different cancer cell types more accurately than several existing models and to identify differential regulation of components of a critical signaling network (the PI3K-AKT pathway) between these two types of cancer. This approach represents a powerful new tool that can be used to improve our understanding of protein networks in cancer.
引用
收藏
页码:1443 / 1468
页数:26
相关论文
共 50 条
  • [1] Sparse Bayesian Graphical Models for RPPA Time Course Data
    Mitra, Riten
    Mueller, Peter
    Ji, Yuan
    Mills, Gordon
    Lu, Yiling
    2012 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS), 2012, : 113 - 117
  • [2] Bayesian sparse graphical models and their mixtures
    Talluri, Rajesh
    Baladandayuthapani, Veerabhadran
    Mallick, Bani K.
    STAT, 2014, 3 (01): : 109 - 125
  • [3] Sparse graphical models for exploring gene expression data
    Dobra, A
    Hans, C
    Jones, B
    Nevins, JR
    Yao, GA
    West, M
    JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 90 (01) : 196 - 212
  • [4] Bayesian Structure Learning in Sparse Gaussian Graphical Models
    Mohammadi, A.
    Wit, E. C.
    BAYESIAN ANALYSIS, 2015, 10 (01): : 109 - 138
  • [5] Graphical models for sparse data: Graphical Gaussian models with vertex and edge symmetries
    Hojsgaard, Soren
    COMPSTAT 2008: PROCEEDINGS IN COMPUTATIONAL STATISTICS, 2008, : 105 - 116
  • [6] Accelerating Bayesian Structure Learning in Sparse Gaussian Graphical Models
    Mohammadi, Reza
    Massam, Helene
    Letac, Gerard
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 1345 - 1358
  • [7] NONPARAMETRIC BAYESIAN SPARSE FACTOR MODELS WITH APPLICATION TO GENE EXPRESSION MODELING
    Knowles, David
    Ghahramani, Zoubin
    ANNALS OF APPLIED STATISTICS, 2011, 5 (2B): : 1534 - 1552
  • [8] Bayesian Inference for General Gaussian Graphical Models With Application to Multivariate Lattice Data
    Dobra, Adrian
    Lenkoski, Alex
    Rodriguez, Abel
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (496) : 1418 - 1433
  • [9] Bayesian Graphical Models for Multivariate Functional Data
    Zhu, Hongxiao
    Strawn, Nate
    Dunson, David B.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [10] BAYESIAN GRAPHICAL MODELS FOR DISCRETE-DATA
    MADIGAN, D
    YORK, J
    INTERNATIONAL STATISTICAL REVIEW, 1995, 63 (02) : 215 - 232