Executable pathway analysis using ensemble discrete-state modeling for large-scale data

Cited by: 13
Authors
Palli, Rohith [1, 2]
Palshikar, Mukta G. [2]
Thakar, Juilee [2, 3, 4]
Affiliations
[1] Univ Rochester, Med Scientist Training Program, Rochester, NY USA
[2] Univ Rochester, Biophys Struct & Computat Biol Program, Rochester, NY 14642 USA
[3] Univ Rochester, Dept Microbiol & Immunol, Rochester, NY 14642 USA
[4] Univ Rochester, Dept Biostat & Computat Biol, Rochester, NY 14642 USA
Funding
National Institutes of Health (USA);
Keywords
PROBABILISTIC BOOLEAN NETWORKS; INDUCIBLE GENE-EXPRESSION; CELL-CYCLE PROGRESSION; DISEASE; APOPTOSIS; PROTEIN; TARGET;
DOI
10.1371/journal.pcbi.1007317
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010; 081704;
Abstract
Pathway analysis is widely used to gain mechanistic insights from high-throughput omics data. However, most existing methods do not consider the signal integration represented by pathway topology, which results in enrichment of convergent pathways whenever downstream genes are modulated. Incorporating signal flow and integration into pathway analysis could rank pathways based on modulation of key regulatory genes. For large-scale data, discrete-state network modeling makes this feasible because it is simple to parameterize. Here, we model cellular heterogeneity using discrete-state dynamics and measure pathway activities in cross-sectional data. We introduce a new algorithm, Boolean Omics Network Invariant-Time Analysis (BONITA), for signal propagation, signal integration, and pathway analysis. Our signal propagation approach models heterogeneity in transcriptomic data as arising from intercellular heterogeneity rather than intracellular stochasticity, and propagates binary signals repeatedly across networks. Logic rules defining signal integration are inferred by a genetic algorithm and refined by local search. The rules determine the impact of each node in a pathway, which is used to score the probability of the pathway's modulation by chance. We have comprehensively tested BONITA for application to transcriptomics data from translational studies. Comparison with state-of-the-art pathway analysis methods shows that BONITA has higher sensitivity at lower levels of source node modulation and similar sensitivity at higher levels of source node modulation. Application of BONITA pathway analysis to previously validated RNA-sequencing studies identifies additional relevant pathways in in vitro human cell line experiments and in vivo infant studies. Additionally, BONITA successfully detected modulation of disease-specific pathways when comparing relevant RNA-sequencing data with healthy controls. Most interestingly, the two nodes with the highest impact scores identified by BONITA included known drug targets. Thus, compared to existing methods, BONITA is a powerful approach to prioritize not only pathways but also the specific mechanistic roles of genes. BONITA is available at: https://github.com/thakar-lab/BONITA.
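The idea of repeatedly propagating binary signals through a network, with each cell treated as its own starting state, can be illustrated with a toy sketch. This is not BONITA's code or API: the three-node pathway, the AND rule, the synchronous update scheme, and all function names below are assumptions made purely for illustration. In BONITA itself, the logic rules are inferred from data rather than written by hand.

```python
from typing import Callable, Dict, List

State = Dict[str, bool]
Rule = Callable[[State], bool]

# Hypothetical pathway: A and B are source nodes, C integrates both signals.
RULES: Dict[str, Rule] = {
    "A": lambda s: s["A"],             # source node keeps its own value
    "B": lambda s: s["B"],             # source node keeps its own value
    "C": lambda s: s["A"] and s["B"],  # illustrative AND rule (assumed)
}


def step(state: State, rules: Dict[str, Rule]) -> State:
    """Apply every rule once to the current state (synchronous update)."""
    return {node: rule(state) for node, rule in rules.items()}


def propagate(state: State, rules: Dict[str, Rule], n_steps: int = 10) -> State:
    """Propagate the binary signal for a fixed number of update steps."""
    for _ in range(n_steps):
        state = step(state, rules)
    return state


def mean_activity(cells: List[State], rules: Dict[str, Rule],
                  n_steps: int = 10) -> Dict[str, float]:
    """Average each node's final state over all cells.

    Each cell is one binary starting state, so variation between cells
    stands in for intercellular heterogeneity.
    """
    finals = [propagate(cell, rules, n_steps) for cell in cells]
    return {node: sum(f[node] for f in finals) / len(finals) for node in rules}


# Four made-up cells with different binarized expression states.
cells = [
    {"A": True, "B": True, "C": False},
    {"A": True, "B": False, "C": False},
    {"A": False, "B": True, "C": True},
    {"A": True, "B": True, "C": False},
]
activity = mean_activity(cells, RULES)
print(activity)
```

In this toy setup the downstream node C ends up active only in cells where both sources are on, so its average activity reflects how the rule integrates the upstream signals rather than C's own initial measurement.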
Pages: 21
Related papers
50 in total
  • [41] Modeling Large-Scale Slim Fly Networks Using Parallel Discrete-Event Simulation
    Wolfe, Noah
    Mubarak, Misbah
    Carothers, Christopher D.
    Ross, Robert B.
    Carns, Philip H.
    ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2018, 28 (04):
  • [42] Direction pathway analysis of large-scale proteomics data reveals novel features of the insulin action pathway
    Yang, Pengyi
    Patrick, Ellis
    Tan, Shi-Xiong
    Fazakerley, Daniel J.
    Burchfield, James
    Gribben, Christopher
    Prior, Matthew J.
    James, David E.
    Yang, Yee Hwa
    BIOINFORMATICS, 2014, 30 (06) : 808 - 814
  • [43] Large-Scale Data-Driven Financial Risk Modeling using Big Data Technology
    Stockinger, Kurt
    Heitz, Jonas
    Bundi, Nils
    Breymann, Wolfgang
    2018 IEEE/ACM 5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING APPLICATIONS AND TECHNOLOGIES (BDCAT), 2018, : 206 - 207
  • [44] INTERACTION OF A CUMULUS CLOUD ENSEMBLE WITH THE LARGE-SCALE ENVIRONMENT .4. THE DISCRETE MODEL
    LORD, SJ
    CHAO, WC
    ARAKAWA, A
    JOURNAL OF THE ATMOSPHERIC SCIENCES, 1982, 39 (01) : 104 - 113
  • [45] Large-Scale Analysis of Soccer Matches using Spatiotemporal Tracking Data
    Bialkowski, Alina
    Lucey, Patrick
    Carr, Peter
    Yue, Yisong
    Sridharan, Sridha
    Matthews, Iain
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 725 - 730
  • [46] Analysis of Elderly Drivers' Performance Using Large-Scale Test Data
    Nakano, Yasuhiko
    Kawanaka, Haruki
    Oguri, Koji
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (01): : 243 - 251
  • [47] Synthesis of decentralized state feedbacks for large-scale discrete event systems
    Takai, Shigemasa
    Kodama, Shinzo
    Ushio, Toshimitsu
    Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi), 1994, 77 (04): : 34 - 43
  • [48] ON THE STATE ESTIMATION OF LARGE-SCALE SYSTEMS IN THE DISCRETE-TIME DOMAIN
    MAHALANABIS, AK
    RAY, G
    LARGE SCALE SYSTEMS IN INFORMATION AND DECISION TECHNOLOGIES, 1982, 3 (02): : 97 - 109
  • [49] Large-Scale Data Analysis on Cloud Systems
    Marozzo, Fabrizio
    Talia, Domenico
    Trunfio, Paolo
    ERCIM NEWS, 2012, (89): : 26 - 27
  • [50] TranSeqAnnotator: large-scale analysis of transcriptomic data
    Menon, Ranjeeta
    Garg, Gagan
    Gasser, Robin B.
    Ranganathan, Shoba
    BMC BIOINFORMATICS, 2012, 13