Executable pathway analysis using ensemble discrete-state modeling for large-scale data

被引:13
|
作者
Palli, Rohith [1 ,2 ]
Palshikar, Mukta G. [2 ]
Thakar, Juilee [2 ,3 ,4 ]
机构
[1] Univ Rochester, Med Scientist Training Program, Rochester, NY USA
[2] Univ Rochester, Biophys Struct & Computat Biol Program, Rochester, NY 14642 USA
[3] Univ Rochester, Dept Microbiol & Immunol, Rochester, NY 14642 USA
[4] Univ Rochester, Dept Biostat & Computat Biol, Rochester, NY 14642 USA
基金
美国国家卫生研究院;
关键词
PROBABILISTIC BOOLEAN NETWORKS; INDUCIBLE GENE-EXPRESSION; CELL-CYCLE PROGRESSION; DISEASE; APOPTOSIS; PROTEIN; TARGET;
D O I
10.1371/journal.pcbi.1007317
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Pathway analysis is widely used to gain mechanistic insights from high-throughput omics data. However, most existing methods do not consider signal integration represented by pathway topology, resulting in enrichment of convergent pathways when downstream genes are modulated. Incorporation of signal flow and integration in pathway analysis could rank the pathways based on modulation in key regulatory genes. This implementation can be facilitated for large-scale data by discrete state network modeling due to simplicity in parameterization. Here, we model cellular heterogeneity using discrete state dynamics and measure pathway activities in cross-sectional data. We introduce a new algorithm, Boolean Omics Network Invariant-Time Analysis (BONITA), for signal propagation, signal integration, and pathway analysis. Our signal propagation approach models heterogeneity in transcriptomic data as arising from intercellular heterogeneity rather than intracellular stochasticity, and propagates binary signals repeatedly across networks. Logic rules defining signal integration are inferred by genetic algorithm and are refined by local search. The rules determine the impact of each node in a pathway, which is used to score the probability of the pathway's modulation by chance. We have comprehensively tested BONITA for application to transcriptomics data from translational studies. Comparison with state-of-the-art pathway analysis methods shows that BONITA has higher sensitivity at lower levels of source node modulation and similar sensitivity at higher levels of source node modulation. Application of BONITA pathway analysis to previously validated RNA-sequencing studies identifies additional relevant pathways in in-vitro human cell line experiments and in-vivo infant studies. Additionally, BONITA successfully detected modulation of disease specific pathways when comparing relevant RNA-sequencing data with healthy controls. Most interestingly, the two highest impact score nodes identified by BONITA included known drug targets. Thus, BONITA is a powerful approach to prioritize not only pathways but also specific mechanistic role of genes compared to existing methods. BONITA is available at: https://github.com/thakar-lab/BONITA.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Large-Scale Cellular Network Modeling From Population Data: An Empirical Analysis
    Achtzehn, Andreas
    Riihijarvi, Janne
    Mahonen, Petri
    IEEE COMMUNICATIONS LETTERS, 2016, 20 (11) : 2292 - 2295
  • [32] A novel ensemble-based paradigm to process large-scale data
    Thanh Trinh
    HoangAnh Le
    Nhung VuongThi
    Hai HoangDuc
    KieuAnh VuThi
    Multimedia Tools and Applications, 2024, 83 (9) : 26663 - 26685
  • [33] Effective ensemble learning approach for large-scale medical data analytics
    Namamula, Lakshmana Rao
    Chaytor, Daniel
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (01) : 13 - 20
  • [34] ORM Ontologies with Executable Derivation Rules to Support Semantic Search in Large-Scale Data Applications
    Bur, Marton
    Stirewalt, Kurt
    ACM/IEEE 25TH INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS, MODELS 2022 COMPANION, 2022, : 81 - 82
  • [35] A novel ensemble-based paradigm to process large-scale data
    Thanh Trinh
    HoangAnh Le
    Nhung VuongThi
    Hai HoangDuc
    KieuAnh VuThi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 26663 - 26685
  • [36] Ensemble Riemannian data assimilation: towards large-scale dynamical systems
    Tamang, Sagar K.
    Ebtehaj, Ardeshir
    van Leeuwen, Peter Jan
    Lerman, Gilad
    Foufoula-Georgiou, Efi
    NONLINEAR PROCESSES IN GEOPHYSICS, 2022, 29 (01) : 77 - 92
  • [37] A three-way cluster ensemble approach for large-scale data
    Yu, Hong
    Chen, Yun
    Lingras, Pawan
    Wang, Guoyin
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2019, 115 (32-49) : 32 - 49
  • [38] Effective ensemble learning approach for large-scale medical data analytics
    Lakshmana Rao Namamula
    Daniel Chaytor
    International Journal of System Assurance Engineering and Management, 2024, 15 : 13 - 20
  • [39] An elastic framework for ensemble-based large-scale data assimilation
    Friedemann, Sebastian
    Raffin, Bruno
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2022, 36 (04): : 543 - 563
  • [40] Large-Scale Power Systems State Estimation Using PMU and SCADA Data
    Saadabadi, Hamideh
    Dehghani, Maryam
    2016 24TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2016, : 906 - 911