Inference and Verification of Probabilistic Graphical Models from High-Dimensional Data

被引:1
|
作者
Ma, Yinjiao [1 ]
Damazyn, Kevin [2 ]
Klinger, Jakob [2 ]
Gong, Haijun [2 ]
机构
[1] St Louis Univ, Dept Biostat, St Louis, MO 63103 USA
[2] St Louis Univ, Dept Math & Comptuer Sci, St Louis, MO 63103 USA
关键词
Dimensionality reduction; Gaussian graphical model; Graphical lasso; Dynamic Bayesian network; Model checking; Microarray; Prostate cancer; BAYESIAN NETWORKS; PROSTATE-CANCER; EXPRESSION;
D O I
10.1007/978-3-319-21843-4_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Probabilistic graphical modelling technique has been widely used to infer the causal relations in the network from high-dimensional data. One of the most challenging biological questions is the inference and verification of biological network, for example, gene regulatory network and signaling pathway, from high-dimensional omics data. Conditionally dependent genes and undirected network can be inferred from the independently and identically distributed static data, while the time series data can help reconstruct a directed network which is more important to our understanding of the complex biological system. Due to the curse of dimensionality and network sparsity, statistical inference algorithm alone is not efficient and realistic to infer and verify large networks. In this work, we propose a novel technique, which applies the dimensionality reduction, network inference and formal verification methods together to reconstruct some regulatory networks from the static and time-series microarray data. A graphical lasso algorithm is first applied to learn the structure of Gaussian graphical models from static data and infer some conditionally dependent genes. Then, an extended dynamic Bayesian network method is applied to reconstruct some weighted and directed networks from the time series data of selected genes, and also generate symbolic model verification code for model checking. Finally, we apply this technique to reconstruct and verify some regulatory networks in yeast and prostate cancer in response to stress and irradiation respectively for illustration.
引用
收藏
页码:223 / 239
页数:17
相关论文
共 50 条
  • [11] Nonparametric and high-dimensional functional graphical models
    Solea, Eftychia
    Dette, Holger
    ELECTRONIC JOURNAL OF STATISTICS, 2022, 16 (02): : 6175 - 6231
  • [12] High-dimensional Mixed Graphical Model with Ordinal Data: Parameter Estimation and Statistical Inference
    Feng, Huijie
    Ning, Yang
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 654 - 663
  • [13] ASYMPTOTIC INFERENCE FOR HIGH-DIMENSIONAL DATA
    Kuelbs, Jim
    Vidyashankar, Anand N.
    ANNALS OF STATISTICS, 2010, 38 (02): : 836 - 869
  • [14] Inference Attacks on Genomic Data Based on Probabilistic Graphical Models
    Zaobo He
    Junxiu Zhou
    Big Data Mining and Analytics, 2020, (03) : 225 - 233
  • [15] Inference Attacks on Genomic Data Based on Probabilistic Graphical Models
    He, Zaobo
    Zhou, Junxiu
    BIG DATA MINING AND ANALYTICS, 2020, 3 (03) : 225 - 233
  • [16] Probabilistic classifiers with high-dimensional data
    Kim, Kyung In
    Simon, Richard
    BIOSTATISTICS, 2011, 12 (03) : 399 - 412
  • [17] Learning Sparse High-Dimensional Matrix-Valued Graphical Models From Dependent Data
    Tugnait, Jitendra K.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 3363 - 3379
  • [18] Inference of Multiple High-Dimensional Networks with the Graphical Horseshoe Prior
    Busatto, Claudio
    Stingo, Francesco Claudio
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2025,
  • [19] Online inference in high-dimensional generalized linear models with streaming data
    Luo, Lan
    Han, Ruijian
    Lin, Yuanyuan
    Huang, Jian
    ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (02): : 3443 - 3471
  • [20] Inference on High-dimensional Single-index Models with Streaming Data
    Han, Dongxiao
    Xie, Jinhan
    Liu, Jin
    Sun, Liuquan
    Huang, Jian
    Jiang, Bei
    Kong, Linglong
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25