Efficient inference for sparse latent variable models of transcriptional regulation

被引:1
|
作者
Dai, Zhenwen [1 ,2 ]
Iqbal, Mudassar [3 ]
Lawrence, Neil D. [1 ,2 ]
Rattray, Magnus [3 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield, S Yorkshire, England
[2] Amazon Res, Cambridge, England
[3] Univ Manchester, Fac Biol Med & Hlth Sci, Div Informat Imaging & Data Sci, Manchester, Lancs, England
基金
英国医学研究理事会;
关键词
GENE NETWORK INFERENCE; MYCOBACTERIUM-TUBERCULOSIS; COMPONENT ANALYSIS; BACILLUS-SUBTILIS; EXPRESSION; INTEGRATION;
D O I
10.1093/bioinformatics/btx508
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Regulation of gene expression in prokaryotes involves complex co-regulatory mechanisms involving large numbers of transcriptional regulatory proteins and their target genes. Uncovering these genome-scale interactions constitutes a major bottleneck in systems biology. Sparse latent factor models, assuming activity of transcription factors (TFs) as unobserved, provide a biologically interpretable modelling framework, integrating gene expression and genome-wide binding data, but at the same time pose a hard computational inference problem. Existing probabilistic inference methods for such models rely on subjective filtering and suffer from scalability issues, thus are not well-suited for realistic genome-scale applications. Results: We present a fast Bayesian sparse factor model, which takes input gene expression and binding sites data, either from ChIP-seq experiments or motif predictions, and outputs active TF-gene links as well as latent TF activities. Our method employs an efficient variational Bayes scheme for model inference enabling its application to large datasets which was not feasible with existing MCMC-based inference methods for such models. We validate our method on synthetic data against a similar model in the literature, employing MCMC for inference, and obtain comparable results with a small fraction of the computational time. We also apply our method to large-scale data from Mycobacterium tuberculosis involving ChIP-seq data on 113 TFs and matched gene expression data for 3863 putative target genes. We evaluate our predictions using an independent transcriptomics experiment involving over-expression of TFs.
引用
收藏
页码:3776 / 3783
页数:8
相关论文
共 50 条
  • [1] Distributed Variational Inference in Sparse Gaussian Process Regression and Latent Variable Models
    Gal, Yarin
    van der Wilk, Mark
    Rasmussen, Carl E.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [2] Spectral Latent Variable Models for perceptual inference
    Kanaujia, Atul
    Sminchisescu, Cristian
    Metaxas, Dimitris
    [J]. 2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 142 - +
  • [3] Performance evaluation of latent variable models with sparse priors
    Wipf, David
    Palmer, Jason
    Rao, Bhaskar
    Kreutz-Delgado, Kenneth
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 453 - +
  • [4] Exact Inference for Integer Latent-Variable Models
    Winner, Kevin
    Sujono, Debora
    Sheldon, Dan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [5] Computationally efficient inference for latent position network models
    Rastelli, Riccardo
    Maire, Florian
    Friel, Nial
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (01): : 2531 - 2570
  • [6] Sparse signals recovery based on latent variable Bayesian models
    Wang, Feng
    Xiang, Xin
    Yi, Ke-Chu
    Xiong, Lei
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2015, 37 (01): : 97 - 102
  • [7] Inference and Interval Estimation for Indirect Effects With Latent Variable Models
    Falk, Carl F.
    Biesanz, Jeremy C.
    [J]. MULTIVARIATE BEHAVIORAL RESEARCH, 2011, 46 (06) : 1012 - 1012
  • [8] What are the advantages of MCMC based inference in latent variable models?
    Paap, R
    [J]. STATISTICA NEERLANDICA, 2002, 56 (01) : 2 - 22
  • [9] SINGLE INDEX LATENT VARIABLE MODELS FOR NETWORK TOPOLOGY INFERENCE
    Mei, Jonathan
    Moura, Jose M. F.
    [J]. 2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 703 - 707
  • [10] Perturbative Corrections for Approximate Inference in Gaussian Latent Variable Models
    Opper, Manfred
    Paquet, Ulrich
    Winther, Ole
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 2857 - 2898