ACE: adaptive cluster expansion for maximum entropy graphical model inference

被引:52
|
作者
Barton, J. P. [1 ,2 ,3 ]
De Leonardis, E. [4 ,5 ,6 ]
Coucke, A. [5 ,6 ,7 ]
Cocco, S. [4 ,5 ]
机构
[1] MIT, Dept Chem Engn, Cambridge, MA 02139 USA
[2] MIT, Dept Phys, Cambridge, MA 02139 USA
[3] Massachusetts Inst Technol & Harvard, Ragon Inst, Massachusetts Gen Hosp, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[4] Ecole Normale Super, CNRS, Lab Phys Stat, Paris, France
[5] Univ P&M Curie, Paris, France
[6] Sorbonne Univ, Computat & Quantitat Biol, UPMC, UMR 7238, Paris, France
[7] Ecole Normale Super, CNRS, Phys Theor Lab, Paris, France
关键词
DIRECT-COUPLING ANALYSIS;
D O I
10.1093/bioinformatics/btw328
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Graphical models are often employed to interpret patterns of correlations observed in data through a network of interactions between the variables. Recently, Ising/Potts models, also known as Markov random fields, have been productively applied to diverse problems in biology, including the prediction of structural contacts from protein sequence data and the description of neural activity patterns. However, inference of such models is a challenging computational problem that cannot be solved exactly. Here, we describe the adaptive cluster expansion (ACE) method to quickly and accurately infer Ising or Potts models based on correlation data. ACE avoids overfitting by constructing a sparse network of interactions sufficient to reproduce the observed correlation data within the statistical error expected due to finite sampling. When convergence of the ACE algorithm is slow, we combine it with a Boltzmann Machine Learning algorithm (BML). We illustrate this method on a variety of biological and artificial datasets and compare it to state-of-the-art approximate methods such as Gaussian and pseudo-likelihood inference. Results: We show that ACE accurately reproduces the true parameters of the underlying model when they are known, and yields accurate statistical descriptions of both biological and artificial data. Models inferred by ACE more accurately describe the statistics of the data, including both the constrained low-order correlations and unconstrained higher-order correlations, compared to those obtained by faster Gaussian and pseudo-likelihood methods. These alternative approaches can recover the structure of the interaction network but typically not the correct strength of interactions, resulting in less accurate generative models.
引用
收藏
页码:3089 / 3097
页数:9
相关论文
共 50 条
  • [1] Maximum entropy relaxation for multiscale graphical model selection
    Choi, Myung Jin
    Chandrasekaran, Venkat
    Willsky, Alan S.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1889 - 1892
  • [2] On Maximum Entropy and Inference
    Gresele, Luigi
    Marsili, Matteo
    [J]. ENTROPY, 2017, 19 (12):
  • [3] Massive inference and maximum entropy
    Skilling, J
    [J]. MAXIMUM ENTROPY AND BAYESIAN METHODS, 1998, 98 : 1 - 14
  • [4] Maximum entropy relaxation for graphical model selection given inconsistent statistics
    Chandrasekaran, Venkat
    Johnson, Jason K.
    Willsky, Alan S.
    [J]. 2007 IEEE/SP 14TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 625 - 629
  • [5] Adaptive estimated maximum-entropy distribution model
    Tan, Ling
    Taniar, David
    [J]. INFORMATION SCIENCES, 2007, 177 (15) : 3110 - 3128
  • [6] Adaptive Exact Inference in Graphical Models
    Suemer, Oezguer
    Acar, Umut A.
    Ihler, Alexander T.
    Mettu, Ramgopal R.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 3147 - 3186
  • [7] Maximum entropy and the Edgeworth expansion
    Harremoës, P
    [J]. Proceedings of the IEEE ITSOC Information Theory Workshop 2005 on Coding and Complexity, 2005, : 68 - 71
  • [8] Maximum entropy inference and stimulus generalization
    Myung, IJ
    Shepard, RN
    [J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1996, 40 (04) : 342 - 347
  • [9] MAXIMUM-ENTROPY AND INDUCTIVE INFERENCE
    PARIS, JB
    VENCOVSKA, A
    [J]. MAXIMUM ENTROPY AND BAYESIAN METHODS /, 1989, 36 : 397 - 403
  • [10] Continuity of the Maximum-Entropy Inference
    Weis Stephan
    [J]. Communications in Mathematical Physics, 2014, 330 : 1263 - 1292