Summary statistics and discrepancy measures for approximate Bayesian computation via surrogate posteriors

被引:0
|
作者
Florence Forbes
Hien Duy Nguyen
TrungTin Nguyen
Julyan Arbel
机构
[1] University of Grenoble Alpes,School of Mathematics and Physics
[2] Inria,undefined
[3] CNRS,undefined
[4] Grenoble INP,undefined
[5] LJK,undefined
[6] Inria Grenoble Rhone-Alpes,undefined
[7] University of Queensland,undefined
[8] Normandie University,undefined
[9] UNICAEN,undefined
[10] CNRS,undefined
[11] LMNO,undefined
来源
Statistics and Computing | 2022年 / 32卷
关键词
Approximate Bayesian computation; Summary statistics; Surrogate models; Gaussian mixtures; Wasserstein distance; Multimodal posterior distributions;
D O I
暂无
中图分类号
学科分类号
摘要
A key ingredient in approximate Bayesian computation (ABC) procedures is the choice of a discrepancy that describes how different the simulated and observed data are, often based on a set of summary statistics when the data cannot be compared directly. Unless discrepancies and summaries are available from experts or prior knowledge, which seldom occurs, they have to be chosen, and thus their choice can affect the quality of approximations. The choice between discrepancies is an active research topic, which has mainly considered data discrepancies requiring samples of observations or distances between summary statistics. In this work, we introduce a preliminary learning step in which surrogate posteriors are built from finite Gaussian mixtures using an inverse regression approach. These surrogate posteriors are then used in place of summary statistics and compared using metrics between distributions in place of data discrepancies. Two such metrics are investigated: a standard L2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$_2$$\end{document} distance and an optimal transport-based distance. The whole procedure can be seen as an extension of the semi-automatic ABC framework to a functional summary statistics setting and can also be used as an alternative to sample-based approaches. The resulting ABC quasi-posterior distribution is shown to converge to the true one, under standard conditions. Performance is illustrated on both synthetic and real data sets, where it is shown that our approach is particularly useful when the posterior is multimodal.
引用
收藏
相关论文
共 50 条
  • [1] Summary statistics and discrepancy measures for approximate Bayesian computation via surrogate posteriors
    Forbes, Florence
    Nguyen, Hien Duy
    Nguyen, TrungTin
    Arbel, Julyan
    STATISTICS AND COMPUTING, 2022, 32 (05)
  • [2] Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation
    Fearnhead, Paul
    Prangle, Dennis
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2012, 74 : 419 - 474
  • [3] On Optimal Selection of Summary Statistics for Approximate Bayesian Computation
    Nunes, Matthew A.
    Balding, David J.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2010, 9 (01)
  • [4] Choosing the Summary Statistics and the Acceptance Rate in Approximate Bayesian Computation
    Blum, Michael G. B.
    COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 47 - 56
  • [5] Approximate Bayesian Computation Without Summary Statistics: The Case of Admixture
    Sousa, Vitor C.
    Fritz, Marielle
    Beaumont, Mark A.
    Chikhi, Lounes
    GENETICS, 2009, 181 (04) : 1507 - 1519
  • [6] A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation
    Aeschbacher, Simon
    Beaumont, Mark A.
    Futschik, Andreas
    GENETICS, 2012, 192 (03) : 1027 - +
  • [7] Convolutional Neural Networks as Summary Statistics for Approximate Bayesian Computation
    Akesson, Mattias
    Singh, Prashant
    Wrede, Fredrik
    Hellander, Andreas
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3353 - 3365
  • [8] A Parzen-Based Distance Between Probability Measures as an Alternative of Summary Statistics in Approximate Bayesian Computation
    Zuluaga, Carlos D.
    Valencia, Edgar A.
    Alvarez, Mauricio A.
    Orozco, Alvaro A.
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2015, PT I, 2015, 9279 : 50 - 61
  • [9] An automatic adaptive method to combine summary statistics in approximate Bayesian computation
    Harrison, Jonathan U.
    Baker, Ruth E.
    PLOS ONE, 2020, 15 (08):
  • [10] Choosing summary statistics by least angle regression for approximate Bayesian computation
    Faisal, Muhammad
    Futschik, Andreas
    Hussain, Ijaz
    Abd-el Moemen, Mitwali
    JOURNAL OF APPLIED STATISTICS, 2016, 43 (12) : 2191 - 2202