Sworn testimony of the model evidence: Gaussian Mixture Importance (GAME) sampling

被引:31
|
作者
Volpi, Elena [1 ]
Schoups, Gerrit [2 ]
Firmani, Giovanni [1 ]
Vrugt, Jasper A. [3 ,4 ]
机构
[1] Univ Roma Tre, Dept Engn, Rome, Italy
[2] Delft Univ Technol, Dept Water Management, Delft, Netherlands
[3] Univ Calif Irvine, Dept Civil & Environm Engn, Irvine, CA USA
[4] Univ Calif Irvine, Dept Earth Syst Sci, Irvine, CA USA
关键词
MARGINAL LIKELIHOOD ESTIMATION; BAYES FACTORS; SIMULATION; SELECTION; UNCERTAINTY; INFERENCE;
D O I
10.1002/2016WR020167
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
What is the "best'' model? The answer to this question lies in part in the eyes of the beholder, nevertheless a good model must blend rigorous theory with redeeming qualities such as parsimony and quality of fit. Model selection is used to make inferences, via weighted averaging, from a set of K candidate models, M-k; k = (1, ..., K), and help identify which model is most supported by the observed data, (gamma) over tilde = ((gamma) over tilde (1), ... (gamma) over tilde (n)), Here, we introduce a new and robust estimator of the model evidence, p((gamma) over tilde vertical bar M-k), which acts as normalizing constant in the denominator of Bayes' theorem and provides a single quantitative measure of relative support for each hypothesis that integrates model accuracy, uncertainty, and complexity. However, p((gamma) over tilde vertical bar M-k) is analytically intractable for most practical modeling problems. Our method, coined GAussian Mixture importancE (GAME) sampling, uses bridge sampling of a mixture distribution fitted to samples of the posterior model parameter distribution derived from MCMC simulation. We benchmark the accuracy and reliability of GAME sampling by application to a diverse set of multivariate target distributions (up to 100 dimensions) with known values of p((gamma) over tilde vertical bar M-k) and to hypothesis testing using numerical modeling of the rainfall-runoff transformation of the Leaf River watershed in Mississippi, USA. These case studies demonstrate that GAME sampling provides robust and unbiased estimates of the evidence at a relatively small computational cost outperforming commonly used estimators. The GAME sampler is implemented in the MATLAB package of DREAM and simplifies considerably scientific inquiry through hypothesis testing and model selection. Plain Language Summary Science is an iterative process for learning and discovery in which competing ideas about how nature works are evaluated against observations. The translation of each hypothesis to a computational model requires specification of system boundaries, inputs and outputs, state variables, physical/behavioral laws, and material properties; this is difficult and subjective, particularly in the face of incomplete knowledge of the governing spatiotemporal processes and insufficient observed data. To guard against the use of an inadequate model, statisticians advise selecting the `` best'' model among a set of candidate ones where each might be equally plausible and justifiable a priori. Bayesian model selection uses probability theory to select among competing hypotheses; the key variable is the Bayesian model evidence, which provides a single quantitative measure of relative support for each hypothesis that integrates model accuracy, uncertainty, and complexity. Bayesian model selection has not entered into mainstream use in Earth systems modeling due to the lack of general-purpose methods to reliably estimate the evidence. Here, we introduce a new method, called GAussian Mixture importancE (GAME) sampling. We demonstrate GAME power and usefulness for hypothesis testing using benchmark experiments with known target and numerical modeling of the rainfall-runoff transformation of the Leaf River watershed (Mississippi, USA).
引用
收藏
页码:6133 / 6158
页数:26
相关论文
共 50 条
  • [1] Importance Sampling Schemes for Evidence Approximation in Mixture Models
    Lee, Jeong Eun
    Robert, Christian P.
    [J]. BAYESIAN ANALYSIS, 2016, 11 (02): : 573 - 597
  • [2] Scalable importance sampling estimation of Gaussian mixture posteriors in Bayesian networks
    Ramos-Lopez, Dario
    Masegosa, Andres R.
    Salmeron, Antonio
    Rumi, Rafael
    Langseth, Helge
    Nielsen, Thomas D.
    Madsen, Anders L.
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 100 : 115 - 134
  • [3] Scalable importance sampling estimation of Gaussian mixture posteriors in Bayesian networks
    [J]. Ramos-López, Darío (dramoslopez@ual.es), 1600, Elsevier Inc. (100):
  • [4] Cross-entropy-based adaptive importance sampling using Gaussian mixture
    Kurtz, Nolan
    Song, Junho
    [J]. STRUCTURAL SAFETY, 2013, 42 : 35 - 44
  • [5] Gaussian mixture importance sampling function for unscented SMC-PHD filter
    Yoon, Ju Hong
    Kim, Du Yong
    Yoon, Kuk-Jin
    [J]. SIGNAL PROCESSING, 2013, 93 (09) : 2664 - 2670
  • [6] Dynamic game level design using Gaussian mixture model
    Lee, Sangkyung
    Jung, Keechul
    [J]. PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 955 - 959
  • [7] An efficient mixture sampling model for gaussian estimation of distribution algorithm
    Dang, Qianlong
    Gao, Weifeng
    Gong, Maoguo
    [J]. INFORMATION SCIENCES, 2022, 608 : 1157 - 1182
  • [8] An efficient mixture sampling model for gaussian estimation of distribution algorithm
    Dang, Qianlong
    Gao, Weifeng
    Gong, Maoguo
    [J]. Information Sciences, 2022, 608 : 1157 - 1182
  • [9] Adaptive mixture importance sampling
    Raghavan, N
    Cox, DD
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1998, 60 (03) : 237 - 259
  • [10] GAME: Gaussian Mixture Model Mapping and Navigation Engine on Embedded FPGA
    Xu, Yuanfan
    Zhang, Zhaoliang
    Yu, Jincheng
    Cao, Jianfei
    Dong, Haolin
    Huang, Zhengfeng
    Wang, Yu
    Yang, Huazhong
    [J]. 2021 IEEE 29TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2021), 2021, : 60 - 68