Reinforcement learning and approximate Bayesian computation for model selection and parameter calibration applied to a nonlinear

被引:11
|
作者
Ritto, T. G. [1 ,2 ]
Beregi, S. [2 ]
Barton, D. A. W. [2 ]
机构
[1] Univ Fed Rio de Janeiro, Dept Mech Engn, Rio De Janeiro, Brazil
[2] Univ Bristol, Fac Engn, Bristol, England
基金
英国工程与自然科学研究理事会;
关键词
Nonlinear dynamics; Parameter identification; Model selection; Reinforcement learning; ABC; Decision under uncertainty; MONTE-CARLO; ROBOTICS;
D O I
10.1016/j.ymssp.2022.109485
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
In the context of digital twins and integration of physics-based models with machine learning tools, this paper proposes a new methodology for model selection and parameter identification. It combines (i) reinforcement learning (RL) for model selection through a Thompson-like sampling with (ii) approximate Bayesian computation (ABC) for parameter identification and uncertainty quantification. These two methods are applied together to a nonlinear mechanical oscillator with periodic forcing. Experimental data are used in the analysis and two different nonlinear models are tested. The initial Beta distribution that represents the likelihood of the model is updated depending on how successful the model is at reproducing the reference data (reinforcement learning strategy). At the same time, the prior distribution of the model parameters is updated using a likelihood-free strategy (ABC). In the end, the rewards and the posterior distribution of the parameters of each model are obtained. The results show that the combined methodology (RL-ABC) is promising for model selection from bifurcation diagrams. Prior parameter distribution was successfully updated, correlations between parameters were found, probabilistic envelopes of the posterior model are consistent with the available data, the most rewarded model was selected, and the reinforcing strategy allows to speed up the selection process.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Bayesian model selection and calibration applied to composite load identification
    Viana, Felipe A. C.
    Pan, Yan
    Bose, Sumit
    2014 IEEE PES T&D CONFERENCE AND EXPOSITION, 2014,
  • [22] Adaptive Approximate Bayesian Computation Tolerance Selection
    Simola, Umberto
    Cisewski-Kehe, Jessi
    Gutmann, Michael U.
    Corander, Jukka
    BAYESIAN ANALYSIS, 2021, 16 (02): : 397 - 423
  • [23] Parameter inference and model selection in deterministic and stochastic dynamical models via approximate Bayesian computation: modeling a wildlife epidemic
    Sun, Libo
    Lee, Chihoon
    Hoeting, Jennifer A.
    ENVIRONMETRICS, 2015, 26 (07) : 451 - 462
  • [24] Distance-learning For Approximate Bayesian Computation To Model a Volcanic Eruption
    Lorenzo Pacchiardi
    Pierre Künzli
    Marcel Schöngens
    Bastien Chopard
    Ritabrata Dutta
    Sankhya B, 2021, 83 : 288 - 317
  • [25] Distance-learning For Approximate Bayesian Computation To Model a Volcanic Eruption
    Pacchiardi, Lorenzo
    Kuenzli, Pierre
    Chopard, Bastien
    Schoengens, Marcel
    Dutta, Ritabrata
    SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2021, 83 (01): : 288 - 317
  • [26] Approximate planning for bayesian hierarchical reinforcement learning
    Ngo Anh Vien
    Hung Ngo
    Lee, Sungyoung
    Chung, TaeChoong
    APPLIED INTELLIGENCE, 2014, 41 (03) : 808 - 819
  • [27] Dual Control for Approximate Bayesian Reinforcement Learning
    Klenske, Edgar D.
    Hennig, Philipp
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [28] Dual control for approximate Bayesian reinforcement learning
    Klenske, Edgar D.
    Hennig, Philipp
    Journal of Machine Learning Research, 2016, 17 : 1 - 30
  • [29] On Optimal Selection of Summary Statistics for Approximate Bayesian Computation
    Nunes, Matthew A.
    Balding, David J.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2010, 9 (01)
  • [30] Approximate planning for bayesian hierarchical reinforcement learning
    Ngo Anh Vien
    Hung Ngo
    Sungyoung Lee
    TaeChoong Chung
    Applied Intelligence, 2014, 41 : 808 - 819