Bayesian reinforcement learning reliability analysis

被引:0
|
作者
Zhou, Tong [1 ]
Guo, Tong [2 ]
Dang, Chao [3 ]
Beer, Michael [3 ,4 ,5 ]
机构
[1] Department of Civil and Environmental Engineering, The Hong Kong Polytechnic University, Hong Kong
[2] School of Civil Engineering, Southeast University, Nanjing,211189, China
[3] Institute for Risk and Reliability, Leibniz University Hannover, Hannover,30167, Germany
[4] Institute for Risk and Reliability, University of Liverpool, Liverpool,L69 7ZF, United Kingdom
[5] International Joint Research Center for Resilient Infrastructure & International Joint Research Center for Engineering Reliability and Stochastic Mechanics, Tongji University, Shanghai,200092, China
基金
中国国家自然科学基金;
关键词
Bayesian networks - Computational efficiency - Inference engines - Learning systems - Markov processes - Reinforcement learning - Statistics - Uncertainty analysis;
D O I
暂无
中图分类号
学科分类号
摘要
A Bayesian reinforcement learning reliability method that combines Bayesian inference for the failure probability estimation and reinforcement learning-guided sequential experimental design is proposed. The reliability-oriented sequential experimental design is framed as a finite-horizon Markov decision process (MDP), with the associated utility function defined by a measure of epistemic uncertainty about Kriging-estimated failure probability, referred to as integrated probability of misclassification (IPM). On this basis, a one-step Bayes optimal learning function termed integrated probability of misclassification reduction (IPMR), along with a compatible convergence criterion, is defined. Three effective strategies are implemented to accelerate IPMR-informed sequential experimental design: (i) Analytical derivation of the inner expectation in IPMR, simplifying it to a single expectation. (ii) Substitution of IPMR with its upper bound IPMRU to avoid element-wise computation of its integrand. (iii) Rational pruning of both quadrature set and candidate pool in IPMRU to alleviate computer memory constraint. The efficacy of the proposed approach is demonstrated on two benchmark examples and two numerical examples. Results indicate that IPMRU facilitates a much more rapid reduction of IPM compared to other existing learning functions, while requiring much less computational time than IPMR itself. Therefore, the proposed reliability method offers a substantial advantage in both computational efficiency and accuracy, especially in complex dynamic reliability problems. © 2024
引用
收藏
相关论文
共 50 条
  • [1] Bayesian reinforcement learning reliability analysis
    Zhou, Tong
    Guo, Tong
    Dang, Chao
    Beer, Michael
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2024, 424
  • [2] An Information-Theoretic Analysis of Bayesian Reinforcement Learning
    Gouverneur, Amaury
    Rodriguez-Galvez, Borja
    Oechtering, Tobias J.
    Skoglund, Mikael
    2022 58TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2022,
  • [3] Benchmarking for Bayesian Reinforcement Learning
    Castronovo, Michael
    Ernst, Damien
    Couetoux, Adrien
    Fonteneau, Raphael
    PLOS ONE, 2016, 11 (06):
  • [4] Bayesian reinforcement learning: A survey
    Ghavamzadeh, Mohammad
    Mannor, Shie
    Pineau, Joelle
    Tamar, Aviv
    Foundations and Trends in Machine Learning, 2015, 8 (5-6): : 359 - 483
  • [5] Bayesian Inverse Reinforcement Learning
    Ramachandran, Deepak
    Amir, Eyal
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2586 - 2591
  • [6] Bayesian Reinforcement Learning with Exploration
    Lattimore, Tor
    Hutter, Marcus
    ALGORITHMIC LEARNING THEORY (ALT 2014), 2014, 8776 : 170 - 184
  • [7] Reinforcement Learning for Reliability Optimisation
    Saka, Prasuna
    Banerjee, Ansuman
    THIRTEENTH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING ADVANCES (ICSEA 2018), 2018, : 25 - 32
  • [8] Yet another Bayesian active learning reliability analysis method
    Dang, Chao
    Zhou, Tong
    Valdebenito, Marcos A.
    Faes, Matthias G. R.
    STRUCTURAL SAFETY, 2025, 112
  • [9] Multi-point Bayesian active learning reliability analysis
    Zhou, Tong
    Zhu, Xujia
    Guo, Tong
    Dong, You
    Beer, Michael
    STRUCTURAL SAFETY, 2025, 114
  • [10] A Bayesian Approach to Robust Reinforcement Learning
    Derman, Esther
    Mankowitz, Daniel
    Mann, Timothy
    Mannor, Shie
    35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 648 - 658