Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach

被引:0
|
作者
Queeney, James [1 ]
Paschalidis, Ioannis Ch [1 ]
Cassandras, Christos G. [1 ]
机构
[1] Boston Univ, Div Syst Engn, Boston, MA 02215 USA
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order for reinforcement learning techniques to be useful in real-world decision making processes, they must be able to produce robust performance from limited data. Deep policy optimization methods have achieved impressive results on complex tasks, but their real-world adoption remains limited because they often require significant amounts of data to succeed. When combined with small sample sizes, these methods can result in unstable learning due to their reliance on high-dimensional sample-based estimates. In this work, we develop techniques to control the uncertainty introduced by these estimates. We leverage these techniques to propose a deep policy optimization approach designed to produce stable performance even when data is scarce. The resulting algorithm, Uncertainty-Aware Trust Region Policy Optimization, generates robust policy updates that adapt to the level of uncertainty present throughout the learning process.
引用
收藏
页码:9377 / 9385
页数:9
相关论文
共 50 条
  • [1] Uncertainty-aware circuit optimization
    Bai, XL
    Visweswariah, C
    Strenski, PN
    Hathaway, DJ
    39TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2002, 2002, : 58 - 63
  • [2] Uncertainty-aware GAN with Adaptive Loss for Robust MRI Image Enhancement
    Upadhyay, Uddeshya
    Sudarshan, Viswanath P.
    Awate, Suyash P.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3248 - 3257
  • [3] Uncertainty-aware Simulation of Adaptive Systems
    Jezequel, Jean-Marc
    Vallecillo, Antonio
    ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2023, 33 (03):
  • [4] Uncertainty-Aware Reliability Analysis and Optimization
    Khosravi, Faramarz
    Mueller, Malte
    Glass, Michael
    Teich, Juergen
    2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2015, : 97 - 102
  • [5] RobOpt: A Tool for Robust Workload Optimization Based on Uncertainty-Aware Machine Learning
    Kamali, Amin
    Kantere, Verena
    Zuzarte, Calisto
    Corvinelli, Vincent
    COMPANION OF THE 2024 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2024, 2024, : 468 - 471
  • [6] Uncertainty-Aware Reinforcement Learning for Portfolio Optimization
    Enkhsaikhan, Bayaraa
    Jo, Ohyun
    IEEE ACCESS, 2024, 12 : 166553 - 166563
  • [7] Uncertainty-Aware Optimization for Network Provisioning and Routing
    Bi, Yingjie
    Tang, Ao
    2019 53RD ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2019,
  • [8] Robust Tracking via Uncertainty-Aware Semantic Consistency
    Ma, Jie
    Lan, Xiangyuan
    Zhong, Bineng
    Li, Guorong
    Tang, Zhenjun
    Li, Xianxian
    Ji, Rongrong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1740 - 1751
  • [9] Uncertainty-aware image inpainting with adaptive feedback network
    Ma, Xin
    Zhou, Xiaoqiang
    Huang, Huaibo
    Jia, Gengyun
    Wang, Yaohui
    Chen, Xinyuan
    Chen, Cunjian
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [10] Rapid Trust Calibration through Interpretable and Uncertainty-Aware AI
    Tomsett, Richard
    Preece, Alun
    Braines, Dave
    Cerutti, Federico
    Chakraborty, Supriyo
    Srivastava, Mani
    Pearson, Gavin
    Kaplan, Lance
    PATTERNS, 2020, 1 (04):