Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach

Cited by: 0
Authors
Queeney, James [1]
Paschalidis, Ioannis Ch [1]
Cassandras, Christos G. [1]
Affiliations
[1] Boston Univ, Div Syst Engn, Boston, MA 02215 USA
Keywords
ALGORITHMS;
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In order for reinforcement learning techniques to be useful in real-world decision making processes, they must be able to produce robust performance from limited data. Deep policy optimization methods have achieved impressive results on complex tasks, but their real-world adoption remains limited because they often require significant amounts of data to succeed. When combined with small sample sizes, these methods can result in unstable learning due to their reliance on high-dimensional sample-based estimates. In this work, we develop techniques to control the uncertainty introduced by these estimates. We leverage these techniques to propose a deep policy optimization approach designed to produce stable performance even when data is scarce. The resulting algorithm, Uncertainty-Aware Trust Region Policy Optimization, generates robust policy updates that adapt to the level of uncertainty present throughout the learning process.
Pages: 9377 - 9385
Number of pages: 9
Related papers
50 in total
  • [21] Uncertainty-Aware Web of Things Composition: A Probabilistic Approach
    Boulaares, Soura
    Sassi, Salma
    Chbeir, Richard
    Bensilmane, Djamal
    Faiz, Sami
    2023 20TH ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, AICCSA, 2023
  • [22] Uncertainty-Aware Instance Reweighting for Off-Policy Learning
    Zhang, Xiaoying
    Chen, Junpu
    Wang, Hongning
    Xie, Hong
    Liu, Yang
    Lui, John C. S.
    Li, Hang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [23] Uncertainty-Aware Active Domain Adaptive Salient Object Detection
    Li, Guanbin
    Chen, Zhuohua
    Mao, Mingzhi
    Lin, Liang
    Fang, Chaowei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5510 - 5524
  • [24] Deep Adaptive Pansharpening via Uncertainty-Aware Image Fusion
    Zheng, Kaiwen
    Huang, Jie
    Zhou, Man
    Hong, Danfeng
    Zhao, Feng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [25] Uncertainty-Aware Label Rectification for Domain Adaptive Mitochondria Segmentation
    Wu, Siqi
    Chen, Chang
    Xiong, Zhiwei
    Chen, Xuejin
    Sun, Xiaoyan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 191 - 200
  • [26] Online and Adaptive Parking Availability Mapping: An Uncertainty-Aware Active Sensing Approach for Connected Vehicles
    Varotto, Luca
    Cenedese, Angelo
    2021 IEEE INTELLIGENT VEHICLES SYMPOSIUM WORKSHOPS (IV WORKSHOPS), 2021: 31 - 36
  • [27] Uncertainty-Aware Robust Optimization of Test-Access Architectures for 3D Stacked ICs
    Deutsch, Sergej
    Chakrabarty, Krishnendu
    Marinissen, Erik Jan
    2013 IEEE INTERNATIONAL TEST CONFERENCE (ITC), 2013
  • [28] Data-driven and uncertainty-aware robust airstrip surface estimation
    Crocetti, Francesco
    Fravolini, Mario Luca
    Costante, Gabriele
    Valigi, Paolo
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (26): 19565 - 19580
  • [29] Trust Region Policy Optimization
    Schulman, John
    Levine, Sergey
    Moritz, Philipp
    Jordan, Michael
    Abbeel, Pieter
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1889 - 1897
  • [30] Data-driven and uncertainty-aware robust airstrip surface estimation
    Francesco Crocetti
    Mario Luca Fravolini
    Gabriele Costante
    Paolo Valigi
    Neural Computing and Applications, 2023, 35 : 19565 - 19580