Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk

被引:0
|
作者
Ni, Xinyi [1 ]
Lai, Lifeng [1 ]
机构
[1] Univ Calif Davis, Elect & Comp Engn, Davis, CA USA
基金
美国国家科学基金会;
关键词
ambiguity sets; RMDP; risk-sensitive RL; CVaR; OPTIMIZATION;
D O I
10.1109/ITW61385.2024.10806953
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robust Markov Decision Processes (RMDPs) have received significant research interest, offering an alternative to standard Markov Decision Processes (MDPs) that often assume fixed transition probabilities. RMDPs address this by optimizing for the worst-case scenarios within ambiguity sets. While earlier studies on RMDPs have largely centered on risk-neutral reinforcement learning (RL), with the goal of minimizing expected total discounted costs, in this paper, we analyze the robustness of CVaR-based risk-sensitive RL under RMDP. Firstly, we consider predetermined ambiguity sets. Based on the coherency of CVaR, we establish a connection between robustness and risk sensitivity, thus, techniques in risk-sensitive RL can be adopted to solve the proposed problem. Furthermore, motivated by the existence of decision-dependent uncertainty in real-world problems, we study problems with state-action-dependent ambiguity sets. To solve this, we define a new risk measure named NCVaR and build the equivalence of NCVaR optimization and robust CVaR optimization. We further propose value iteration algorithms and validate our approach in simulation experiments.
引用
收藏
页码:520 / 525
页数:6
相关论文
共 50 条
  • [1] Risk-sensitive learning via minimization of empirical conditional value-at-risk
    Kashima, Hisashi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (12) : 2043 - 2052
  • [2] Risk-Sensitive Safety Analysis Using Conditional Value-at-Risk
    Chapman, Margaret P.
    Bonalli, Riccardo
    Smith, Kevin M.
    Yang, Insoon
    Pavone, Marco
    Tomlin, Claire J.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (12) : 6521 - 6536
  • [3] Risk-sensitive Reinforcement Learning and Robust Learning for Control
    Noorani, Erfaun
    Baras, John S.
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2976 - 2981
  • [4] Distributionally robust reinsurance with Value-at-Risk and Conditional Value-at-Risk
    Liu, Haiyan
    Mao, Tiantian
    INSURANCE MATHEMATICS & ECONOMICS, 2022, 107 : 393 - 417
  • [5] Risk-Sensitive Reinforcement Learning
    Shen, Yun
    Tobia, Michael J.
    Sommer, Tobias
    Obermayer, Klaus
    NEURAL COMPUTATION, 2014, 26 (07) : 1298 - 1328
  • [6] Risk-sensitive reinforcement learning
    Mihatsch, O
    Neuneier, R
    MACHINE LEARNING, 2002, 49 (2-3) : 267 - 290
  • [7] Risk-Sensitive Reinforcement Learning
    Oliver Mihatsch
    Ralph Neuneier
    Machine Learning, 2002, 49 : 267 - 290
  • [8] Risk-Sensitive Motion Planning using Entropic Value-at-Risk
    Dixit, Anushri
    Ahmadi, Mohamadreza
    Burdick, Joel W.
    2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 1726 - 1732
  • [9] Inverse Risk-Sensitive Reinforcement Learning
    Ratliff, Lillian J.
    Mazumdar, Eric
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (03) : 1256 - 1263
  • [10] Robust Conditional Variance and Value-at-Risk Estimation
    Dupuis, Debbie J.
    Papageorgiou, Nicolas
    Remillard, Bruno
    JOURNAL OF FINANCIAL ECONOMETRICS, 2015, 13 (04) : 896 - 921