Robust Risk-Aware Reinforcement Learning

Cited by: 10
Authors
Jaimungal, Sebastian [1]
Pesenti, Silvana M. [1]
Wang, Ye Sheng [1]
Tatsat, Hariom [2]
Affiliations
[1] Univ Toronto, Dept Stat Sci, Toronto, ON M5G 1Z5, Canada
[2] Barclays Capital, New York, NY 10020 USA
Source
SIAM JOURNAL ON FINANCIAL MATHEMATICS | 2022, Vol. 13, No. 1
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
robust optimization; reinforcement learning; risk measures; Wasserstein distance; statistical arbitrage; portfolio optimization; CHOICE;
DOI
10.1137/21M144640X
Chinese Library Classification (CLC) number
F8 [Public Finance, Finance]
Discipline classification code
0202
Abstract
We present a reinforcement learning (RL) approach for robust optimization of risk-aware performance criteria. To allow agents to express a wide variety of risk-reward profiles, we assess the value of a policy using rank dependent expected utility (RDEU). RDEU allows agents to seek gains, while simultaneously protecting themselves against downside risk. To robustify optimal policies against model uncertainty, we assess a policy not by its distribution but rather by the worst possible distribution that lies within a Wasserstein ball around it. Thus, our problem formulation may be viewed as an actor/agent choosing a policy (the outer problem) and the adversary then acting to worsen the performance of that strategy (the inner problem). We develop explicit policy gradient formulae for the inner and outer problems and show their efficacy on three prototypical financial problems: robust portfolio allocation, benchmark optimization, and statistical arbitrage.
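The two quantities the abstract leans on, RDEU and the Wasserstein ball, can be illustrated on one-dimensional samples. The sketch below is not the paper's method or code: it assumes the common Quiggin-style convention in which a probability weighting function is applied to the empirical CDF of ascending-sorted outcomes, and it uses the standard sorted-sample formula for the p-Wasserstein distance between two equal-size empirical distributions. The names `rdeu`, `wasserstein_1d`, `utility`, and `weight` are hypothetical and chosen only for illustration.

```python
import numpy as np


def rdeu(samples, utility, weight):
    """Empirical rank-dependent expected utility (assumed Quiggin-style convention).

    Outcomes are sorted in ascending order and the i-th smallest outcome
    receives the decision weight weight(i/n) - weight((i-1)/n).
    `weight` must map [0, 1] to [0, 1] with weight(0) = 0 and weight(1) = 1.
    """
    x = np.sort(np.asarray(samples, dtype=float))
    n = x.size
    grid = np.arange(n + 1) / n          # empirical CDF levels 0, 1/n, ..., 1
    decision_weights = np.diff(weight(grid))
    return float(np.sum(utility(x) * decision_weights))


def wasserstein_1d(a, b, p=2):
    """p-Wasserstein distance between two equal-size 1-D empirical samples.

    In one dimension the optimal coupling pairs order statistics, so the
    distance is the L^p norm of the difference of the sorted samples.
    """
    a = np.sort(np.asarray(a, dtype=float))
    b = np.sort(np.asarray(b, dtype=float))
    if a.size != b.size:
        raise ValueError("this sketch assumes equal sample sizes")
    return float(np.mean(np.abs(a - b) ** p) ** (1.0 / p))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    pnl = rng.normal(loc=0.05, scale=0.2, size=10_000)   # hypothetical terminal P&L
    stressed = pnl - 0.02                                # a shifted alternative model

    linear = lambda v: v
    print("risk-neutral value:", rdeu(pnl, linear, lambda q: q))
    print("pessimistic RDEU:  ", rdeu(pnl, linear, np.sqrt))
    print("W2(pnl, stressed): ", wasserstein_1d(pnl, stressed))
```

With the identity weighting the estimate reduces to the sample mean of the utilities, while a concave weighting such as the square root shifts weight toward the worst outcomes. Under these assumptions, an alternative distribution lies in the Wasserstein ball of radius ε around the policy's distribution exactly when `wasserstein_1d` returns a value no larger than ε.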
Pages: 213-226
Number of pages: 14