Deep reinforcement learning for option pricing and hedging under dynamic expectile risk measures

被引:0
|
作者
Marzban, Saeed [1 ,2 ]
Delage, Erick [1 ,2 ]
Li, Jonathan Yu-Meng [3 ]
机构
[1] HEC Montreal, GERAD, Montreal, PQ H3T 2A7, Canada
[2] HEC Montreal, Dept Decis Sci, Montreal, PQ H3T 2A7, Canada
[3] Univ Ottawa, Telfer Sch Management, Ottawa, ON K1N 6N5, Canada
关键词
Deep reinforcement learning; Derivative pricing; Risk hedging; Expectile risk measures; Incomplete market;
D O I
暂无
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Recently equal risk pricing, a framework for fair derivative pricing, was extended to consider dynamic risk measures. However, all current implementations either employ a static risk measure that violates time consistency, or are based on traditional dynamic programing solution schemes that are impracticable in problems with a large number of underlying assets (due to the curse of dimensionality) or with incomplete asset dynamics information. In this paper, we extend for the first time a famous off-policy deterministic actor-critic deep reinforcement learning (ACRL) algorithm to the problem of solving a risk averse Markov decision process that models risk using a time consistent recursive expectile risk measure. This new ACRL algorithm allows us to identify high quality time consistent hedging policies (and equal risk prices) for options, such as basket options, that cannot be handled using traditional methods, or in context where only historical trajectories of the underlying assets are available. Our numerical experiments, which involve both a simple vanilla option and a more exotic basket option, confirm that the new ACRL algorithm can produce (1) in simple environments, nearly optimal hedging policies, and highly accurate prices, simultaneously for a range of maturities (2) in complex environments, good quality policies and prices using reasonable amount of computing resources; and (3) overall, hedging strategies that actually outperform the strategies produced using static risk measures when the risk is evaluated at later points of time.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Deep reinforcement learning for option pricing and hedging under dynamic expectile risk measures
    Marzban, Saeed
    Delage, Erick
    Li, Jonathan Yu-Meng
    [J]. QUANTITATIVE FINANCE, 2023, 23 (10) : 1411 - 1430
  • [2] Deep Reinforcement Learning for Dynamic Stock Option Hedging: A Review
    Pickard, Reilly
    Lawryshyn, Yuri
    [J]. MATHEMATICS, 2023, 11 (24)
  • [3] Option pricing under residual risk and imperfect hedging
    Wang, Xiao-Tian
    Liang, Xiang-Qian
    Zhou, Ze-Min
    [J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2014, 415 (01) : 269 - 293
  • [4] Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning
    Coache, Anthony
    Jaimungal, Sebastian
    Cartea, Alvaro
    [J]. SIAM JOURNAL ON FINANCIAL MATHEMATICS, 2023, 14 (04): : 1249 - 1289
  • [5] Actuarial bridges to dynamic hedging and option pricing
    Gerber, HU
    Shiu, ESW
    [J]. INSURANCE MATHEMATICS & ECONOMICS, 1996, 18 (03): : 183 - 218
  • [6] Preliminary remarks on option pricing and dynamic hedging
    Fliess, Michel
    Join, Cedric
    [J]. 2012 1ST INTERNATIONAL CONFERENCE ON SYSTEMS AND COMPUTER SCIENCE (ICSCS), 2012,
  • [7] Pricing under dynamic risk measures
    Zhao, Jun
    Lepinette, Emmanuel
    Zhao, Peibiao
    [J]. OPEN MATHEMATICS, 2019, 17 : 894 - 905
  • [8] Option pricing and portfolio hedging under the mixed hedging strategy
    Wang, Xiao-Tian
    Zhao, Zhong-Feng
    Fang, Xiao-Fen
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2015, 424 : 194 - 206
  • [9] Pricing and hedging option under portfolio constrained
    Wei, G
    Chen, SP
    [J]. ACTA MATHEMATICA SCIENTIA, 2001, 21 (04) : 483 - 494
  • [10] PRICING AND HEDGING OPTION UNDER PORTFOLIO CONSTRAINED
    魏刚
    陈世平
    [J]. Acta Mathematica Scientia, 2001, (04) : 483 - 494