Premium control with reinforcement learning

被引:0
|
作者
Palmborg, Lina [1 ]
Lindskog, Filip [1 ]
机构
[1] Stockholm Univ, Dept Math, S-10691 Stockholm, Sweden
基金
瑞典研究理事会;
关键词
Premium control; reinforcement learning; Markov decision process; CONVERGENCE;
D O I
10.1017/asb.2023.13
中图分类号
F [经济];
学科分类号
02 ;
摘要
We consider a premium control problem in discrete time, formulated in terms of a Markov decision process. In a simplified setting, the optimal premium rule can be derived with dynamic programming methods. However, these classical methods are not feasible in a more realistic setting due to the dimension of the state space and lack of explicit expressions for transition probabilities. We explore reinforcement learning techniques, using function approximation, to solve the premium control problem for realistic stochastic models. We illustrate the appropriateness of the approximate optimal premium rule compared with the true optimal premium rule in a simplified setting and further demonstrate that the approximate optimal premium rule outperforms benchmark rules in more realistic settings where classical approaches fail.
引用
收藏
页码:233 / 257
页数:25
相关论文
共 50 条
  • [1] Interpretable Control by Reinforcement Learning
    Hein, Daniel
    Limmer, Steffen
    Runkler, Thomas A.
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 8082 - 8089
  • [2] Reinforcement learning for structural control
    Adam, Bernard
    Smith, Ian F. C.
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2008, 22 (02) : 133 - 139
  • [3] Reinforcement learning for robot control
    Smart, WD
    Kaelbling, LP
    [J]. MOBILE ROBOTS XVI, 2002, 4573 : 92 - 103
  • [4] Reinforcement learning for control of valves
    Siraskar, Rajesh
    [J]. MACHINE LEARNING WITH APPLICATIONS, 2021, 4
  • [5] Robust reinforcement learning control
    Kretchmar, RM
    Young, PM
    Anderson, CW
    Hittle, DC
    Anderson, ML
    Tu, J
    Delnero, CC
    [J]. PROCEEDINGS OF THE 2001 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2001, : 902 - 907
  • [6] Control of a Quadrotor With Reinforcement Learning
    Hwangbo, Jemin
    Sa, Inkyu
    Siegwart, Roland
    Hutter, Marco
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (04): : 2096 - 2103
  • [7] Reinforcement learning and apprenticeship learning for robotic control
    Ng, Andrew
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2006, 4265 : 14 - 14
  • [8] Reinforcement learning and apprenticeship learning for robotic control
    Ng, Andrew Y.
    [J]. ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2006, 4264 : 29 - 31
  • [9] The learning premium
    Bichuch, Maxim
    Guasoni, Paolo
    [J]. MATHEMATICS AND FINANCIAL ECONOMICS, 2020, 14 (01) : 175 - 205
  • [10] The learning premium
    Maxim Bichuch
    Paolo Guasoni
    [J]. Mathematics and Financial Economics, 2020, 14 : 175 - 205