Reinforcement learning for continuous-time mean-variance portfolio selection in a regime-switching market

被引:4
|
作者
Wu, Bo [1 ]
Li, Lingfei [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Hong Kong, Peoples R China
来源
JOURNAL OF ECONOMIC DYNAMICS & CONTROL | 2024年 / 158卷
关键词
Reinforcement learning; Actor-critic; Mean-variance; Portfolio selection; Partial information; Regime-switching; Wonham's filter; ASSET ALLOCATION; OPTIMIZATION;
D O I
10.1016/j.jedc.2023.104787
中图分类号
F [经济];
学科分类号
02 ;
摘要
We propose a reinforcement learning (RL) approach to solve the continuous-time mean-variance portfolio selection problem in a regime-switching market, where the market regime is unobservable. To encourage exploration for learning, we formulate an exploratory stochastic control problem with an entropy-regularized mean-variance objective. We obtain semi-analytical representations of the optimal value function and optimal policy, which involve unknown solutions to two linear parabolic partial differential equations (PDEs). We utilize these representations to parametrize the value function and policy for learning with the unknown solutions to the PDEs approximated based on polynomials. We develop an actor-critic RL algorithm to learn the optimal policy through interactions with the market environment. The algorithm carries out filtering to obtain the belief probability of the market regime and performs policy evaluation and policy gradient updates alternately. Empirical results demonstrate the advantages of our RL algorithm in relatively long-term investment problems over the classical control approach and an RL algorithm developed for the continuous-time mean-variance problem without considering regime switches.
引用
收藏
页数:28
相关论文
共 50 条
  • [31] Numerical solution of continuous-time mean-variance portfolio selection with nonlinear constraints
    Yan, Wei
    Li, Shurong
    INTERNATIONAL JOURNAL OF CONTROL, 2010, 83 (03) : 642 - 650
  • [32] Explicit efficient frontier of a continuous-time mean-variance portfolio selection problem
    Zhou, XY
    Li, D
    CONTROL OF DISTRIBUTED PARAMETER AND STOCHASTIC SYSTEMS, 1999, 13 : 323 - +
  • [33] MEAN-VARIANCE PORTFOLIO SELECTION UNDER A NON-MARKOVIAN REGIME-SWITCHING MODEL: TIME-CONSISTENT SOLUTIONS
    Wang, Tianxiao
    Jin, Zhuo
    Wei, Jiaqin
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2019, 57 (05) : 3249 - 3271
  • [34] Mean-Variance Portfolio Selection in a Markovian Regime-Switching Market When the Uncertain Time Horizon is a Stopping Time of Market State Filtration: A Multi-period Model
    Keykhaei, Reza
    JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF CHINA, 2024,
  • [35] Mean-variance portfolio selection with regime switching under shorting prohibition
    Zhang, Miao
    Chen, Ping
    OPERATIONS RESEARCH LETTERS, 2016, 44 (05) : 658 - 662
  • [36] Continuous-time portfolio selection with liability: Mean-variance model and stochastic LQ approach
    Xie, Shuxiang
    Li, Zhongfei
    Wang, Shouyang
    INSURANCE MATHEMATICS & ECONOMICS, 2008, 42 (03): : 943 - 953
  • [37] Optimal mean-variance investment/reinsurance with common shock in a regime-switching market
    Bi, Junna
    Liang, Zhibin
    Yuen, Kam Chuen
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2019, 90 (01) : 109 - 135
  • [38] A regime-switching factor model for mean-variance optimization
    Costa, Giorgio
    Kwon, Roy H.
    JOURNAL OF RISK, 2020, 22 (04): : 31 - 59
  • [39] Mean-variance portfolio selection with only risky assets under regime switching
    Zhang, Miao
    Chen, Ping
    Yao, Haixiang
    ECONOMIC MODELLING, 2017, 62 : 35 - 42
  • [40] A Continuous-Time Hidden Markov Model for Mean-Variance Portfolio Optimization
    Elliott, Robert J.
    Siu, Tak Kuen
    ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 1189 - +