Reinforcement learning for continuous-time mean-variance portfolio selection in a regime-switching market

Cited by: 4
Authors
Wu, Bo [1 ]
Li, Lingfei [1 ]
Affiliation
[1] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Hong Kong, Peoples R China
Source
JOURNAL OF ECONOMIC DYNAMICS & CONTROL | 2024, Vol. 158
Keywords
Reinforcement learning; Actor-critic; Mean-variance; Portfolio selection; Partial information; Regime-switching; Wonham's filter; ASSET ALLOCATION; OPTIMIZATION;
DOI
10.1016/j.jedc.2023.104787
Chinese Library Classification (CLC): F [Economics]
Discipline code: 02
Abstract
We propose a reinforcement learning (RL) approach to solve the continuous-time mean-variance portfolio selection problem in a regime-switching market, where the market regime is unobservable. To encourage exploration for learning, we formulate an exploratory stochastic control problem with an entropy-regularized mean-variance objective. We obtain semi-analytical representations of the optimal value function and optimal policy, which involve unknown solutions to two linear parabolic partial differential equations (PDEs). We utilize these representations to parametrize the value function and policy for learning, with the unknown solutions to the PDEs approximated by polynomials. We develop an actor-critic RL algorithm to learn the optimal policy through interactions with the market environment. The algorithm carries out filtering to obtain the belief probability of the market regime and performs policy evaluation and policy gradient updates alternately. Empirical results demonstrate the advantages of our RL algorithm in relatively long-term investment problems over the classical control approach and over an RL algorithm developed for the continuous-time mean-variance problem without considering regime switches.
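The filtering step the abstract refers to is Wonham's filter, which turns observed asset returns into a belief probability over the hidden market regime. The following is a minimal sketch of one Euler-discretized filter step; the function name, the two-regime parameters, and the simulation setup are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def wonham_filter_step(p, dY, dt, mu, sigma, Q):
    """One Euler step of the Wonham filter.

    p     : current belief over the K regimes, shape (K,)
    dY    : observed log-return increment over [t, t + dt]
    mu    : drift of the risky asset in each regime, shape (K,)
    sigma : volatility of the risky asset (assumed regime-independent)
    Q     : generator matrix of the hidden Markov chain, shape (K, K)
    """
    mu_bar = p @ mu                          # filtered drift estimate
    drift = Q.T @ p                          # generator (prior dynamics) term
    innov = (dY - mu_bar * dt) / sigma**2    # innovation term
    p_new = p + drift * dt + p * (mu - mu_bar) * innov
    p_new = np.clip(p_new, 1e-12, None)      # guard against negative mass
    return p_new / p_new.sum()               # renormalize to a probability vector

# Toy example: two regimes ("bull" and "bear"), true regime held fixed at 0.
rng = np.random.default_rng(0)
Q = np.array([[-0.5, 0.5], [1.0, -1.0]])
mu, sigma, dt = np.array([0.10, -0.05]), 0.2, 1 / 252
p = np.array([0.5, 0.5])
for _ in range(252):                         # one year of daily observations
    dY = mu[0] * dt + sigma * np.sqrt(dt) * rng.standard_normal()
    p = wonham_filter_step(p, dY, dt, mu, sigma, Q)
```

In the paper's actor-critic loop, the belief vector produced by this filter enters the state on which the parametrized policy and value function are evaluated; the clipping-and-renormalization here is a common numerical safeguard for the Euler discretization, since the raw update need not stay on the probability simplex.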
Pages: 28