Multi-step reward ensemble methods for adaptive stock trading

被引:0
|
作者
Zeng, Zhiyi [1 ]
Ma, Cong [2 ]
Chang, Xiangyu [3 ]
机构
[1] Hubei Normal Univ, Sch Math & Stat, Huangshi, Peoples R China
[2] Northwest Univ, Sch Econ & Management, Xian, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Management, Ctr Intelligent Decis Making & Machine Learning, Xian, Peoples R China
关键词
Multi-step reward; Reward ensemble; Adaptive trading; Thompson sampling; VOLATILITY; RETURNS; RULES;
D O I
10.1016/j.eswa.2023.120547
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stock trading can be considered a Markov decision process that comes naturally to applying reinforcement learning (RL) to this field. Numerous studies have proposed various methods to combine stock trading with RL, where only one single reward function is used to fit the market. However, the market in the real world shows distinct patterns in different periods, such as bullish or bearish. A reward function in bullish periods may perform poorly in bearish periods. In our work, we construct several kinds of multi-step future-price-based reward functions (profit-based reward and regularized-based reward), considering that the market changes consistently. Moreover, we propose two ensemble rewards based on the greedy method (MSR-GME, the abbreviation for Multi-Step Rewards Greedy Method Ensemble) and Thompson sampling (MSR-TSE, the abbreviation for Multi-Step Rewards Thompson Sampling Ensemble) to help agents to make adaptive trading decisions under distinct market patterns. We conduct extensive experiments to verify the mechanisms and the superiority of our constructed reward functions from multiple aspects. The results show the two constructed single-reward functions outperform both the buy-and-hold strategy (B & H) and the historical-price-based rewards consistently to a large extent (for example, the profit-based reward achieves at most 7.3 times the Sortino ratio and 78.6% lower maximum drawdown than B & H). Moreover, the ensemble rewards can substantially improve strategy performance in achieving higher profits and lower risks (for example, MSR-TSE achieves at most 49.7 times profits and 8.85 times Sortino ratio than B & H). We also find that MSR-TSE is risk-averse, but MSR-GME is risk-aggressive, indicating that Thompson sampling is an intensely competitive ensemble method, especially in bearish markets.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] The step-transition operators for multi-step methods of ODE's
    Feng, K
    JOURNAL OF COMPUTATIONAL MATHEMATICS, 1998, 16 (03) : 193 - 202
  • [22] A note on symplecticity of step-transition mappings for multi-step methods
    Dai, Gui-Dong
    Tang, Yi-Fa
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2006, 196 (02) : 474 - 477
  • [23] A multi-step class of iterative methods for nonlinear systems
    Fazlollah Soleymani
    Taher Lotfi
    Parisa Bakhtiari
    Optimization Letters, 2014, 8 : 1001 - 1015
  • [24] An Ensemble Approach for Multi-Step Ahead Energy Forecasting of Household Communities
    Pirbazari, Aida Mehdipour
    Sharma, Ekanki
    Chakravorty, Antorweep
    Elmenreich, Wilfried
    Rong, Chunming
    IEEE ACCESS, 2021, 9 : 36218 - 36240
  • [25] A multi-step class of iterative methods for nonlinear systems
    Soleymani, Fazlollah
    Lotfi, Taher
    Bakhtiari, Parisa
    OPTIMIZATION LETTERS, 2014, 8 (03) : 1001 - 1015
  • [26] Solving monotone inclusions with linear multi-step methods
    Pennanen, T
    Svaiter, BF
    MATHEMATICAL PROGRAMMING, 2003, 96 (03) : 469 - 487
  • [27] Methods to Improve Multi-Step Time Series Prediction
    Koesdwiady, Arief
    El Khatib, Alaa
    Karray, Fakhri
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [28] Solving monotone inclusions with linear multi-step methods
    Teemu Pennanen
    B. F. Svaiter
    Mathematical Programming, 2003, 96 : 469 - 487
  • [29] Comparison of multi-step forecasting methods for renewable energy
    Dolgintseva, E.
    Wu, H.
    Petrosian, O.
    Zhadan, A.
    Allakhverdyan, A.
    Martemyanov, A.
    ENERGY SYSTEMS-OPTIMIZATION MODELING SIMULATION AND ECONOMIC ASPECTS, 2024,