Multi-step reward ensemble methods for adaptive stock trading

被引:0
|
作者
Zeng, Zhiyi [1 ]
Ma, Cong [2 ]
Chang, Xiangyu [3 ]
机构
[1] Hubei Normal Univ, Sch Math & Stat, Huangshi, Peoples R China
[2] Northwest Univ, Sch Econ & Management, Xian, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Management, Ctr Intelligent Decis Making & Machine Learning, Xian, Peoples R China
关键词
Multi-step reward; Reward ensemble; Adaptive trading; Thompson sampling; VOLATILITY; RETURNS; RULES;
D O I
10.1016/j.eswa.2023.120547
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stock trading can be considered a Markov decision process that comes naturally to applying reinforcement learning (RL) to this field. Numerous studies have proposed various methods to combine stock trading with RL, where only one single reward function is used to fit the market. However, the market in the real world shows distinct patterns in different periods, such as bullish or bearish. A reward function in bullish periods may perform poorly in bearish periods. In our work, we construct several kinds of multi-step future-price-based reward functions (profit-based reward and regularized-based reward), considering that the market changes consistently. Moreover, we propose two ensemble rewards based on the greedy method (MSR-GME, the abbreviation for Multi-Step Rewards Greedy Method Ensemble) and Thompson sampling (MSR-TSE, the abbreviation for Multi-Step Rewards Thompson Sampling Ensemble) to help agents to make adaptive trading decisions under distinct market patterns. We conduct extensive experiments to verify the mechanisms and the superiority of our constructed reward functions from multiple aspects. The results show the two constructed single-reward functions outperform both the buy-and-hold strategy (B & H) and the historical-price-based rewards consistently to a large extent (for example, the profit-based reward achieves at most 7.3 times the Sortino ratio and 78.6% lower maximum drawdown than B & H). Moreover, the ensemble rewards can substantially improve strategy performance in achieving higher profits and lower risks (for example, MSR-TSE achieves at most 49.7 times profits and 8.85 times Sortino ratio than B & H). We also find that MSR-TSE is risk-averse, but MSR-GME is risk-aggressive, indicating that Thompson sampling is an intensely competitive ensemble method, especially in bearish markets.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Multi-step methods for equations
    Sunil Kumar
    Janak Raj Sharma
    Ioannis K. Argyros
    ANNALI DELL'UNIVERSITA' DI FERRARA, 2024, 70 (4) : 1193 - 1215
  • [2] Multi-step ahead Bitcoin Price Forecasting Based on VMD and Ensemble Learning Methods
    da Silva, Ramon Gomes
    Ribeiro, Matheus Henrique Dal Molin
    Fraccanabbia, Naylene
    Mariani, Viviana Cocco
    Coelho, Leandro dos Santos
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [3] On conjugate symplecticity of multi-step methods
    Tang, YF
    JOURNAL OF COMPUTATIONAL MATHEMATICS, 2000, 18 (04) : 431 - 438
  • [4] ON CONJUGATE SYMPLECTICITY OF MULTI-STEP METHODS
    Yi-fa Tang (LSEC
    Journal of Computational Mathematics, 2000, (04) : 431 - 438
  • [5] An Adaptive Multi-step Levenberg–Marquardt Method
    Jinyan Fan
    Jianchao Huang
    Jianyu Pan
    Journal of Scientific Computing, 2019, 78 : 531 - 548
  • [6] Multi-Step Gradient Methods for Networked Optimization
    Ghadimi, Euhanna
    Shames, Iman
    Johansson, Mikael
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2013, 61 (21) : 5417 - 5429
  • [7] Inertial manifolds and linear multi-step methods
    Tony Shardlow
    Numerical Algorithms, 1997, 14 : 189 - 209
  • [8] Inertial manifolds and linear multi-step methods
    Shardlow, T
    NUMERICAL ALGORITHMS, 1997, 14 (1-3) : 189 - 209
  • [9] CORRECTOR FORMULAS FOR MULTI-STEP INTEGRATION METHODS
    HULL, TE
    NEWBERY, ACR
    JOURNAL OF THE SOCIETY FOR INDUSTRIAL AND APPLIED MATHEMATICS, 1962, 10 (02): : 351 - 369
  • [10] Multi-Step Skipping Methods for Unconstrained Optimization
    Ford, John A.
    Aamir, Nudrat
    NUMERICAL ANALYSIS AND APPLIED MATHEMATICS ICNAAM 2011: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS, VOLS A-C, 2011, 1389