Twin-Delayed Deep Deterministic Policy Gradient Algorithm for Portfolio Selection

被引:1
|
作者
Baard, Nicholas [1 ]
van Zyl, Terence L. [2 ]
机构
[1] Univ Witwatersrand, Comp Sci & Appl Math, Johannesburg, South Africa
[2] Univ Johannesburg, Inst Intelligent Syst, Johannesburg, South Africa
关键词
Reinforcement Learning; Portfolio Selection; TD3; DDPG;
D O I
10.1109/CIFEr52523.2022.9776067
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
State-of-the-art RL algorithms have shown suboptimal performance in some market conditions with regard to the portfolio selection problem. The reason for suboptimal performance could be due to overestimation bias in actor-critic methods through the use of neural networks as the function approximator. The resulting bias leads to a suboptimal policy being learned by the agent, hindering performance. This research focuses on using the Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithm for portfolio selection to achieve greater results than previously achieved. In addition, an analysis of the overall effectiveness of the algorithm in various market conditions is needed to determine the TD3's robustness. This research establishes a RL environment for portfolio selection and trains the TD3 alongside three state-of-the-art algorithms in five different market conditions. The algorithms are tested by allowing the agent to manage a portfolio in each market for a specified period. The results are used for the analysis of the algorithms. The research shows improved results achieved by the TD3 algorithm for portfolio selection compared to other state-of-the-art algorithms. Furthermore, the performance of the TD3 across the five selected markets proves the robustness of the algorithm in its use for the portfolio selection problem.
引用
下载
收藏
页数:8
相关论文
共 50 条
  • [31] Intelligent Trajectory Tracking Linear Active Disturbance Rejection Control of a Powered Parafoil Based on Twin Delayed Deep Deterministic Policy Gradient Algorithm Optimization
    Zheng, Yuemin
    Fei, Zelin
    Tao, Jin
    Sun, Qinglin
    Sun, Hao
    Chen, Zengqiang
    Sun, Mingwei
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [32] Novel task decomposed multi-agent twin delayed deep deterministic policy gradient algorithm for multi-UAV autonomous path planning
    Zhou, Yatong
    Kong, Xiaoran
    Lin, Kuo-Ping
    Liu, Liangyu
    KNOWLEDGE-BASED SYSTEMS, 2024, 287
  • [33] An Intelligent Energy Management Strategy for Hybrid Vehicle with irrational actions using Twin Delayed Deep Deterministic Policy Gradient
    Liu, Zemin Eitan
    Zhou, Quan
    Li, Yanfei
    Shuai, Shijin
    IFAC PAPERSONLINE, 2021, 54 (10): : 546 - 551
  • [34] Optimal demand response based dynamic pricing strategy via Multi-Agent Federated Twin Delayed Deep Deterministic policy gradient algorithm
    Ma, Haining
    Zhang, Huifeng
    Tian, Ding
    Yue, Dong
    Hancke, Gerhard P.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [35] Low-Level Control of a Quadrotor using Twin Delayed Deep Deterministic Policy Gradient (TD3)
    Shehab, Mazen
    Zaghloul, Ahmed
    El-Badawy, Ayman
    2021 18TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE 2021), 2021,
  • [36] Deep reinforcement learning for PMSG wind turbine control via twin delayed deep deterministic policy gradient (TD3)
    Zholtayev, Darkhan
    Rubagotti, Matteo
    Do, Ton Duc
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2024, 45 (04): : 1889 - 1906
  • [37] Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm
    Wu, Junta
    Li, Huiyun
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020 (2020)
  • [38] A new strategy optimisation method for underwater flapping foil propulsion based on Twin-Delayed Deep Deterministic and Gaussian process regression
    Yang, Yinghe
    Wei, Handi
    Fan, Dixia
    Li, Ang
    OCEAN ENGINEERING, 2024, 311
  • [39] Decentralized multi-agent control of a three-tank hybrid system based on twin delayed deep deterministic policy gradient reinforcement learning algorithm
    N. Rajasekhar
    T. K. Radhakrishnan
    N. Samsudeen
    International Journal of Dynamics and Control, 2024, 12 : 1098 - 1115
  • [40] MP-TD3: Multi-Pool Prioritized Experience Replay-Based Asynchronous Twin Delayed Deep Deterministic Policy Gradient Algorithm
    Tan, Wenwen
    Huang, Detian
    IEEE ACCESS, 2024, 12 : 105268 - 105280