Twin-Delayed Deep Deterministic Policy Gradient Algorithm for Portfolio Selection

被引:1
|
作者
Baard, Nicholas [1 ]
van Zyl, Terence L. [2 ]
机构
[1] Univ Witwatersrand, Comp Sci & Appl Math, Johannesburg, South Africa
[2] Univ Johannesburg, Inst Intelligent Syst, Johannesburg, South Africa
关键词
Reinforcement Learning; Portfolio Selection; TD3; DDPG;
D O I
10.1109/CIFEr52523.2022.9776067
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
State-of-the-art RL algorithms have shown suboptimal performance in some market conditions with regard to the portfolio selection problem. The reason for suboptimal performance could be due to overestimation bias in actor-critic methods through the use of neural networks as the function approximator. The resulting bias leads to a suboptimal policy being learned by the agent, hindering performance. This research focuses on using the Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithm for portfolio selection to achieve greater results than previously achieved. In addition, an analysis of the overall effectiveness of the algorithm in various market conditions is needed to determine the TD3's robustness. This research establishes a RL environment for portfolio selection and trains the TD3 alongside three state-of-the-art algorithms in five different market conditions. The algorithms are tested by allowing the agent to manage a portfolio in each market for a specified period. The results are used for the analysis of the algorithms. The research shows improved results achieved by the TD3 algorithm for portfolio selection compared to other state-of-the-art algorithms. Furthermore, the performance of the TD3 across the five selected markets proves the robustness of the algorithm in its use for the portfolio selection problem.
引用
下载
收藏
页数:8
相关论文
共 50 条
  • [1] Twin-delayed deep deterministic policy gradient algorithm for the energy management of microgrids
    Dominguez-Barbero, David
    Garcia-Gonzalez, Javier
    Sanz-Bobi, Miguel A.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [2] Twin-Delayed Deep Deterministic Policy Gradient Algorithm to Control a Boost Converter in a DC Microgrid
    Muktiadji, Rifqi Firmansyah
    Ramli, Makbul A. M.
    Milyani, Ahmad H.
    ELECTRONICS, 2024, 13 (02)
  • [3] Fractional-Order Control Method Based on Twin-Delayed Deep Deterministic Policy Gradient Algorithm
    Jiao, Guangxin
    An, Zhengcai
    Shao, Shuyi
    Sun, Dong
    FRACTAL AND FRACTIONAL, 2024, 8 (02)
  • [4] Twin-Delayed Deep Deterministic Policy Gradient for Low-Frequency Oscillation Damping Control
    Cui, Qiushi
    Kim, Gyoungjae
    Weng, Yang
    ENERGIES, 2021, 14 (20)
  • [5] Cooperative motion planning and control of a group of autonomous underwater vehicles using twin-delayed deep deterministic policy gradient
    Hadi, Behnaz
    Khosravi, Alireza
    Sarhadi, Pouria
    APPLIED OCEAN RESEARCH, 2024, 147
  • [6] Optimizing Control of Wastewater Treatment Plant With Reinforcement Learning: Technical Evaluation of Twin-Delayed Deep Deterministic Policy Gradient Agent
    Klawikowska, Zuzanna
    Grochowski, Michal
    IEEE ACCESS, 2024, 12 : 130318 - 130333
  • [7] Twin Delayed Multi-Agent Deep Deterministic Policy Gradient
    Zhan, Mengying
    Chen, Jinchao
    Du, Chenglie
    Duan, Yuxin
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021, : 48 - 52
  • [8] Deep Deterministic Policy Gradient for Portfolio Management
    Khemlichi, Firdaous
    Chougrad, Hiba
    Khamlichi, Youness Idrissi
    El Boushaki, Abdessamad
    Ben Ali, Safae Elhaj
    2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 424 - 429
  • [9] Fault-resilient control of parallel PV inverters using multi-agent twin-delayed deep deterministic policy gradient approach
    Malik, Azra
    Haque, Ahteshamul
    Kurukuru, V. S. Bharath
    Mekhilef, Saad
    INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2024, 52 (07) : 3230 - 3254
  • [10] Adaptive neuro-fuzzy PID controller based on twin delayed deep deterministic policy gradient algorithm
    Shi, Qian
    Lam, Hak-Keung
    Xuan, Chengbin
    Chen, Ming
    NEUROCOMPUTING, 2020, 402 : 183 - 194