Twin-Delayed Deep Deterministic Policy Gradient Algorithm for Portfolio Selection

被引：1

作者：

Baard, Nicholas ^{[1
]}

van Zyl, Terence L. ^{[2
]}

机构：

[1] Univ Witwatersrand, Comp Sci & Appl Math, Johannesburg, South Africa

[2] Univ Johannesburg, Inst Intelligent Syst, Johannesburg, South Africa

来源：

2022 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING AND ECONOMICS (CIFER) | 2022年

关键词：

Reinforcement Learning; Portfolio Selection; TD3; DDPG;

D O I：

10.1109/CIFEr52523.2022.9776067

中图分类号：

F8 [财政、金融];

学科分类号：

0202 ;

摘要：

State-of-the-art RL algorithms have shown suboptimal performance in some market conditions with regard to the portfolio selection problem. The reason for suboptimal performance could be due to overestimation bias in actor-critic methods through the use of neural networks as the function approximator. The resulting bias leads to a suboptimal policy being learned by the agent, hindering performance. This research focuses on using the Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithm for portfolio selection to achieve greater results than previously achieved. In addition, an analysis of the overall effectiveness of the algorithm in various market conditions is needed to determine the TD3's robustness. This research establishes a RL environment for portfolio selection and trains the TD3 alongside three state-of-the-art algorithms in five different market conditions. The algorithms are tested by allowing the agent to manage a portfolio in each market for a specified period. The results are used for the analysis of the algorithms. The research shows improved results achieved by the TD3 algorithm for portfolio selection compared to other state-of-the-art algorithms. Furthermore, the performance of the TD3 across the five selected markets proves the robustness of the algorithm in its use for the portfolio selection problem.

引用

下载

页数：8

共 50 条

[1] Twin-delayed deep deterministic policy gradient algorithm for the energy management of microgrids
Dominguez-Barbero, David
Garcia-Gonzalez, Javier
Sanz-Bobi, Miguel A.
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
[2] Twin-Delayed Deep Deterministic Policy Gradient Algorithm to Control a Boost Converter in a DC Microgrid
Muktiadji, Rifqi Firmansyah
Ramli, Makbul A. M.
Milyani, Ahmad H.
ELECTRONICS, 2024, 13 (02)
[3] Fractional-Order Control Method Based on Twin-Delayed Deep Deterministic Policy Gradient Algorithm
Jiao, Guangxin
An, Zhengcai
Shao, Shuyi
Sun, Dong
FRACTAL AND FRACTIONAL, 2024, 8 (02)
[4] Twin-Delayed Deep Deterministic Policy Gradient for Low-Frequency Oscillation Damping Control
Cui, Qiushi
Kim, Gyoungjae
Weng, Yang
ENERGIES, 2021, 14 (20)
[5] Cooperative motion planning and control of a group of autonomous underwater vehicles using twin-delayed deep deterministic policy gradient
Hadi, Behnaz
Khosravi, Alireza
Sarhadi, Pouria
APPLIED OCEAN RESEARCH, 2024, 147
[6] Optimizing Control of Wastewater Treatment Plant With Reinforcement Learning: Technical Evaluation of Twin-Delayed Deep Deterministic Policy Gradient Agent
Klawikowska, Zuzanna
Grochowski, Michal
IEEE ACCESS, 2024, 12 : 130318 - 130333
[7] Twin Delayed Multi-Agent Deep Deterministic Policy Gradient
Zhan, Mengying
Chen, Jinchao
Du, Chenglie
Duan, Yuxin
PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021, : 48 - 52
[8] Deep Deterministic Policy Gradient for Portfolio Management
Khemlichi, Firdaous
Chougrad, Hiba
Khamlichi, Youness Idrissi
El Boushaki, Abdessamad
Ben Ali, Safae Elhaj
2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 424 - 429
[9] Fault-resilient control of parallel PV inverters using multi-agent twin-delayed deep deterministic policy gradient approach
Malik, Azra
Haque, Ahteshamul
Kurukuru, V. S. Bharath
Mekhilef, Saad
INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2024, 52 (07) : 3230 - 3254
[10] Adaptive neuro-fuzzy PID controller based on twin delayed deep deterministic policy gradient algorithm
Shi, Qian
Lam, Hak-Keung
Xuan, Chengbin
Chen, Ming
NEUROCOMPUTING, 2020, 402 : 183 - 194

← 1 2 3 4 5 →