Twin-Delayed Deep Deterministic Policy Gradient Algorithm for Portfolio Selection

被引：1

作者：

Baard, Nicholas ^{[1
]}

van Zyl, Terence L. ^{[2
]}

机构：

[1] Univ Witwatersrand, Comp Sci & Appl Math, Johannesburg, South Africa

[2] Univ Johannesburg, Inst Intelligent Syst, Johannesburg, South Africa

来源：

2022 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING AND ECONOMICS (CIFER) | 2022年

关键词：

Reinforcement Learning; Portfolio Selection; TD3; DDPG;

D O I：

10.1109/CIFEr52523.2022.9776067

中图分类号：

F8 [财政、金融];

学科分类号：

0202 ;

摘要：

State-of-the-art RL algorithms have shown suboptimal performance in some market conditions with regard to the portfolio selection problem. The reason for suboptimal performance could be due to overestimation bias in actor-critic methods through the use of neural networks as the function approximator. The resulting bias leads to a suboptimal policy being learned by the agent, hindering performance. This research focuses on using the Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithm for portfolio selection to achieve greater results than previously achieved. In addition, an analysis of the overall effectiveness of the algorithm in various market conditions is needed to determine the TD3's robustness. This research establishes a RL environment for portfolio selection and trains the TD3 alongside three state-of-the-art algorithms in five different market conditions. The algorithms are tested by allowing the agent to manage a portfolio in each market for a specified period. The results are used for the analysis of the algorithms. The research shows improved results achieved by the TD3 algorithm for portfolio selection compared to other state-of-the-art algorithms. Furthermore, the performance of the TD3 across the five selected markets proves the robustness of the algorithm in its use for the portfolio selection problem.

引用

下载

页数：8

共 50 条

[31] Intelligent Trajectory Tracking Linear Active Disturbance Rejection Control of a Powered Parafoil Based on Twin Delayed Deep Deterministic Policy Gradient Algorithm Optimization
Zheng, Yuemin
Fei, Zelin
Tao, Jin
Sun, Qinglin
Sun, Hao
Chen, Zengqiang
Sun, Mingwei
APPLIED SCIENCES-BASEL, 2023, 13 (23):
[32] Novel task decomposed multi-agent twin delayed deep deterministic policy gradient algorithm for multi-UAV autonomous path planning
Zhou, Yatong
Kong, Xiaoran
Lin, Kuo-Ping
Liu, Liangyu
KNOWLEDGE-BASED SYSTEMS, 2024, 287
[33] An Intelligent Energy Management Strategy for Hybrid Vehicle with irrational actions using Twin Delayed Deep Deterministic Policy Gradient
Liu, Zemin Eitan
Zhou, Quan
Li, Yanfei
Shuai, Shijin
IFAC PAPERSONLINE, 2021, 54 (10): : 546 - 551
[34] Optimal demand response based dynamic pricing strategy via Multi-Agent Federated Twin Delayed Deep Deterministic policy gradient algorithm
Ma, Haining
Zhang, Huifeng
Tian, Ding
Yue, Dong
Hancke, Gerhard P.
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[35] Low-Level Control of a Quadrotor using Twin Delayed Deep Deterministic Policy Gradient (TD3)
Shehab, Mazen
Zaghloul, Ahmed
El-Badawy, Ayman
2021 18TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE 2021), 2021,
[36] Deep reinforcement learning for PMSG wind turbine control via twin delayed deep deterministic policy gradient (TD3)
Zholtayev, Darkhan
Rubagotti, Matteo
Do, Ton Duc
OPTIMAL CONTROL APPLICATIONS & METHODS, 2024, 45 (04): : 1889 - 1906
[37] Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm
Wu, Junta
Li, Huiyun
MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020 (2020)
[38] A new strategy optimisation method for underwater flapping foil propulsion based on Twin-Delayed Deep Deterministic and Gaussian process regression
Yang, Yinghe
Wei, Handi
Fan, Dixia
Li, Ang
OCEAN ENGINEERING, 2024, 311
[39] Decentralized multi-agent control of a three-tank hybrid system based on twin delayed deep deterministic policy gradient reinforcement learning algorithm
N. Rajasekhar
T. K. Radhakrishnan
N. Samsudeen
International Journal of Dynamics and Control, 2024, 12 : 1098 - 1115
[40] MP-TD3: Multi-Pool Prioritized Experience Replay-Based Asynchronous Twin Delayed Deep Deterministic Policy Gradient Algorithm
Tan, Wenwen
Huang, Detian
IEEE ACCESS, 2024, 12 : 105268 - 105280

← 1 2 3 4 5 →