GraphSAGE with deep reinforcement learning for financial portfolio

被引：6

作者：

Sun, Qiguo ^{[1
]}

Wei, Xueying ^{[2
]}

Yang, Xibei ^{[1
]}

机构：

[1] Jiangsu Univ Sci & Technol, Sch Comp Sci & Engn, Zhenjiang 212003, Jiangsu, Peoples R China

[2] Shanxi Med Univ, Fenyang Coll, Fenyang 032200, Shanxi, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 238卷

关键词：

Portfolio management; Deep reinforcement learning; GraphSAGE; Explainable AI; SHAP;

D O I：

10.1016/j.eswa.2023.122027

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Portfolio optimization is an active management strategy that aims to maximize returns and control risk within reasonable limits. The Proximal Policy Optimization (PPO), a robust on-policy actor-critic deep reinforcement learning (DRL) model, is gaining popularity in portfolio optimization because it can help reduce emotional biases and take systematic investment actions. However, some research has found that the PPO model cannot achieve such remarkable performance in portfolio optimization as in games or robot control. In this paper, a novel GraphSAGE and DRL coupled model (GRL) is proposed to improve the architecture of the PPO agent by introducing a GraphSAGE-based feature extractor to capture the complex non-Euclidean relationships among market indexes, industry indexes and stocks. In addition, the explainable model SHAP is used to select a few but important features for GRL learning, and a method for generating a static financial graph is defined. This improves the robustness and training efficiency of the GRL model. We provide a holistic performance evaluation for GRL on three datasets using five metrics, i.e., Return On Investment (ROI), Sharpe Ratio, Sortino Ratio, Maximum Drawdown, and Calmar Ratio. The results show that the GRL model outperforms the Equal Weight strategy and the S&P 500 index. In addition, the results of the comparative analysis show that the Share-Extractor GRL and the Separate-Extractor GRL significantly outperform the PPO baseline without a feature extractor. This implies that integrating a GraphSAGE-based feature extractor into the PPO agent can improve its performance and robustness in portfolio optimization tasks.

引用

页数：13

共 50 条

[1] Ghost Expectation Point with Deep Reinforcement Learning in Financial Portfolio Management
Yang, Xuting
Sun, Ruoyu
Ren, Xiaotian
Stefanidis, Angelos
Gu, Fengchen
Su, Jionglong
[J]. 2022 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, CYBERC, 2022, : 136 - 142
[2] Deep graph convolutional reinforcement learning for financial portfolio management-DeepPocket
Soleymani, Farzan
Paquet, Eric
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 182
[3] Deep reinforcement learning for portfolio management
Yang, Shantian
[J]. KNOWLEDGE-BASED SYSTEMS, 2023, 278
[4] Deep reinforcement learning for portfolio selection
Jiang, Yifu
Olmo, Jose
Atwi, Majed
[J]. GLOBAL FINANCE JOURNAL, 2024, 62
[5] Reinforcement learning for deep portfolio optimization
Yan, Ruyu
Jin, Jiafei
Han, Kun
[J]. ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (09): : 5176 - 5200
[6] Cryptocurrency Portfolio Management with Deep Reinforcement Learning
Jiang, Zhengyao
Liang, Jinjun
[J]. PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2017, : 905 - 913
[7] Deep Reinforcement Learning for Quantitative Portfolio Management
Wei, Ziqiang
Chen, Deng
[J]. 2023 THE 6TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA 2023, 2023, : 237 - 242
[8] Deep Reinforcement Learning Task for Portfolio Construction
Belyakov, Boris
Sizykh, Dmitry
[J]. 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 1077 - 1082
[9] Deep Reinforcement Learning (DRL) for Portfolio Allocation
Benhamou, Eric
Saltiel, David
Ohana, Jean Jacques
Atif, Jamal
Laraki, Rida
[J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 527 - 531
[10] Financial portfolio optimization with online deep reinforcement learning and restricted stacked autoencoder-DeepBreath
Soleymani, Farzan
Paquet, Eric
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 156

← 1 2 3 4 5 →