GraphSAGE with deep reinforcement learning for financial portfolio

Cited by: 6
Authors
Sun, Qiguo [1]
Wei, Xueying [2]
Yang, Xibei [1]
Affiliations
[1] Jiangsu Univ Sci & Technol, Sch Comp Sci & Engn, Zhenjiang 212003, Jiangsu, Peoples R China
[2] Shanxi Med Univ, Fenyang Coll, Fenyang 032200, Shanxi, Peoples R China
Keywords
Portfolio management; Deep reinforcement learning; GraphSAGE; Explainable AI; SHAP
DOI
10.1016/j.eswa.2023.122027
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Portfolio optimization is an active management strategy that aims to maximize returns while keeping risk within reasonable limits. Proximal Policy Optimization (PPO), a robust on-policy actor-critic deep reinforcement learning (DRL) model, is gaining popularity in portfolio optimization because it can help reduce emotional biases and take systematic investment actions. However, some research has found that PPO does not perform as well in portfolio optimization as it does in games or robot control. In this paper, a novel coupled GraphSAGE and DRL model (GRL) is proposed that improves the architecture of the PPO agent by introducing a GraphSAGE-based feature extractor to capture the complex non-Euclidean relationships among market indexes, industry indexes, and stocks. In addition, the explainable model SHAP is used to select a small set of important features for GRL learning, and a method for generating a static financial graph is defined. Together, these steps improve the robustness and training efficiency of the GRL model. We provide a holistic performance evaluation of GRL on three datasets using five metrics: Return On Investment (ROI), Sharpe Ratio, Sortino Ratio, Maximum Drawdown, and Calmar Ratio. The results show that the GRL model outperforms the Equal Weight strategy and the S&P 500 index. In addition, the comparative analysis shows that the Share-Extractor GRL and the Separate-Extractor GRL significantly outperform the PPO baseline without a feature extractor. This implies that integrating a GraphSAGE-based feature extractor into the PPO agent can improve its performance and robustness in portfolio optimization tasks.
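The five evaluation metrics named in the abstract can be illustrated with a minimal sketch using their common textbook definitions. The abstract does not state the exact conventions used in the paper, so the annualization factor (252 trading periods per year), the zero risk-free rate, and the function name `portfolio_metrics` are all assumptions made here for illustration.

```python
import numpy as np

def portfolio_metrics(values, periods_per_year=252, risk_free=0.0):
    """Compute ROI, Sharpe, Sortino, maximum drawdown, and Calmar ratio.

    `values` is a series of portfolio values over time. The conventions
    (252 periods per year, annual risk-free rate of 0) are assumptions,
    not taken from the paper.
    """
    values = np.asarray(values, dtype=float)
    returns = values[1:] / values[:-1] - 1.0           # simple per-period returns
    excess = returns - risk_free / periods_per_year    # per-period excess returns

    # Return on investment over the whole period.
    roi = values[-1] / values[0] - 1.0

    # Sharpe ratio: annualized mean excess return per unit of volatility.
    sharpe = np.sqrt(periods_per_year) * excess.mean() / excess.std(ddof=1)

    # Sortino ratio: like Sharpe, but penalizes only downside deviation.
    # Undefined (division by zero) if no period has a negative excess return.
    downside = np.sqrt(np.mean(np.minimum(excess, 0.0) ** 2))
    sortino = np.sqrt(periods_per_year) * excess.mean() / downside

    # Maximum drawdown: largest relative fall from a running peak.
    running_peak = np.maximum.accumulate(values)
    max_drawdown = np.max(1.0 - values / running_peak)

    # Calmar ratio: annualized return divided by maximum drawdown.
    # Undefined if the series never falls below its running peak.
    years = len(returns) / periods_per_year
    annual_return = (values[-1] / values[0]) ** (1.0 / years) - 1.0
    calmar = annual_return / max_drawdown

    return {
        "roi": roi,
        "sharpe": sharpe,
        "sortino": sortino,
        "max_drawdown": max_drawdown,
        "calmar": calmar,
    }

# Toy value series, invented for illustration only.
metrics = portfolio_metrics([100.0, 110.0, 99.0, 120.0])
```

For the toy series above, the ROI is about 20% and the maximum drawdown is about 10% (the fall from 110 to 99). Real evaluations would use much longer series, where the annualized ratios are more meaningful.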
Pages: 13
Related papers
50 in total
  • [1] Ghost Expectation Point with Deep Reinforcement Learning in Financial Portfolio Management
    Yang, Xuting
    Sun, Ruoyu
    Ren, Xiaotian
    Stefanidis, Angelos
    Gu, Fengchen
    Su, Jionglong
    [J]. 2022 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, CYBERC, 2022, : 136 - 142
  • [2] Deep graph convolutional reinforcement learning for financial portfolio management-DeepPocket
    Soleymani, Farzan
    Paquet, Eric
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 182
  • [3] Deep reinforcement learning for portfolio management
    Yang, Shantian
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 278
  • [4] Deep reinforcement learning for portfolio selection
    Jiang, Yifu
    Olmo, Jose
    Atwi, Majed
    [J]. GLOBAL FINANCE JOURNAL, 2024, 62
  • [5] Reinforcement learning for deep portfolio optimization
    Yan, Ruyu
    Jin, Jiafei
    Han, Kun
    [J]. ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (09): : 5176 - 5200
  • [6] Cryptocurrency Portfolio Management with Deep Reinforcement Learning
    Jiang, Zhengyao
    Liang, Jinjun
    [J]. PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2017, : 905 - 913
  • [7] Deep Reinforcement Learning for Quantitative Portfolio Management
    Wei, Ziqiang
    Chen, Deng
    [J]. 2023 THE 6TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA 2023, 2023, : 237 - 242
  • [8] Deep Reinforcement Learning Task for Portfolio Construction
    Belyakov, Boris
    Sizykh, Dmitry
    [J]. 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 1077 - 1082
  • [9] Deep Reinforcement Learning (DRL) for Portfolio Allocation
    Benhamou, Eric
    Saltiel, David
    Ohana, Jean Jacques
    Atif, Jamal
    Laraki, Rida
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 527 - 531
  • [10] Financial portfolio optimization with online deep reinforcement learning and restricted stacked autoencoder-DeepBreath
    Soleymani, Farzan
    Paquet, Eric
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 156