Dealing With Sparse Rewards Using Graph Neural Networks

被引:1
|
作者
Gerasyov, Matvey [1 ,2 ]
Makarov, Ilya [2 ,3 ,4 ]
机构
[1] HSE Univ, Sch Data Anal & Artificial Intelligence, Moscow 101000, Russia
[2] HSE Univ, Lab Algorithms & Technol Network Anal, Nizhnii Novgorod 603155, Russia
[3] Natl Univ Sci & Technol NUST MISiS, AI Ctr, Moscow 119049, Russia
[4] Artificial Intelligence Res Inst AIRI, Moscow 105064, Russia
关键词
Deep reinforcement learning (DRL); graph neural networks (GNNs); partially observable Markov decision process (POMDP); reward shaping;
D O I
10.1109/ACCESS.2023.3305927
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning in partially observable environments is a difficult task in itself and can be further complicated by a sparse reward signal. Most tasks involving navigation in three-dimensional environments provide the agent with minimal information. Typically, the agent receives a visual observation input from the environment and is rewarded once at the end of the episode. A good reward function could substantially improve the convergence of reinforcement learning algorithms for such tasks. The classic approach to increasing the density of the reward signal is to augment it with supplementary rewards. This technique is called reward shaping. In this study, we propose two modifications of one of the recent reward shaping methods based on graph convolutional networks: the first involving advanced aggregation functions, and the second utilizing the attention mechanism. We empirically validate the effectiveness of our solutions for the task of navigation in a 3D environment with sparse rewards. For the solution featuring the attention mechanism, we can also show that the learned attention is concentrated on edges corresponding to important transitions in the 3D environment.
引用
收藏
页码:89180 / 89187
页数:8
相关论文
共 50 条
  • [41] Neural Graph Learning: Training Neural Networks Using Graphs
    Bui, Thang D.
    Ravi, Sujith
    Ramavajjala, Vivek
    WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 64 - 71
  • [42] Reconstruction of gene regulatory networks using graph neural networks
    Paul, M. Emma
    Jereesh, A. S.
    Kumar, G. Santhosh
    APPLIED SOFT COMPUTING, 2024, 163
  • [43] Graph-to-Sequence Learning using Gated Graph Neural Networks
    Beck, Daniel
    Haffari, Gholamreza
    Cohn, Trevor
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 273 - 283
  • [44] Bipartite Graph Coarsening for Text Classification Using Graph Neural Networks
    dos Santos, Nicolas Roque
    Minatel, Diego
    Baria Valejo, Alan Demetrius
    Lopes, Alneu de A.
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 589 - 604
  • [45] Graph neural networks
    Corso G.
    Stark H.
    Jegelka S.
    Jaakkola T.
    Barzilay R.
    Nature Reviews Methods Primers, 4 (1):
  • [46] Graph neural networks
    不详
    NATURE REVIEWS METHODS PRIMERS, 2024, 4 (01):
  • [47] Higher-Order GNNs Meet Efficiency: Sparse Sobolev Graph Neural Networks
    Giraldo, Jhony H.
    Einizade, Aref
    Todorovic, Andjela
    Castro-Correa, Jhon A.
    Badiey, Mohsen
    Bouwmans, Thierry
    Malliaros, Fragkiskos D.
    arXiv,
  • [48] Higher-Order GNNs Meet Efficiency: Sparse Sobolev Graph Neural Networks
    Giraldo, Jhony H.
    Einizade, Aref
    Todorovic, Andjela
    Castro-Correa, Jhon A.
    Badiey, Mohsen
    Bouwmans, Thierry
    Malliaros, Fragkiskos D.
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2025, 11 : 11 - 22
  • [49] Distribution Consistency based Self-Training for Graph Neural Networks with Sparse Labels
    Wang, Fali
    Zhao, Tianxiang
    Wang, Suhang
    PROCEEDINGS OF THE 17TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2024, 2024, : 712 - 720
  • [50] Incorporating Adaptive Sparse Graph Convolutional Neural Networks for Segmentation of Organs at Risk in Radiotherapy
    Hu, Junjie
    Yu, Chengrong
    Zhu, Shengqian
    Zhang, Haixian
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024