Dealing With Sparse Rewards Using Graph Neural Networks

被引：1

作者：

Gerasyov, Matvey ^{[1
,2
]}

Makarov, Ilya ^{[2
,3
,4
]}

机构：

[1] HSE Univ, Sch Data Anal & Artificial Intelligence, Moscow 101000, Russia

[2] HSE Univ, Lab Algorithms & Technol Network Anal, Nizhnii Novgorod 603155, Russia

[3] Natl Univ Sci & Technol NUST MISiS, AI Ctr, Moscow 119049, Russia

[4] Artificial Intelligence Res Inst AIRI, Moscow 105064, Russia

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Deep reinforcement learning (DRL); graph neural networks (GNNs); partially observable Markov decision process (POMDP); reward shaping;

D O I：

10.1109/ACCESS.2023.3305927

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep reinforcement learning in partially observable environments is a difficult task in itself and can be further complicated by a sparse reward signal. Most tasks involving navigation in three-dimensional environments provide the agent with minimal information. Typically, the agent receives a visual observation input from the environment and is rewarded once at the end of the episode. A good reward function could substantially improve the convergence of reinforcement learning algorithms for such tasks. The classic approach to increasing the density of the reward signal is to augment it with supplementary rewards. This technique is called reward shaping. In this study, we propose two modifications of one of the recent reward shaping methods based on graph convolutional networks: the first involving advanced aggregation functions, and the second utilizing the attention mechanism. We empirically validate the effectiveness of our solutions for the task of navigation in a 3D environment with sparse rewards. For the solution featuring the attention mechanism, we can also show that the learned attention is concentrated on edges corresponding to important transitions in the 3D environment.

引用

页码：89180 / 89187

页数：8

共 50 条

[41] Neural Graph Learning: Training Neural Networks Using Graphs
Bui, Thang D.
Ravi, Sujith
Ramavajjala, Vivek
WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 64 - 71
[42] Reconstruction of gene regulatory networks using graph neural networks
Paul, M. Emma
Jereesh, A. S.
Kumar, G. Santhosh
APPLIED SOFT COMPUTING, 2024, 163
[43] Graph-to-Sequence Learning using Gated Graph Neural Networks
Beck, Daniel
Haffari, Gholamreza
Cohn, Trevor
PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 273 - 283
[44] Bipartite Graph Coarsening for Text Classification Using Graph Neural Networks
dos Santos, Nicolas Roque
Minatel, Diego
Baria Valejo, Alan Demetrius
Lopes, Alneu de A.
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 589 - 604
[45] Graph neural networks
Corso G.
Stark H.
Jegelka S.
Jaakkola T.
Barzilay R.
Nature Reviews Methods Primers, 4 (1):
[46] Graph neural networks
不详
NATURE REVIEWS METHODS PRIMERS, 2024, 4 (01):
[47] Higher-Order GNNs Meet Efficiency: Sparse Sobolev Graph Neural Networks
Giraldo, Jhony H.
Einizade, Aref
Todorovic, Andjela
Castro-Correa, Jhon A.
Badiey, Mohsen
Bouwmans, Thierry
Malliaros, Fragkiskos D.
arXiv,
[48] Higher-Order GNNs Meet Efficiency: Sparse Sobolev Graph Neural Networks
Giraldo, Jhony H.
Einizade, Aref
Todorovic, Andjela
Castro-Correa, Jhon A.
Badiey, Mohsen
Bouwmans, Thierry
Malliaros, Fragkiskos D.
IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2025, 11 : 11 - 22
[49] Distribution Consistency based Self-Training for Graph Neural Networks with Sparse Labels
Wang, Fali
Zhao, Tianxiang
Wang, Suhang
PROCEEDINGS OF THE 17TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2024, 2024, : 712 - 720
[50] Incorporating Adaptive Sparse Graph Convolutional Neural Networks for Segmentation of Organs at Risk in Radiotherapy
Hu, Junjie
Yu, Chengrong
Zhu, Shengqian
Zhang, Haixian
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024

← 1 2 3 4 5 →