Graph convolution with topology refinement for Automatic Reinforcement Learning

被引：3

作者：

Sang, Jianghui ^{[1
]}

Wang, Yongli ^{[1
,2
]}

机构：

[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China

[2] Sci & Technol Informat Syst Engn Lab, Nanjing, Peoples R China

来源：

NEUROCOMPUTING | 2023年 / 554卷

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Reward shaping; Graph; Markov decision process; ENTROPY;

D O I：

10.1016/j.neucom.2023.126621

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning faces the challenge of sparse rewards. Existing research utilizes reward shaping based on graph convolutional neural networks (GCNs) to address this challenge. However, the automatic construction of optimal graph has been a long standing issue. Here we propose Graph Convolution with Topology Refinement for Automatic Reinforcement Learning (GTR), based on the construction of new latent graph to replace the original input graph for more effective reward shaping. It is found from this work that, the most suitable state node can be extracted through the graph entropy. Subsequently we map the original graph to subset of nodes adaptively to form a new and more compact latent graph. Since GTR utilizes trainable projection vectors for projecting all node features into one-dimensional representation, the inter-connections between the nodes of the newly constructed latent graph are consistent with the original ones. The proposed GTR stems from mathematical grounds, and preliminary experiments have shown that the proposed GTR has considerable improvement on Atari benchmark and Mujoco benchmark. Further experiment and ablation analysis have given further supports to this work.

引用

页数：7

共 50 条

[1] Star topology convolution for graph representation learning
Chong Wu
Zhenan Feng
Jiangbin Zheng
Houwang Zhang
Jiawang Cao
Hong Yan
Complex & Intelligent Systems, 2022, 8 : 5125 - 5141
[2] Star topology convolution for graph representation learning
Wu, Chong
Feng, Zhenan
Zheng, Jiangbin
Zhang, Houwang
Cao, Jiawang
Yan, Hong
COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (06) : 5125 - 5141
[3] Graph Signal Processing and Deep Learning: Convolution, Pooling, and Topology
Cheung, Mark
Shi, John
Wright, Oren
Jiang, Lavendar Y.
Liu, Xujin
Moura, Jose M. F.
IEEE SIGNAL PROCESSING MAGAZINE, 2020, 37 (06) : 139 - 149
[4] Automatic Curriculum Graph Generation for Reinforcement Learning Agents
Svetlik, Maxwell
Leonetti, Matteo
Sinapov, Jivko
Shah, Rishi
Walker, Nick
Stone, Peter
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2590 - 2596
[5] Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition
Chen, Yuxin
Zhang, Ziqi
Yuan, Chunfeng
Li, Bing
Deng, Ying
Hu, Weiming
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13339 - 13348
[6] Reinforcement Learning with Dual Attention Guided Graph Convolution for Relation Extraction
Li, Zhixin
Sun, Yaru
Tang, Suqin
Zhang, Canlong
Ma, Huifang
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 946 - 953
[7] IAB Topology Design: A Graph Embedding and Deep Reinforcement Learning Approach
Simsek, Meryem
Orhan, Oner
Nassar, Marcel
Elibol, Oguz
Nikopour, Hosein
IEEE COMMUNICATIONS LETTERS, 2021, 25 (02) : 489 - 493
[8] Graph Convolution Reinforcement Learning for Decision-Making in Highway Overtaking Scenario
Meng Xiaoqiang
Yang Fan
Li Xueyuan
Liu Qi
Gao Xin
Li Zirui
2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 417 - 422
[9] Automatic skill acquisition in reinforcement learning using graph centrality measures
Moradi, Parham
Shiri, Mohammad Ebrahim
Rad, Ali Ajdari
Khadivi, Alireza
Hasler, Martin
INTELLIGENT DATA ANALYSIS, 2012, 16 (01) : 113 - 135
[10] April: An Automatic Graph Data Management System Based on Reinforcement Learning
Wang, Hongzhi
Qi, Zhixin
Zheng, Lei
Feng, Yun
Ouyang, Junfei
Zhang, Haoqi
Zhang, Xiangxi
Shen, Ziming
Liu, Shirong
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3465 - 3468

← 1 2 3 4 5 →