Graph convolution with topology refinement for Automatic Reinforcement Learning

被引:3
|
作者
Sang, Jianghui [1 ]
Wang, Yongli [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Sci & Technol Informat Syst Engn Lab, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Reward shaping; Graph; Markov decision process; ENTROPY;
D O I
10.1016/j.neucom.2023.126621
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning faces the challenge of sparse rewards. Existing research utilizes reward shaping based on graph convolutional neural networks (GCNs) to address this challenge. However, the automatic construction of optimal graph has been a long standing issue. Here we propose Graph Convolution with Topology Refinement for Automatic Reinforcement Learning (GTR), based on the construction of new latent graph to replace the original input graph for more effective reward shaping. It is found from this work that, the most suitable state node can be extracted through the graph entropy. Subsequently we map the original graph to subset of nodes adaptively to form a new and more compact latent graph. Since GTR utilizes trainable projection vectors for projecting all node features into one-dimensional representation, the inter-connections between the nodes of the newly constructed latent graph are consistent with the original ones. The proposed GTR stems from mathematical grounds, and preliminary experiments have shown that the proposed GTR has considerable improvement on Atari benchmark and Mujoco benchmark. Further experiment and ablation analysis have given further supports to this work.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Star topology convolution for graph representation learning
    Chong Wu
    Zhenan Feng
    Jiangbin Zheng
    Houwang Zhang
    Jiawang Cao
    Hong Yan
    Complex & Intelligent Systems, 2022, 8 : 5125 - 5141
  • [2] Star topology convolution for graph representation learning
    Wu, Chong
    Feng, Zhenan
    Zheng, Jiangbin
    Zhang, Houwang
    Cao, Jiawang
    Yan, Hong
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (06) : 5125 - 5141
  • [3] Graph Signal Processing and Deep Learning: Convolution, Pooling, and Topology
    Cheung, Mark
    Shi, John
    Wright, Oren
    Jiang, Lavendar Y.
    Liu, Xujin
    Moura, Jose M. F.
    IEEE SIGNAL PROCESSING MAGAZINE, 2020, 37 (06) : 139 - 149
  • [4] Automatic Curriculum Graph Generation for Reinforcement Learning Agents
    Svetlik, Maxwell
    Leonetti, Matteo
    Sinapov, Jivko
    Shah, Rishi
    Walker, Nick
    Stone, Peter
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2590 - 2596
  • [5] Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition
    Chen, Yuxin
    Zhang, Ziqi
    Yuan, Chunfeng
    Li, Bing
    Deng, Ying
    Hu, Weiming
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13339 - 13348
  • [6] Reinforcement Learning with Dual Attention Guided Graph Convolution for Relation Extraction
    Li, Zhixin
    Sun, Yaru
    Tang, Suqin
    Zhang, Canlong
    Ma, Huifang
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 946 - 953
  • [7] IAB Topology Design: A Graph Embedding and Deep Reinforcement Learning Approach
    Simsek, Meryem
    Orhan, Oner
    Nassar, Marcel
    Elibol, Oguz
    Nikopour, Hosein
    IEEE COMMUNICATIONS LETTERS, 2021, 25 (02) : 489 - 493
  • [8] Graph Convolution Reinforcement Learning for Decision-Making in Highway Overtaking Scenario
    Meng Xiaoqiang
    Yang Fan
    Li Xueyuan
    Liu Qi
    Gao Xin
    Li Zirui
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 417 - 422
  • [9] Automatic skill acquisition in reinforcement learning using graph centrality measures
    Moradi, Parham
    Shiri, Mohammad Ebrahim
    Rad, Ali Ajdari
    Khadivi, Alireza
    Hasler, Martin
    INTELLIGENT DATA ANALYSIS, 2012, 16 (01) : 113 - 135
  • [10] April: An Automatic Graph Data Management System Based on Reinforcement Learning
    Wang, Hongzhi
    Qi, Zhixin
    Zheng, Lei
    Feng, Yun
    Ouyang, Junfei
    Zhang, Haoqi
    Zhang, Xiangxi
    Shen, Ziming
    Liu, Shirong
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3465 - 3468