Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks

被引:7
|
作者
Han, Dongqi [1 ]
Doya, Kenji [2 ]
Tani, Jun [1 ]
机构
[1] Okinawa Inst Sci & Technol, Cognit Neurorobot Res Unit, Okinawa, Japan
[2] Okinawa Inst Sci & Technol, Neural Computat Unit, Okinawa, Japan
基金
日本学术振兴会;
关键词
Recurrent neural network; Reinforcement learning; Partially observable Markov decision process; Multiple timescale; Compositionality; TIME SCALES; TIMESCALES; MEMORY; GAME; GO;
D O I
10.1016/j.neunet.2020.06.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recurrent neural networks (RNNs) for reinforcement learning (RL) have shown distinct advantages, e.g., solving memory-dependent tasks and meta-learning. However, little effort has been spent on improving RNN architectures and on understanding the underlying neural mechanisms for performance gain. In this paper, we propose a novel, multiple-timescale, stochastic RNN for RL. Empirical results show that the network can autonomously learn to abstract sub-goals and can self-develop an action hierarchy using internal dynamics in a challenging continuous control task. Furthermore, we show that the self-developed compositionality of the network enhances faster re-learning when adapting to a new task that is a re-composition of previously learned sub-goals, than when starting from scratch. We also found that improved performance can be achieved when neural activities are subject to stochastic rather than deterministic dynamics. (C) 2020 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:149 / 162
页数:14
相关论文
共 50 条
  • [41] Microscopic self-organization in networks
    Sun, K.
    Ouyang, Q.
    [J]. Physical Review E - Statistical, Nonlinear, and Soft Matter Physics, 2001, 64 (2 II): : 261111 - 261115
  • [42] Self-organization of collaboration networks
    Ramasco, JJ
    Dorogovtsev, SN
    Pastor-Satorras, R
    [J]. PHYSICAL REVIEW E, 2004, 70 (03)
  • [43] Self-organization in sensor networks
    Collier, TC
    Taylor, C
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2004, 64 (07) : 866 - 873
  • [44] Self-organization in networks today
    Dixit, S
    Sarma, A
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2005, 43 (08) : 77 - 77
  • [45] Microscopic self-organization in networks
    Sun, K
    Ouyang, Q
    [J]. PHYSICAL REVIEW E, 2001, 64 (02):
  • [46] Emergence of multimodal action representations from neural network self-organization
    Parisi, German I.
    Tani, Jun
    Weber, Cornelius
    Wermter, Stefan
    [J]. COGNITIVE SYSTEMS RESEARCH, 2017, 43 : 208 - 221
  • [47] Self-Organization in Decentralized Networks: A Trial and Error Learning Approach
    Rose, Luca
    Perlaza, Satnir M.
    Le Martret, Christophe J.
    Debbah, Merouane
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2014, 13 (01) : 268 - 279
  • [48] Self-Organization of Physics-Informed Mechanisms in Recurrent Neural Networks: A Case Study in Pneumatic Artificial Muscles
    Sun, Wentao
    Akashi, Nozomi
    Kuniyoshi, Yasuo
    Nakajima, Kohei
    [J]. 2022 IEEE 5TH INTERNATIONAL CONFERENCE ON SOFT ROBOTICS (ROBOSOFT), 2022, : 409 - 415
  • [49] Self-organization, learning and language
    Fang, FK
    [J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 1906 - 1911
  • [50] Reinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks
    Brosch, Tobias
    Neumann, Heiko
    Roelfsema, Pieter R.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (10)