Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks

被引:7
|
作者
Han, Dongqi [1 ]
Doya, Kenji [2 ]
Tani, Jun [1 ]
机构
[1] Okinawa Inst Sci & Technol, Cognit Neurorobot Res Unit, Okinawa, Japan
[2] Okinawa Inst Sci & Technol, Neural Computat Unit, Okinawa, Japan
基金
日本学术振兴会;
关键词
Recurrent neural network; Reinforcement learning; Partially observable Markov decision process; Multiple timescale; Compositionality; TIME SCALES; TIMESCALES; MEMORY; GAME; GO;
D O I
10.1016/j.neunet.2020.06.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recurrent neural networks (RNNs) for reinforcement learning (RL) have shown distinct advantages, e.g., solving memory-dependent tasks and meta-learning. However, little effort has been spent on improving RNN architectures and on understanding the underlying neural mechanisms for performance gain. In this paper, we propose a novel, multiple-timescale, stochastic RNN for RL. Empirical results show that the network can autonomously learn to abstract sub-goals and can self-develop an action hierarchy using internal dynamics in a challenging continuous control task. Furthermore, we show that the self-developed compositionality of the network enhances faster re-learning when adapting to a new task that is a re-composition of previously learned sub-goals, than when starting from scratch. We also found that improved performance can be achieved when neural activities are subject to stochastic rather than deterministic dynamics. (C) 2020 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:149 / 162
页数:14
相关论文
共 50 条
  • [1] Self-Organization in Small Cell Networks: A Reinforcement Learning Approach
    Bennis, Mehdi
    Perlaza, Samir M.
    Blasco, Pol
    Han, Zhu
    Poor, H. Vincent
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2013, 12 (07) : 3202 - 3212
  • [2] SELF-ORGANIZATION OF NEURAL NETWORKS
    CLARK, JW
    WINSTON, JV
    RAFELSKI, J
    [J]. PHYSICS LETTERS A, 1984, 102 (04) : 207 - 211
  • [3] Self-organization of social hierarchy on interaction networks
    Fujie, Ryo
    Odagaki, Takashi
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2011,
  • [4] Learning to Learn and Compositionality with Deep Recurrent Neural Networks
    de Freitas, Nando
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 3 - 3
  • [5] Self-Organization and Compositionality in Cognitive Brains
    Tani, Jun
    Friston, Karl
    Haykin, Simon
    [J]. PROCEEDINGS OF THE IEEE, 2014, 102 (04) : 606 - 607
  • [6] A self-organization algorithm of neural networks
    Yamawaki, S
    Imao, S
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 1435 - 1438
  • [7] Self-organization of neural networks for clustering
    Maeda, Y
    Yotsumoto, Y
    Kanata, Y
    [J]. ELECTRICAL ENGINEERING IN JAPAN, 1997, 121 (01) : 51 - 59
  • [8] Self-organization in probabilistic neural networks
    Shiraishi, Y
    Hirasawa, K
    Hu, J
    Murata, J
    [J]. SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 2533 - 2538
  • [9] Incorporating structural plasticity into self-organization recurrent networks for sequence learning
    Yuan, Ye
    Zhu, Yongtong
    Wang, Jiaqi
    Li, Ruoshi
    Xu, Xin
    Fang, Tao
    Huo, Hong
    Wan, Lihong
    Li, Qingdu
    Liu, Na
    Yang, Shiyan
    [J]. FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [10] Self-organization and dynamics reduction in recurrent networks: stimulus presentation and learning
    Dauce, E
    Quoy, M
    Cessac, B
    Doyon, B
    Samuelides, M
    [J]. NEURAL NETWORKS, 1998, 11 (03) : 521 - 533