Biologically inspired reinforcement learning: Reward-based decomposition for multi-goal environments

被引:0
|
作者
Zhou, WD [1 ]
Coggins, R [1 ]
机构
[1] Univ Sydney, Sch Elect & Informat Engn, Comp Engn Lab, Sydney, NSW 2006, Australia
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present an emotion-based hierarchical reinforcement learning (HRL) algorithm for environments with multiple sources of reward. The architecture of the system is inspired by the neurobiology of the brain and particularly those areas responsible for emotions, decision making and behaviour execution, being the amygdala, the orbito-frontal cortex and the basal ganglia respectively. The learning problem is decomposed according to sources of reward. A reward source serves as a goal for a given subtask. Each subtask is assigned an artificial emotion indication (AEI) which predicts the reward component associated with the subtask. The AEIs are learned along with the top-level policy simultaneously and used to interrupt subtask execution when the AEIs change significantly. The algorithm is tested in a simulated gridworld which has two sources of reward and is partially observable. Experiments are performed comparing the emotion based algorithm with other HRL algorithms under the same learning conditions. The use of the biologically inspired architecture significantly accelerates the learning process and achieves higher long term reward compared to a human designed policy and a restricted form of the MAXQ algorithm.
引用
收藏
页码:80 / 94
页数:15
相关论文
共 50 条
  • [1] Guided goal generation for hindsight multi-goal reinforcement learning
    Bai, Chenjia
    Liu, Peng
    Zhao, Wei
    Tang, Xianglong
    [J]. NEUROCOMPUTING, 2019, 359 : 353 - 367
  • [2] Reward-weighted DHER Mechanism For Multi-goal Reinforcement Learning With Application To Robotic Manipulation Control
    Wei, Xueyu
    Duan, Lilong
    Xue, Wei
    [J]. JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2023, 26 (12): : 1829 - 1841
  • [3] Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
    Zhao, Rui
    Sun, Xudong
    Tresp, Volker
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [4] Probabilistic Reward-Based Reinforcement Learning for Multi-Agent Pursuit and Evasion
    Zhang, Bo-Kun
    Hu, Bin
    Chen, Long
    Zhang, Ding-Xue
    Cheng, Xin-Ming
    Guan, Zhi-Hong
    [J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3352 - 3357
  • [5] Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning
    Castanet, Nicolas
    Sigaud, Olivier
    Lamprier, Sylvain
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [6] Combining Hindsight with Goal-enhanced Prediction for Multi-goal Reinforcement Learning
    Yang, Rui
    Luo, Feng
    Li, Xiu
    [J]. 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 314 - 321
  • [7] CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
    Colas, Cedric
    Fournier, Pierre
    Sigaud, Olivier
    Chetouani, Mohamed
    Oudeyer, Pierre-Yves
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [8] Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments
    Caselles-Dupre, Hugo
    Sigaud, Olivier
    Chetouani, Mohamed
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [9] Goal Density-based Hindsight Experience Prioritization for Multi-Goal Robot Manipulation Reinforcement Learning
    Kuang, Yingyi
    Weinberg, Abraham Itzhak
    Vogiatzis, George
    Faria, Diego R.
    [J]. 2020 29TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2020, : 432 - 437
  • [10] Hierarchical reinforcement learning for handling sparse rewards in multi-goal navigation
    Yan, Jiangyue
    Luo, Biao
    Xu, Xiaodong
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (06)