Temporal encoding in deep reinforcement learning agents

被引:0
|
作者
Dongyan Lin
Ann Zixiang Huang
Blake Aaron Richards
机构
[1] McGill University,Integrated Program in Neuroscience
[2] Mila,School of Computer Science
[3] McGill University,Department of Neurology and Neurosurgery, Montreal Neurological Institute
[4] McGill University,Learning in Machines and Brains Program
[5] Canadian Institute for Advanced Research,undefined
来源
Scientific Reports | / 13卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Neuroscientists have observed both cells in the brain that fire at specific points in time, known as “time cells”, and cells whose activity steadily increases or decreases over time, known as “ramping cells”. It is speculated that time and ramping cells support temporal computations in the brain and carry mnemonic information. However, due to the limitations in animal experiments, it is difficult to determine how these cells really contribute to behavior. Here, we show that time cells and ramping cells naturally emerge in the recurrent neural networks of deep reinforcement learning models performing simulated interval timing and working memory tasks, which have learned to estimate expected rewards in the future. We show that these cells do indeed carry information about time and items stored in working memory, but they contribute to behavior in large part by providing a dynamic representation on which policy can be computed. Moreover, the information that they do carry depends on both the task demands and the variables provided to the models. Our results suggest that time cells and ramping cells could contribute to temporal and mnemonic calculations, but the way in which they do so may be complex and unintuitive to human observers.
引用
收藏
相关论文
共 50 条
  • [21] Deep Reinforcement Learning of Marked Temporal Point Processes
    Upadhyay, Utkarsh
    De, Abir
    Gomez-Rodrizuez, Manuel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [22] A Procedural Constructive Learning Mechanism with Deep Reinforcement Learning for Cognitive Agents
    Rossi, Leonardo de Lellis
    Rohmer, Eric
    Costa, Paula Dornhofer Paro
    Colombini, Esther Luna
    Simoes, Alexandre da Silva
    Gudwin, Ricardo Ribeiro
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (01)
  • [23] Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning
    Schaff, Charles
    Yunis, David
    Chakrabarti, Ayan
    Walter, Matthew R.
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 9798 - 9805
  • [24] A Procedural Constructive Learning Mechanism with Deep Reinforcement Learning for Cognitive Agents
    Leonardo de Lellis Rossi
    Eric Rohmer
    Paula Dornhofer Paro Costa
    Esther Luna Colombini
    Alexandre da Silva Simões
    Ricardo Ribeiro Gudwin
    Journal of Intelligent & Robotic Systems, 2024, 110
  • [25] Analysing deep reinforcement learning agents trained with domain randomisation
    Dai, Tianhong
    Arulkumaran, Kai
    Gerbert, Tamara
    Tukra, Samyakh
    Behbahani, Feryal
    Bharath, Anil Anthony
    NEUROCOMPUTING, 2022, 493 : 143 - 165
  • [26] GBDT Modeling of Deep Reinforcement Learning Agents Using Distillation
    Hatano, Toshiki
    Tsuneda, Toi
    Suzuki, Yuta
    Imade, Kuniyasu
    Shesimo, Kazuki
    Yamane, Satoshi
    2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS (ICM), 2021,
  • [27] A Survey on Visual Navigation for Artificial Agents With Deep Reinforcement Learning
    Zeng, Fanyu
    Wang, Chen
    Ge, Shuzhi Sam
    IEEE ACCESS, 2020, 8 : 135426 - 135442
  • [28] Autonomous Agents in Snake Game via Deep Reinforcement Learning
    Wei, Zhepei
    Wang, Di
    Zhang, Ming
    Tan, Ah-Hwee
    Miao, Chunyan
    Zhou, You
    2018 IEEE INTERNATIONAL CONFERENCE ON AGENTS (ICA), 2018, : 20 - 25
  • [29] Coordinated behavior of cooperative agents using deep reinforcement learning
    Diallo, Elhadji Amadou Oury
    Sugiyama, Ayumi
    Sugawara, Toshiharu
    NEUROCOMPUTING, 2020, 396 : 230 - 240
  • [30] Boosting Deep Reinforcement Learning Agents with Generative Data Augmentation
    Papagiannis, Tasos
    Alexandridis, Georgios
    Stafylopatis, Andreas
    APPLIED SCIENCES-BASEL, 2024, 14 (01):