Temporal encoding in deep reinforcement learning agents

被引：0

作者：

Dongyan Lin

Ann Zixiang Huang

Blake Aaron Richards

机构：

[1] McGill University,Integrated Program in Neuroscience

[2] Mila,School of Computer Science

[3] McGill University,Department of Neurology and Neurosurgery, Montreal Neurological Institute

[4] McGill University,Learning in Machines and Brains Program

[5] Canadian Institute for Advanced Research,undefined

来源：

Scientific Reports | / 13卷

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Neuroscientists have observed both cells in the brain that fire at specific points in time, known as “time cells”, and cells whose activity steadily increases or decreases over time, known as “ramping cells”. It is speculated that time and ramping cells support temporal computations in the brain and carry mnemonic information. However, due to the limitations in animal experiments, it is difficult to determine how these cells really contribute to behavior. Here, we show that time cells and ramping cells naturally emerge in the recurrent neural networks of deep reinforcement learning models performing simulated interval timing and working memory tasks, which have learned to estimate expected rewards in the future. We show that these cells do indeed carry information about time and items stored in working memory, but they contribute to behavior in large part by providing a dynamic representation on which policy can be computed. Moreover, the information that they do carry depends on both the task demands and the variables provided to the models. Our results suggest that time cells and ramping cells could contribute to temporal and mnemonic calculations, but the way in which they do so may be complex and unintuitive to human observers.

引用

共 50 条

[21] Deep Reinforcement Learning of Marked Temporal Point Processes
Upadhyay, Utkarsh
De, Abir
Gomez-Rodrizuez, Manuel
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[22] A Procedural Constructive Learning Mechanism with Deep Reinforcement Learning for Cognitive Agents
Rossi, Leonardo de Lellis
Rohmer, Eric
Costa, Paula Dornhofer Paro
Colombini, Esther Luna
Simoes, Alexandre da Silva
Gudwin, Ricardo Ribeiro
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (01)
[23] Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning
Schaff, Charles
Yunis, David
Chakrabarti, Ayan
Walter, Matthew R.
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 9798 - 9805
[24] A Procedural Constructive Learning Mechanism with Deep Reinforcement Learning for Cognitive Agents
Leonardo de Lellis Rossi
Eric Rohmer
Paula Dornhofer Paro Costa
Esther Luna Colombini
Alexandre da Silva Simões
Ricardo Ribeiro Gudwin
Journal of Intelligent & Robotic Systems, 2024, 110
[25] Analysing deep reinforcement learning agents trained with domain randomisation
Dai, Tianhong
Arulkumaran, Kai
Gerbert, Tamara
Tukra, Samyakh
Behbahani, Feryal
Bharath, Anil Anthony
NEUROCOMPUTING, 2022, 493 : 143 - 165
[26] GBDT Modeling of Deep Reinforcement Learning Agents Using Distillation
Hatano, Toshiki
Tsuneda, Toi
Suzuki, Yuta
Imade, Kuniyasu
Shesimo, Kazuki
Yamane, Satoshi
2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS (ICM), 2021,
[27] A Survey on Visual Navigation for Artificial Agents With Deep Reinforcement Learning
Zeng, Fanyu
Wang, Chen
Ge, Shuzhi Sam
IEEE ACCESS, 2020, 8 : 135426 - 135442
[28] Autonomous Agents in Snake Game via Deep Reinforcement Learning
Wei, Zhepei
Wang, Di
Zhang, Ming
Tan, Ah-Hwee
Miao, Chunyan
Zhou, You
2018 IEEE INTERNATIONAL CONFERENCE ON AGENTS (ICA), 2018, : 20 - 25
[29] Coordinated behavior of cooperative agents using deep reinforcement learning
Diallo, Elhadji Amadou Oury
Sugiyama, Ayumi
Sugawara, Toshiharu
NEUROCOMPUTING, 2020, 396 : 230 - 240
[30] Boosting Deep Reinforcement Learning Agents with Generative Data Augmentation
Papagiannis, Tasos
Alexandridis, Georgios
Stafylopatis, Andreas
APPLIED SCIENCES-BASEL, 2024, 14 (01):

← 1 2 3 4 5 →