(Reinforcement?) Learning to forage optimally

被引:31
|
作者
Kolling, Nils [1 ]
Akam, Thomas [1 ,2 ]
机构
[1] Univ Oxford, Dept Expt Psychol, Oxford, England
[2] Champalimaud Ctr Unknown, Champalimaud Neurosci Program, Lisbon, Portugal
基金
英国惠康基金;
关键词
MEDIAL PREFRONTAL CORTEX; MODEL-FREE; NEURAL MECHANISMS; DECISION-MAKING; REWARD; HABITS; TIME; STRIATUM; PREDICTION; BEHAVIOR;
D O I
10.1016/j.conb.2017.08.008
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Foraging effectively is critical to the survival of all animals and this imperative is thought to have profoundly shaped brain evolution. Decisions made by foraging animals often approximate optimal strategies, but the learning and decision mechanisms generating these choices remain poorly understood. Recent work with laboratory foraging tasks in humans suggest their behaviour is poorly explained by model free reinforcement learning, with simple heuristic strategies better describing behaviour in some tasks, and in others evidence of prospective prediction of the future state of the environment. We suggest that model-based average reward reinforcement learning may provide a common framework for understanding these apparently divergent foraging strategies.
引用
收藏
页码:162 / 169
页数:8
相关论文
共 50 条
  • [1] LEARNING TO FORAGE - OPTIMALLY
    OLLASON, JG
    [J]. THEORETICAL POPULATION BIOLOGY, 1980, 18 (01) : 44 - 56
  • [2] DO CHIPPING SPARROWS FORAGE OPTIMALLY
    PULLIAM, HR
    [J]. ARDEA, 1980, 68 (1-4) : 75 - 82
  • [3] DO BUMBLEBEES FORAGE OPTIMALLY, AND DOES IT MATTER
    HEINRICH, B
    [J]. AMERICAN ZOOLOGIST, 1983, 23 (02): : 273 - 281
  • [4] Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning
    Long, Pinxin
    Fan, Tingxiang
    Liao, Xinyi
    Liu, Wenxi
    Zhang, Hao
    Pan, Jia
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 6252 - 6259
  • [5] COVID-19 vaccine incentive scheduling using an optimally controlled reinforcement learning model
    Stuckey, K.
    Newton, P. K.
    [J]. PHYSICA D-NONLINEAR PHENOMENA, 2023, 445
  • [6] LEARNING TO GROW OPTIMALLY
    Cellarier, L. L.
    [J]. COMPUTATIONAL INTELLIGENCE: FOUNDATIONS AND APPLICATIONS: PROCEEDINGS OF THE 9TH INTERNATIONAL FLINS CONFERENCE, 2010, 4 : 851 - 858
  • [7] DO BLUE JAYS HUNTING FOR CRYPTIC PREY FORAGE OPTIMALLY
    KAMIL, AC
    [J]. BEHAVIOURAL PROCESSES, 1984, 9 (2-3) : 302 - 303
  • [8] Cascade Learning by Optimally Partitioning
    Pang, Yanwei
    Cao, Jiale
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (12) : 4148 - 4161
  • [9] Quickly 'learning' to move optimally
    Brenner, Eli
    Smeets, Jeroen B. J.
    [J]. EXPERIMENTAL BRAIN RESEARCH, 2011, 213 (01) : 153 - 161
  • [10] Quickly ‘learning’ to move optimally
    Eli Brenner
    Jeroen B. J. Smeets
    [J]. Experimental Brain Research, 2011, 213 : 153 - 161