(Reinforcement?) Learning to forage optimally

被引:31
|
作者
Kolling, Nils [1 ]
Akam, Thomas [1 ,2 ]
机构
[1] Univ Oxford, Dept Expt Psychol, Oxford, England
[2] Champalimaud Ctr Unknown, Champalimaud Neurosci Program, Lisbon, Portugal
基金
英国惠康基金;
关键词
MEDIAL PREFRONTAL CORTEX; MODEL-FREE; NEURAL MECHANISMS; DECISION-MAKING; REWARD; HABITS; TIME; STRIATUM; PREDICTION; BEHAVIOR;
D O I
10.1016/j.conb.2017.08.008
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Foraging effectively is critical to the survival of all animals and this imperative is thought to have profoundly shaped brain evolution. Decisions made by foraging animals often approximate optimal strategies, but the learning and decision mechanisms generating these choices remain poorly understood. Recent work with laboratory foraging tasks in humans suggest their behaviour is poorly explained by model free reinforcement learning, with simple heuristic strategies better describing behaviour in some tasks, and in others evidence of prospective prediction of the future state of the environment. We suggest that model-based average reward reinforcement learning may provide a common framework for understanding these apparently divergent foraging strategies.
引用
收藏
页码:162 / 169
页数:8
相关论文
共 50 条
  • [41] Curriculum Learning in Reinforcement Learning
    Narvekar, Sanmit
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5195 - 5196
  • [42] Observational Learning by Reinforcement Learning
    Borsa, Diana
    Heess, Nicolas
    Piot, Bilal
    Liu, Siqi
    Hasenclever, Leonard
    Munos, Remi
    Pietquin, Olivier
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1117 - 1124
  • [43] Learning Pessimism for Reinforcement Learning
    Cetin, Edoardo
    Celiktutan, Oya
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6971 - 6979
  • [44] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Morales, Eduardo F.
    Murrieta-Cid, Rafael
    Becerra, Israel
    Esquivel-Basaldua, Marco A.
    [J]. INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 773 - 805
  • [45] Reinforcement learning in learning automata and cellular learning automata via multiple reinforcement signals
    Vafashoar, Reza
    Meybodi, Mohammad Reza
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 169 : 1 - 27
  • [46] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Eduardo F. Morales
    Rafael Murrieta-Cid
    Israel Becerra
    Marco A. Esquivel-Basaldua
    [J]. Intelligent Service Robotics, 2021, 14 : 773 - 805
  • [47] Reinforcement Learning and Inverse Reinforcement Learning with System 1 and System 2
    Peysakhovich, Alexander
    [J]. AIES '19: PROCEEDINGS OF THE 2019 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, 2019, : 409 - 415
  • [48] Reaching optimally over the workspace: a machine learning approach
    Marin, Didier
    Sigaud, Olivier
    [J]. 2012 4TH IEEE RAS & EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL ROBOTICS AND BIOMECHATRONICS (BIOROB), 2012, : 1128 - 1133
  • [49] Learning to Label with Active Learning and Reinforcement Learning
    Tang, Xiu
    Wu, Sai
    Chen, Gang
    Chen, Ke
    Shou, Lidan
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 : 549 - 557
  • [50] Learning to Act Optimally in Partially Observable Multiagent Settings
    Ceren, Roi
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1532 - 1533