Action control, forward models and expected rewards: representations in reinforcement learning

Cited by: 0
Authors
Rusanen, Anna-Mari [1]
Lappi, Otto [1]
Kuokkanen, Jesse [1]
Pekkanen, Jami [1]
Affiliations
[1] Univ Helsinki, Dept Digital Human, Cognit Sci, POB 59, Helsinki 00014, Finland
Keywords
Representation; Reinforcement learning; Action control; Radical enactivism; Cognitive science; RECEPTIVE FIELDS; MOTOR; ARCHITECTURE; PHILOSOPHY; PRINCIPLES; CEREBELLUM; IMAGERY;
DOI
10.1007/s11229-021-03408-w
Chinese Library Classification
N09 [History of natural science]; B [Philosophy, religion];
Subject classification codes
01; 0101; 010108; 060207; 060305; 0712;
Abstract
The fundamental cognitive problem for active organisms is to decide what to do next in a changing environment. In this article, we analyze motor and action control in computational models that utilize reinforcement learning (RL) algorithms. In reinforcement learning, action control is governed by an action selection policy that maximizes the expected future reward in light of a predictive world model. In this paper we argue that RL provides a way to explicate the so-called action-oriented views of cognitive systems in representational terms.
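The abstract's description of action selection can be illustrated with a minimal sketch. It is not the authors' model: the names `toy_model`, `expected_return` and `select_action`, the four-state toy world, the planning depth, and the discount factor `gamma` are all assumptions introduced here for illustration. The sketch shows a policy that picks the action with the highest expected discounted reward, as predicted by a forward (world) model.

```python
from typing import Callable, List, Tuple

State = int
Action = str
# Hypothetical forward model: (state, action) -> list of
# (probability, next_state, reward) outcomes.
ForwardModel = Callable[[State, Action], List[Tuple[float, State, float]]]


def expected_return(model: ForwardModel, state: State, action: Action,
                    actions: List[Action], depth: int, gamma: float = 0.9) -> float:
    """Expected discounted reward of `action` in `state`, followed by
    greedy choices for the remaining `depth - 1` steps."""
    total = 0.0
    for prob, next_state, reward in model(state, action):
        future = 0.0
        if depth > 1:
            # Look ahead: value of the best follow-up action in the predicted state.
            future = max(
                expected_return(model, next_state, a, actions, depth - 1, gamma)
                for a in actions
            )
        total += prob * (reward + gamma * future)
    return total


def select_action(model: ForwardModel, state: State, actions: List[Action],
                  depth: int = 3, gamma: float = 0.9) -> Action:
    """Policy: choose the action that maximizes the predicted expected return."""
    return max(actions, key=lambda a: expected_return(model, state, a, actions, depth, gamma))


def toy_model(state: State, action: Action) -> List[Tuple[float, State, float]]:
    """Assumed deterministic toy world: states 0..3 on a line,
    reward 1.0 for arriving at state 3 from another state."""
    step = 1 if action == "right" else -1
    next_state = min(3, max(0, state + step))
    reward = 1.0 if next_state == 3 and state != 3 else 0.0
    return [(1.0, next_state, reward)]


if __name__ == "__main__":
    print(select_action(toy_model, 1, ["left", "right"]))
```

In this sketch the forward model plays the role of the predictive world model, and the recursion over predicted states is what the expected future reward is computed from. A learned model or a learned value function could replace the hand-written `toy_model` and the exhaustive look-ahead.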
Pages: 14017-14033
Page count: 17
Related papers
50 in total
  • [21] Reinforcement Learning in Manufacturing Control: Baselines, challenges and ways forward
    Samsonov, Vladimir
    Ben Hicham, Karim
    Meisen, Tobias
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 112
  • [22] Reinforcement Learning in Order to Control Biomechanical Models
    Gottschalk, Simon
    Burger, Michael
    PROGRESS IN INDUSTRIAL MATHEMATICS AT ECMI 2018, 2019, 30 : 521 - 527
  • [23] Action sharpens sensory representations of expected outcomes
    Yon, Daniel
    Gilbert, Sam J.
    de Lange, Floris P.
    Press, Clare
    NATURE COMMUNICATIONS, 2018, 9
  • [24] Action sharpens sensory representations of expected outcomes
    Daniel Yon
    Sam J. Gilbert
    Floris P. de Lange
    Clare Press
    Nature Communications, 9
  • [25] Reinforcement Learning for Joint Optimization of Multiple Rewards
    Agarwal, Mridul
    Aggarwal, Vaneet
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [26] Detecting Rewards Deterioration in Episodic Reinforcement Learning
    Greenberg, Ido
    Mannor, Shie
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [27] Reinforcement learning with pattern-based rewards
    Peters, JF
    Henry, C
    Ramanna, S
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2005, : 267 - 272
  • [28] RewardsOfSum: Exploring Reinforcement Learning Rewards for Summarisation
    Parnell, Jacob
    Unanue, Inigo Jauregi
    Piccardi, Massimo
    SPNLP 2021: THE 5TH WORKSHOP ON STRUCTURED PREDICTION FOR NLP, 2021, : 1 - 11
  • [29] Reinforcement Learning with Non-Markovian Rewards
    Gaon, Maor
    Brafman, Ronen, I
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3980 - 3987
  • [30] Reinforcement Learning with Immediate Rewards and Linear Hypotheses
    Naoki Abe
    Alan W. Biermann
    Philip M. Long
    Algorithmica, 2003, 37 : 263 - 293