Active inference and learning

被引:305
|
作者
Friston, Karl [1 ]
FitzGerald, Thomas [1 ,2 ]
Rigoli, Francesco [1 ]
Schwartenbeck, Philipp [1 ,2 ,3 ,4 ]
O'Doherty, John [5 ]
Pezzulo, Giovanni [6 ]
机构
[1] UCL, Wellcome Trust Ctr Neuroimaging, 12 Queen Sq, London, England
[2] Max Planck UCL Ctr Computat Psychiat & Ageing Res, London, England
[3] Salzburg Univ, Ctr Neurocognit Res, Salzburg, Austria
[4] Paracelsus Med Univ Salzburg, Inst Neurosci, Christian Doppler Klin, Salzburg, Austria
[5] CALTECH, Caltech Brain Imaging Ctr, Pasadena, CA 91125 USA
[6] CNR, Inst Cognit Sci & Technol, Rome, Italy
来源
基金
英国惠康基金;
关键词
Active inference; Habit learning; Bayesian inference; Goal-directed; Free energy; Information gain; Bayesian surprise; Epistemic value; Exploration; Exploitation; DECISION-MAKING; INFORMATION; PREDICTION; CHOICE; MODELS; EXPLORATION; SURPRISE; NOVELTY; UNCERTAINTY; COMPLEXITY;
D O I
10.1016/j.neubiorev.2016.06.022
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
This paper offers an active inference account of choice behaviour and learning. It focuses on the distinction between goal-directed and habitual behaviour and how they contextualise each other. We show that habits emerge naturally (and autodidactically) from sequential policy optimisation when agents are equipped with state-action policies. In active inference, behaviour has explorative (epistemic) and exploitative (pragmatic) aspects that are sensitive to ambiguity and risk respectively, where epistemic (ambiguity-resolving) behaviour enables pragmatic (reward-seeking) behaviour and the subsequent emergence of habits. Although goal-directed and habitual policies are usually associated with model-based and model-free schemes, we find the more important distinction is between belief-free and belief-based schemes. The underlying (variational) belief updating provides a comprehensive (if metaphorical) process theory for several phenomena, including the transfer of dopamine responses, reversal learning, habit formation and devaluation. Finally, we show that active inference reduces to a classical (Bellman) scheme, in the absence of ambiguity. (C) 2016 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license
引用
收藏
页码:862 / 879
页数:18
相关论文
共 50 条
  • [41] Resilience and active inference
    Miller, Mark
    Albarracin, Mahault
    Pitliya, Riddhi J.
    Kiefer, Alex
    Mago, Jonas
    Gorman, Claire
    Friston, Karl J.
    Ramstead, Maxwell J. D.
    FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [42] Active inference and agency
    Friston, Karl
    COGNITIVE NEUROSCIENCE, 2014, 5 (02) : 119 - 121
  • [43] Contrastive Active Inference
    Mazzaglia, Pietro
    Verbelen, Tim
    Dhoedt, Bart
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [44] Active Inference and Abduction
    Pietarinen, Ahti-Veikko
    Beni, Majid D.
    BIOSEMIOTICS, 2021, 14 (02) : 499 - 517
  • [45] Deep active inference
    Ueltzhoeffer, Kai
    BIOLOGICAL CYBERNETICS, 2018, 112 (06) : 547 - 573
  • [46] Impulsivity and Active Inference
    Mirza, M. Berk
    Adams, Rick A.
    Parr, Thomas
    Friston, Karl
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2019, 31 (02) : 202 - 220
  • [47] Scaling Active Inference
    Tschantz, Alexander
    Baltieri, Manuel
    Seth, Anil K.
    Buckley, Christopher L.
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [48] Deep active inference
    Kai Ueltzhöffer
    Biological Cybernetics, 2018, 112 : 547 - 573
  • [49] An active learning-based approach for location-aware acquaintance inference
    Chen, Bo-Heng
    Li, Cheng-Te
    Chuang, Kun-Ta
    Pang, Jun
    Zhang, Yang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 59 (03) : 539 - 569
  • [50] Active Learning of Equivalence Relations by Minimizing the Expected Loss Using Constraint Inference
    Rendle, Steffen
    Schmidt-Thieme, Lars
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 1001 - 1006