A-learning: A new formulation of associative learning theory

被引:8
|
作者
Ghirlanda, Stefano [1 ,2 ,3 ]
Lind, Johan [3 ]
Enquist, Magnus [3 ]
机构
[1] CUNY, Brooklyn Coll, New York, NY 10021 USA
[2] CUNY, Grad Ctr, New York, NY 10021 USA
[3] Stockholm Univ, Stockholm, Sweden
基金
美国国家科学基金会;
关键词
Associative learning; Pavlovian conditioning; Instrumental conditioning; Mathematical model; Conditioned reinforcement; Outcome revaluation; UNCONDITIONED STIMULUS; EXTINCTION; BEHAVIOR; REINFORCEMENT; MODEL; AUTOMAINTENANCE; OPERANT; WATER; ORGANIZATION; CONTINGENCY;
D O I
10.3758/s13423-020-01749-0
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
We present a new mathematical formulation of associative learning focused on non-human animals, which we call A-learning. Building on current animal learning theory and machine learning, A-learning is composed of two learning equations, one for stimulus-response values and one for stimulus values (conditioned reinforcement). A third equation implements decision-making by mapping stimulus-response values to response probabilities. We show that A-learning can reproduce the main features of: instrumental acquisition, including the effects of signaled and unsignaled non-contingent reinforcement; Pavlovian acquisition, including higher-order conditioning, omission training, autoshaping, and differences in form between conditioned and unconditioned responses; acquisition of avoidance responses; acquisition and extinction of instrumental chains and Pavlovian higher-order conditioning; Pavlovian-to-instrumental transfer; Pavlovian and instrumental outcome revaluation effects, including insight into why these effects vary greatly with training procedures and with the proximity of a response to the reinforcer. We discuss the differences between current theory and A-learning, such as its lack of stimulus-stimulus and response-stimulus associations, and compare A-learning with other temporal-difference models from machine learning, such as Q-learning, SARSA, and the actor-critic model. We conclude that A-learning may offer a more convenient view of associative learning than current mathematical models, and point out areas that need further development.
引用
收藏
页码:1166 / 1194
页数:29
相关论文
共 50 条
  • [31] Principal deuterium Hugoniot via quantum Monte Carlo and A-learning
    Tenti, Giacomo
    Nakano, Kousuke
    Tirelli, Andrea
    Sorella, Sandro
    Casula, Michele
    PHYSICAL REVIEW B, 2024, 110 (04)
  • [32] HIGH-DIMENSIONAL A-LEARNING FOR OPTIMAL DYNAMIC TREATMENT REGIMES
    Shi, Chengchun
    Fan, Ailin
    Song, Rui
    Lu, Wenbin
    ANNALS OF STATISTICS, 2018, 46 (03): : 925 - 957
  • [33] New conceptual associative learning in amnesia: A case study
    Rajaram, S
    Coslett, HB
    JOURNAL OF MEMORY AND LANGUAGE, 2000, 43 (02) : 291 - 315
  • [34] Associative learning of concepts
    Rockai, Viliam
    Kende, Robert
    Computer Assisted Mechanics and Engineering Sciences, 2007, 14 (04): : 737 - 743
  • [35] Tolman on associative learning
    Guthrie, ER
    PSYCHOLOGICAL REVIEW, 1937, 44 : 525 - 528
  • [36] REPETITION AND ASSOCIATIVE LEARNING
    CLARK, LL
    LANSFORD, TG
    DALLENBACH, KM
    AMERICAN JOURNAL OF PSYCHOLOGY, 1960, 73 (01): : 22 - 44
  • [37] Associative learning and timing
    Kirkpatrick, Kimberly
    Balsam, Peter D.
    CURRENT OPINION IN BEHAVIORAL SCIENCES, 2016, 8 : 181 - 185
  • [38] Time and Associative Learning
    Balsam, Peter D.
    Drew, Michael R.
    Gallistel, C. R.
    COMPARATIVE COGNITION & BEHAVIOR REVIEWS, 2010, 5 : 1 - 22
  • [39] Associative Learning in Invertebrates
    Hawkins, Robert D.
    Byrne, John H.
    COLD SPRING HARBOR PERSPECTIVES IN BIOLOGY, 2015, 7 (05): : 1 - 18
  • [40] Bayesian associative learning
    Shanks, David R.
    TRENDS IN COGNITIVE SCIENCES, 2006, 10 (11) : 477 - 478