Solving the credit assignment problem: explicit and implicit learning of action sequences with probabilistic outcomes

被引:34
|
作者
Fu, Wai-Tat [1 ]
Anderson, John R. [2 ]
机构
[1] Univ Illinois, Human Factors Div & Beckman Inst, Urbana, IL 61801 USA
[2] Carnegie Mellon Univ, Dept Psychol, Pittsburgh, PA 15213 USA
来源
PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG | 2008年 / 72卷 / 03期
关键词
D O I
10.1007/s00426-007-0113-7
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
In most problem-solving activities, feedback is received at the end of an action sequence. This creates a credit-assignment problem where the learner must associate the feedback with earlier actions, and the interdependencies of actions require the learner to remember past choices of actions. In two studies, we investigated the nature of explicit and implicit learning processes in the credit-assignment problem using a probabilistic sequential choice task with and without a secondary memory task. We found that when explicit learning was dominant, learning was faster to select the better option in their first choices than in the last choices. When implicit reinforcement learning was dominant, learning was faster to select the better option in their last choices than in their first choices. Consistent with the probability-learning and sequence-learning literature, the results show that credit assignment involves two processes: an explicit memory encoding process that requires memory rehearsals and an implicit reinforcement-learning process that propagates credits backwards to previous choices.
引用
收藏
页码:321 / 330
页数:10
相关论文
共 50 条
  • [41] Investigating the predictive validity of implicit and explicit measures of motivation in problem-solving behavioural tasks
    Keatley, David
    Clarke, David. D.
    Hagger, Martin S.
    BRITISH JOURNAL OF SOCIAL PSYCHOLOGY, 2013, 52 (03) : 510 - 524
  • [42] Error-driven Input Modulation: Solving the Credit Assignment Problem without a Backward Pass
    Dellaferrera, Giorgia
    Kreiman, Gabriel
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [43] Alleviating Credit Assignment Problem Using Deep Representation Learning with Application to Push Recovery Learning
    Davari, Mohammadjavad
    Alipour, Khalil
    Hadi, Alireza
    2017 ARTIFICIAL INTELLIGENCE AND ROBOTICS (IRANOPEN), 2017, : 109 - 114
  • [44] Effects of sleep loss, time of day, and extended mental work on implicit and explicit learning of sequences
    Heuer, H
    Spijkers, W
    Kiesswetter, E
    Schmidtke, V
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-APPLIED, 1998, 4 (02) : 139 - 162
  • [45] Explicit and implicit learning of event sequences: Evidence from event-related brain potentials
    Eimer, M
    Goschke, T
    Schlaghecken, F
    Sturmer, B
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1996, 22 (04) : 970 - 987
  • [46] Addressing the Credit Assignment Problem in Treatment Outcome Prediction using Temporal Difference Learning
    Harati, Sahar
    Crowell, Andrea
    Mayberg, Helen
    Nemati, Shamim
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2020, 2020, : 43 - 54
  • [47] Addition of Learning to Critic Agent as a Solution to the Multi-Agent Credit Assignment Problem
    Rahaie, Zahra
    Beigy, Hamid
    2009 FIFTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING, COMPUTING WITH WORDS AND PERCEPTIONS IN SYSTEM ANALYSIS, DECISION AND CONTROL, 2010, : 219 - 222
  • [48] IMPLICIT VERSUS EXPLICIT LEARNING-PROCESSES IN A PROBABILISTIC, CONTINUOUS FINE-MOTOR CATCHING TASK
    GREEN, TD
    FLOWERS, JH
    JOURNAL OF MOTOR BEHAVIOR, 1991, 23 (04) : 293 - 300
  • [49] INTEGRATED LEARNING - EXPLICIT STRATEGIES AND THEIR ROLE IN PROBLEM-SOLVING INSTRUCTION FOR STUDENTS WITH LEARNING-DISABILITIES
    HOLLINGSWORTH, M
    WOODWARD, J
    EXCEPTIONAL CHILDREN, 1993, 59 (05) : 444 - 455
  • [50] MATHEMATICS PROBLEM SOLVING PROFESSIONAL LEARNING THROUGH COLLABORATIVE ACTION RESEARCH
    Mgombelo, Joyce
    Jaipal-Jamani, Kamini
    PROCEEDINGS OF THE SEVENTH CONGRESS OF THE EUROPEAN SOCIETY FOR RESEARCH IN MATHEMATICS EDUCATION (CERME 7), 2011, : 2766 - 2776