Addictive drugs have been hypothesized to access the same neurophysiological mechanisms as natural learning systems. These natural learning systems can be modeled through temporal-difference reinforcement learning (TDRL), which requires a reward-error signal that has been hypothesized to be carried by dopamine. TDRL learns to predict reward by driving that reward-error signal to zero. By adding a noncompensable drug-induced dopamine increase to a TDRL model, a computational model of addiction is constructed that over-selects actions leading to drug receipt. The model provides an explanation for important aspects of the addiction literature and provides a theoretic viewpoint with which to address other aspects.
机构:
Philosophy, Psychology, and Psychiatry and Behavioral Neuroscience, University of Cincinnati, Cincinnati, OHPhilosophy, Psychology, and Psychiatry and Behavioral Neuroscience, University of Cincinnati, Cincinnati, OH
机构:
UNIV SO CALIF,LOS ANGELES CTY MED CTR,DEPT OTOLARYNGOL HEAD & NECK SURG,LOS ANGELES,CA 90033UNIV SO CALIF,LOS ANGELES CTY MED CTR,DEPT OTOLARYNGOL HEAD & NECK SURG,LOS ANGELES,CA 90033
COPPOLA, C
WESTERN JOURNAL OF MEDICINE,
1991,
155
(02):
: 193
-
193