共 50 条
- [1] Average Reward Optimization with Multiple Discounting Reinforcement Learners [J]. NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 789 - 800
- [2] The effects of real versus hypothetical reward on delay and probability discounting [J]. QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2010, 63 (06): : 1072 - 1084
- [5] On Average Versus Discounted Reward Temporal-Difference Learning [J]. Machine Learning, 2002, 49 : 179 - 191
- [9] Reward contrast in delay and probability discounting [J]. Learning & Behavior, 2009, 37 : 281 - 288
- [10] Reward contrast in delay and probability discounting [J]. LEARNING & BEHAVIOR, 2009, 37 (03) : 281 - 288