共 50 条
- [41] Reward contrast in delay and probability discounting [J]. LEARNING & BEHAVIOR, 2009, 37 (03) : 281 - 288
- [44] Discounting of reward sequences: a test of competing formal models of hyperbolic discounting [J]. FRONTIERS IN PSYCHOLOGY, 2014, 5
- [46] Decentralized Multi-Agent Reinforcement Learning in Average-Reward Dynamic DCOPs [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1447 - 1455
- [47] An average-reward reinforcement learning algorithm for computing bias-optimal policies [J]. PROCEEDINGS OF THE THIRTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE, VOLS 1 AND 2, 1996, : 875 - 880
- [48] Average reward rates enable motivational transfer across independent reinforcement learning tasks [J]. FRONTIERS IN BEHAVIORAL NEUROSCIENCE, 2022, 16
- [50] Scaling model-based average-reward reinforcement learning for product delivery [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 735 - 742