共 50 条
- [21] Distributional Reward Decomposition for Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [22] Hierarchical average reward reinforcement learning [J]. Journal of Machine Learning Research, 2007, 8 : 2629 - 2669
- [23] Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning [J]. Journal of Artificial Intelligence Research, 2022, 73 : 173 - 208
- [24] Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 73 : 173 - 208
- [25] Positivity and reward [J]. ARCHIVES OF DISEASE IN CHILDHOOD-EDUCATION AND PRACTICE EDITION, 2019, 104 (04): : 182 - 182
- [27] Actively learning costly reward functions for reinforcement learning [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (01):
- [29] Active Learning for Reward Estimation in Inverse Reinforcement Learning [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 31 - +
- [30] Learning Reward Machines for Partially Observable Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32