共 50 条
- [1] Full Gradient Deep Reinforcement Learning for Average-Reward Criterion [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
- [2] On-Policy Deep Reinforcement Learning for the Average-Reward Criterion [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [3] Hierarchical average reward reinforcement learning [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 2629 - 2669
- [4] Hierarchical average reward reinforcement learning [J]. Journal of Machine Learning Research, 2007, 8 : 2629 - 2669
- [5] Compatible Reward Inverse Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [6] Reward Identification in Inverse Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [8] Robust Average-Reward Reinforcement Learning [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 80 : 719 - 803
- [9] Robust Average-Reward Reinforcement Learning [J]. Journal of Artificial Intelligence Research, 2024, 80 : 719 - 803
- [10] Active Learning for Reward Estimation in Inverse Reinforcement Learning [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 31 - +