共 50 条
- [2] Feasible Q-Learning for Average Reward Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [3] Robust Average-Reward Reinforcement Learning [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 80 : 719 - 803
- [4] Robust Average-Reward Reinforcement Learning [J]. Journal of Artificial Intelligence Research, 2024, 80 : 719 - 803
- [5] Average-Reward Learning and Planning with Options [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [6] Learning and Planning in Average-Reward Markov Decision Processes [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7665 - 7676
- [7] Average-Reward Reinforcement Learning with Trust Region Methods [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2797 - 2803
- [8] Tuning Local Search by Average-Reward Reinforcement Learning [J]. LEARNING AND INTELLIGENT OPTIMIZATION, 2008, 5313 : 192 - 205
- [10] Full Gradient Deep Reinforcement Learning for Average-Reward Criterion [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211