共 50 条
- [1] Full Gradient Deep Reinforcement Learning for Average-Reward Criterion [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
- [2] Robust Average-Reward Reinforcement Learning [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 80 : 719 - 803
- [3] Robust Average-Reward Reinforcement Learning [J]. Journal of Artificial Intelligence Research, 2024, 80 : 719 - 803
- [4] Average-Reward Reinforcement Learning with Trust Region Methods [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2797 - 2803
- [5] Tuning Local Search by Average-Reward Reinforcement Learning [J]. LEARNING AND INTELLIGENT OPTIMIZATION, 2008, 5313 : 192 - 205
- [6] Inverse Reinforcement Learning with the Average Reward Criterion [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [7] An Average-Reward Reinforcement Learning Algorithm based on Schweitzer's Transformation [J]. PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 2966 - 2970
- [8] Average-Reward Learning and Planning with Options [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [9] Decentralized Multi-Agent Reinforcement Learning in Average-Reward Dynamic DCOPs [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1447 - 1455
- [10] An average-reward reinforcement learning algorithm for computing bias-optimal policies [J]. PROCEEDINGS OF THE THIRTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE, VOLS 1 AND 2, 1996, : 875 - 880