50 items in total
- [1] Learning and Planning in Average-Reward Markov Decision Processes [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7665 - 7676
- [2] Robust Average-Reward Markov Decision Processes [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023: 15215 - 15223
- [3] Average-Reward Decentralized Markov Decision Processes [J]. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007: 1997 - 2002
- [5] Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [6] Finite Sample Analysis of Average-Reward TD Learning and Q-Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [7] Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes [J]. THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
- [8] Incremental Improvements of Heuristic Policies for Average-Reward Markov Decision Processes [J]. IFAC PAPERSONLINE, 2020, 53 (02): 1721 - 1728
- [9] Approximate Relative Value Learning for Average-reward Continuous State MDPs [J]. 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 956 - 964
- [10] NECESSARY CONDITIONS FOR THE OPTIMALITY EQUATION IN AVERAGE-REWARD MARKOV DECISION-PROCESSES [J]. APPLIED MATHEMATICS AND OPTIMIZATION, 1989, 19 (01): 97 - 112