50 items in total
- [1] Average-Reward Decentralized Markov Decision Processes [J]. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007: 1997 - 2002
- [2] Learning and Planning in Average-Reward Markov Decision Processes [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7665 - 7676
- [4] Incremental Improvements of Heuristic Policies for Average-Reward Markov Decision Processes [J]. IFAC PAPERSONLINE, 2020, 53 (02): 1721 - 1728
- [5] NECESSARY CONDITIONS FOR THE OPTIMALITY EQUATION IN AVERAGE-REWARD MARKOV DECISION-PROCESSES [J]. APPLIED MATHEMATICS AND OPTIMIZATION, 1989, 19 (01): 97 - 112
- [6] A Duality Approach for Regret Minimization in Average-Reward Ergodic Markov Decision Processes [J]. LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 862 - 883
- [7] Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [9] Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes [J]. THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195