共 50 条
- [31] Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
- [35] A Sojourn-Based Approach to Semi-Markov Reinforcement Learning Journal of Scientific Computing, 2022, 92
- [37] Solving decentralized continuous Markov decision problems with structured reward KI 2007: Advances in Artificial Intelligence, Proceedings, 2007, 4667 : 337 - 351
- [38] BATCH POLICY LEARNING IN AVERAGE REWARD MARKOV DECISION PROCESSES ANNALS OF STATISTICS, 2022, 50 (06): : 3364 - 3387
- [39] Learning and Planning in Average-Reward Markov Decision Processes INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7665 - 7676
- [40] On mean reward variance in semi-Markov processes Mathematical Methods of Operations Research, 2005, 62 : 387 - 397