共 50 条
- [22] A Novel Q-learning Algorithm with Function Approximation for Constrained Markov Decision Processes 2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 400 - 405
- [23] Semi-Markov Offline Reinforcement Learning for Healthcare CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, VOL 174, 2022, 174 : 119 - 137
- [24] A NEW REINFORCEMENT LEARNING ALGORITHM WITH FIXED EXPLORATION FOR SEMI-MARKOV CONTROL IN PREVENTIVE MAINTENANCE PROCEEDINGS OF THE ASME 12TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE - 2017, VOL 3, 2017,
- [26] A sensitivity view of Markov decision processes and reinforcement learning MODELING, CONTROL AND OPTIMIZATION OF COMPLEX SYSTEMS: IN HONOR OF PROFESSOR YU-CHI HO, 2003, 14 : 261 - 283
- [27] A Sublinear-Regret Reinforcement Learning Algorithm on Constrained Markov Decision Processes with reset action ICMLSC 2020: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, 2020, : 51 - 55
- [28] Continuous-state reinforcement learning with fuzzy approximation ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS, 2008, 4865 : 27 - +
- [30] Multivariate Decision Tree Function Approximation for Reinforcement Learning NEURAL INFORMATION PROCESSING: THEORY AND ALGORITHMS, PT I, 2010, 6443 : 687 - 694