50 items in total
- [1] Off-policy and on-policy reinforcement learning with the Tsetlin machine [J]. Applied Intelligence, 2023, 53 : 8596 - 8613
- [2] Tabu search exploration for on-policy reinforcement learning [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003 : 2910 - 2915
- [4] On-Policy Deep Reinforcement Learning for the Average-Reward Criterion [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [5] Offline Reinforcement Learning with On-Policy Q-Function Regularization [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 455 - 471
- [6] Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020 : 118 - 125
- [7] Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
- [8] Fault-Tolerant Control of Degrading Systems with On-Policy Reinforcement Learning [J]. IFAC PAPERSONLINE, 2020, 53 (02) : 13733 - 13738
- [9] Two novel on-policy reinforcement learning algorithms based on TD(λ)-methods [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007 : 280 - +
- [10] Practical Critic Gradient based Actor Critic for On-Policy Reinforcement Learning [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211