50 entries in total
- [33] Achieving Mean-Variance Efficiency by Continuous-Time Reinforcement Learning. Proceedings of the 3rd ACM International Conference on AI in Finance, ICAIF 2022, 2022, pp. 377-385
- [35] Efficient Exploration in Continuous-time Model-based Reinforcement Learning. Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023
- [36] Fault-Tolerant Control of Degrading Systems with On-Policy Reinforcement Learning. IFAC-PapersOnLine, 2020, 53(2): 13733-13738
- [37] Competitive reinforcement learning in continuous control tasks. Proceedings of the International Joint Conference on Neural Networks 2003, Vols. 1-4, 2003, pp. 1909-1914
- [39] Two novel on-policy reinforcement learning algorithms based on TD(λ)-methods. 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, 2007, pp. 280-+
- [40] Practical Critic Gradient based Actor Critic for On-Policy Reinforcement Learning. Learning for Dynamics and Control Conference, Vol. 211, 2023