50 entries in total
- [1] Policy Gradient and Actor–Critic Learning in Continuous Time and Space: Theory and Algorithms [J]. Journal of Machine Learning Research, 2022, 23
- [3] Policy-Gradient Based Actor-Critic Algorithms [J]. PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009 : 505 - 509
- [4] Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 5658 - 5688
- [5] Actor-critic algorithms [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 1008 - 1014
- [6] On actor-critic algorithms [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2003, 42 (04) : 1143 - 1166
- [7] Algorithms for Variance Reduction in a Policy-Gradient Based Actor-Critic Framework [J]. ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2009 : 130 - 136
- [8] Characterizing the Gap Between Actor-Critic and Policy Gradient [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [9] A Robust Approach for Continuous Interactive Actor-Critic Algorithms [J]. IEEE ACCESS, 2021, 9 : 104242 - 104260