共 50 条
- [2] Policy-Gradient Based Actor-Critic Algorithms [J]. PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009, : 505 - 509
- [5] Algorithms for Variance Reduction in a Policy-Gradient Based Actor-Critic Framework [J]. ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2009, : 130 - 136
- [6] Characterizing the Gap Between Actor-Critic and Policy Gradient [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [7] Soft-Robust Actor-Critic Policy-Gradient [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 208 - 218
- [8] Actor-critic algorithm with incremental dual natural policy gradient [J]. 2017, Editorial Board of Journal on Communications (38):
- [9] Actor-critic algorithms [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 1008 - 1014
- [10] On actor-critic algorithms [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2003, 42 (04) : 1143 - 1166