共 50 条
- [2] Policy-Gradient Based Actor-Critic Algorithms [J]. PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009, : 505 - 509
- [3] Soft-Robust Actor-Critic Policy-Gradient [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 208 - 218
- [4] Actor-critic algorithm with incremental dual natural policy gradient [J]. 2017, Editorial Board of Journal on Communications (38):
- [5] Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 5658 - 5688
- [6] Noisy Importance Sampling Actor-Critic: An Off-Policy Actor-Critic With Experience Replay [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
- [9] Characterizing Motor Control of Mastication With Soft Actor-Critic [J]. FRONTIERS IN HUMAN NEUROSCIENCE, 2020, 14
- [10] Algorithms for Variance Reduction in a Policy-Gradient Based Actor-Critic Framework [J]. ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2009, : 130 - 136