共 50 条
- [2] Characterizing the Gap Between Actor-Critic and Policy Gradient [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [3] Policy-Gradient Based Actor-Critic Algorithms [J]. PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009, : 505 - 509
- [5] Soft-Robust Actor-Critic Policy-Gradient [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 208 - 218
- [6] Actor-critic algorithm with incremental dual natural policy gradient [J]. 2017, Editorial Board of Journal on Communications (38):
- [7] Exploring Policy Diversity in Parallel Actor-Critic Learning [J]. 2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1196 - 1203
- [8] Generalized Offline Actor-Critic with Behavior Regularization [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (04): : 843 - 855
- [9] Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 5658 - 5688
- [10] Efficient Model Learning Methods for Actor-Critic Control [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03): : 591 - 602