50 entries in total
- [2] Variance Penalized On-Policy and Off-Policy Actor-Critic. Thirty-Fifth AAAI Conference on Artificial Intelligence, Thirty-Third Conference on Innovative Applications of Artificial Intelligence and the Eleventh Symposium on Educational Advances in Artificial Intelligence, 2021, 35: 7899-7907
- [4] Actor-Critic based Improper Reinforcement Learning. International Conference on Machine Learning, Vol 162, 2022
- [6] Selector-Actor-Critic and Tuner-Actor-Critic Algorithms for Reinforcement Learning. 2019 11th International Conference on Wireless Communications and Signal Processing (WCSP), 2019
- [7] Policy-Gradient Based Actor-Critic Algorithms. Proceedings of the 2009 WRI Global Congress on Intelligent Systems, Vol III, 2009: 505-509
- [8] Actor-Critic reinforcement learning based on prior knowledge. Yang, Zhenyu, 1600, Transport and Telecommunication Institute, Lomonosova street 1, Riga, LV-1019, Latvia (18)
- [9] A World Model for Actor–Critic in Reinforcement Learning. Pattern Recognition and Image Analysis, 2023, 33: 467-477
- [10] Distributed Off-Policy Actor-Critic Reinforcement Learning with Policy Consensus. 2019 IEEE 58th Conference on Decision and Control (CDC), 2019: 4674-4679