50 items in total
- [33] Characterizing the Gap Between Actor-Critic and Policy Gradient. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [37] Variational value learning in advantage actor-critic reinforcement learning. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020: 1955-1960
- [38] AN ACTOR-CRITIC REINFORCEMENT LEARNING ALGORITHM BASED ON ADAPTIVE RBF NETWORK. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009: 984-988