50 entries in total
- [3] A Model of Neuronal Specialization Using Hebbian Policy-Gradient with "Slow" Noise [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT I, 2009, 5768 : 218 - 228
- [5] A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [7] Adaptive Playouts in Monte-Carlo Tree Search with Policy-Gradient Reinforcement Learning [J]. ADVANCES IN COMPUTER GAMES, ACG 2015, 2015, 9525 : 1 - 11
- [9] Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations [J]. PLOS ONE, 2011, 6 (05):