50 entries in total
- [31] Variance reduction techniques for gradient estimates in reinforcement learning. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14: 1507-1514
- [32] MQGrad: Reinforcement Learning of Gradient Quantization in Parameter Server. PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018: 83-90
- [33] Inverse Reinforcement Learning through Policy Gradient Minimization. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016: 1993-1999
- [35] Policy gradient methods for reinforcement learning with function approximation. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12: 1057-1063
- [36] Fuzzy Baselines to Stabilize Policy Gradient Reinforcement Learning. EXPLAINABLE AI AND OTHER APPLICATIONS OF FUZZY TECHNIQUES, NAFIPS 2021, 2022, 258: 436-446
- [39] Reinforcement learning for continuous action using stochastic gradient ascent. INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998: 288-295
- [40] Meta-Gradient Reinforcement Learning with an Objective Discovered Online. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33