50 items in total
- [31] Evolution-Guided Policy Gradient in Reinforcement Learning. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [32] Policy Gradient using Weak Derivatives for Reinforcement Learning. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019: 5531 - 5537
- [33] Total stochastic gradient algorithms and applications in reinforcement learning. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [35] Variance reduction techniques for gradient estimates in reinforcement learning. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14: 1507 - 1514
- [36] MQGrad: Reinforcement Learning of Gradient Quantization in Parameter Server. PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018: 83 - 90
- [37] Inverse Reinforcement Learning through Policy Gradient Minimization. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016: 1993 - 1999
- [39] Policy gradient methods for reinforcement learning with function approximation. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12: 1057 - 1063
- [40] Fuzzy Baselines to Stabilize Policy Gradient Reinforcement Learning. EXPLAINABLE AI AND OTHER APPLICATIONS OF FUZZY TECHNIQUES, NAFIPS 2021, 2022, 258: 436 - 446