共 50 条
- [2] Policy gradient fuzzy reinforcement learning PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 992 - 995
- [5] A modification of gradient policy in reinforcement learning procedure 2012 15TH INTERNATIONAL CONFERENCE ON INTERACTIVE COLLABORATIVE LEARNING (ICL), 2012,
- [6] Adaptive Natural Policy Gradient in Reinforcement Learning PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 605 - 610
- [7] Policy Gradient Method For Robust Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [8] Reinforcement Learning to Rank with Pairwise Policy Gradient PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 509 - 518
- [9] Scalable Multitask Policy Gradient Reinforcement Learning THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1847 - 1853
- [10] A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139