共 50 条
- [1] Generalized Proximal Policy Optimization with Sample Reuse ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [3] Upper confident bound advantage function proximal policy optimization Cluster Computing, 2023, 26 : 2001 - 2010
- [4] Upper confident bound advantage function proximal policy optimization CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (03): : 2001 - 2010
- [5] Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
- [6] Autonomous Valet Parking with Asynchronous Advantage Actor-Critic Proximal Policy Optimization 2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 334 - 340
- [7] Competitive Advantage and Competition Policy in Developing Countries WORLD COMPETITION, 2009, 32 (02): : 275 - +
- [8] Proximal Policy Optimization With Policy Feedback IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (07): : 4600 - 4610
- [9] Coordinated Proximal Policy Optimization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [10] Truly Proximal Policy Optimization 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 113 - 122