50 entries in total
- [41] Memory Reduction through Experience Classification for Deep Reinforcement Learning with Prioritized Experience Replay. PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2019), 2019: 166-171
- [42] Parallelized Synchronous Multi-Agent Deep Reinforcement Learning with Experience Replay Memory. 2019 13TH IEEE INTERNATIONAL CONFERENCE ON SERVICE-ORIENTED SYSTEM ENGINEERING (SOSE) / 10TH INTERNATIONAL WORKSHOP ON JOINT CLOUD COMPUTING (JCC) / IEEE INTERNATIONAL WORKSHOP ON CLOUD COMPUTING IN ROBOTIC SYSTEMS (CCRS), 2019: 325-330
- [43] Adaptable Conservative Q-Learning for Offline Reinforcement Learning. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427: 200-212
- [46] Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [48] Adaptive Policy Learning for Offline-to-Online Reinforcement Learning. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023: 11372-11380
- [49] Mildly Conservative Q-Learning for Offline Reinforcement Learning. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022