共 50 条
- [1] Deep Deterministic Policy Gradient With Classified Experience Replay Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (07): : 1816 - 1823
- [3] Asynchronous Methods for Multi-agent Deep Deterministic Policy Gradient NEURAL INFORMATION PROCESSING (ICONIP 2018), PT II, 2018, 11302 : 711 - 721
- [4] Multi-Agent Deep Deterministic Policy Gradient Algorithm Based on Classification Experience Replay 2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 988 - 992
- [5] Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 1255 - 1262
- [8] MP-TD3: Multi-Pool Prioritized Experience Replay-Based Asynchronous Twin Delayed Deep Deterministic Policy Gradient Algorithm IEEE ACCESS, 2024, 12 : 105268 - 105280
- [10] Policy Space Noise in Deep Deterministic Policy Gradient NEURAL INFORMATION PROCESSING (ICONIP 2018), PT II, 2018, 11302 : 624 - 634