共 50 条
- [41] Quasi-Stochastic Approximation and Off-Policy Reinforcement Learning 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5244 - 5251
- [42] Distributed off-Policy Actor-Critic Reinforcement Learning with Policy Consensus 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 4674 - 4679
- [43] Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
- [44] Model-free off-policy reinforcement learning in continuous environment 2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 1091 - 1096
- [45] Research on Experience Replay of Off-policy Deep Reinforcement Learning: A Review Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (11): : 2237 - 2256
- [46] VALUE-AWARE IMPORTANCE WEIGHTING FOR OFF-POLICY REINFORCEMENT LEARNING CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 745 - 763
- [47] Re-attentive experience replay in off-policy reinforcement learning Machine Learning, 2024, 113 : 2327 - 2349
- [49] Reliability assessment of off-policy deep reinforcement learning: A benchmark for aerodynamics DATA-CENTRIC ENGINEERING, 2024, 5