50 items in total
- [41] Quasi-Stochastic Approximation and Off-Policy Reinforcement Learning [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019 : 5244 - 5251
- [42] Distributed Off-Policy Actor-Critic Reinforcement Learning with Policy Consensus [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019 : 4674 - 4679
- [43] Research on Experience Replay of Off-policy Deep Reinforcement Learning: A Review [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (11) : 2237 - 2256
- [44] Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
- [45] Model-free off-policy reinforcement learning in continuous environment [J]. 2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004 : 1091 - 1096
- [46] Re-attentive experience replay in off-policy reinforcement learning [J]. Machine Learning, 2024, 113 : 2327 - 2349
- [47] Value-Aware Importance Weighting for Off-Policy Reinforcement Learning [J]. CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 745 - 763
- [48] Re-attentive experience replay in off-policy reinforcement learning [J]. MACHINE LEARNING, 2024, 113 (05) : 2327 - 2349
- [49] Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions [J]. 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019
- [50] Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32