50 entries in total
- [21] Safe Off-policy Reinforcement Learning Using Barrier Functions. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, 2176 - 2181
- [22] Representations for Stable Off-Policy Reinforcement Learning. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [24] On the Reuse Bias in Off-Policy Reinforcement Learning. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, 4513 - 4521
- [25] A perspective on off-policy evaluation in reinforcement learning. Frontiers of Computer Science, 2019, 13 : 911 - 912
- [26] Off-Policy Differentiable Logic Reinforcement Learning. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 617 - 632
- [27] Marginalized Operators for Off-policy Reinforcement Learning. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 655 - 679
- [28] Off-Policy Shaping Ensembles in Reinforcement Learning. 21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 1021 - 1022
- [30] Sequential Search with Off-Policy Reinforcement Learning. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, 4006 - 4015