50 items in total
- [1] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [2] Safe and efficient off-policy reinforcement learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
- [3] Bounds for Off-policy Prediction in Reinforcement Learning [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3991 - 3997
- [5] Off-Policy Reinforcement Learning with Delayed Rewards [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [7] A perspective on off-policy evaluation in reinforcement learning [J]. Frontiers of Computer Science, 2019, 13 : 911 - 912
- [8] Representations for Stable Off-Policy Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [9] On the Reuse Bias in Off-Policy Reinforcement Learning [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4513 - 4521
- [10] Reliable Off-Policy Evaluation for Reinforcement Learning [J]. OPERATIONS RESEARCH, 2024, 72 (02) : 699 - 716