50 items in total
- [1] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [2] Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
- [3] Representations for Stable Off-Policy Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [4] Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [5] Representations for Stable Off-Policy Reinforcement Learning [J]. 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019
- [6] Safe and efficient off-policy reinforcement learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
- [8] A perspective on off-policy evaluation in reinforcement learning [J]. Frontiers of Computer Science, 2019, 13 : 911 - 912
- [9] Reliable Off-Policy Evaluation for Reinforcement Learning [J]. OPERATIONS RESEARCH, 2024, 72 (02) : 699 - 716