共 50 条
- [1] Marginalized Operators for Off-policy Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 655 - 679
- [2] Subgaussian and Differentiable Importance Sampling for Off-Policy Evaluation and Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
- [3] Conditional Importance Sampling for Off-Policy Learning [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 45 - 54
- [7] A perspective on off-policy evaluation in reinforcement learning [J]. Frontiers of Computer Science, 2019, 13 : 911 - 912
- [8] Reliable Off-Policy Evaluation for Reinforcement Learning [J]. OPERATIONS RESEARCH, 2024, 72 (02) : 699 - 716
- [9] Research on Off-Policy Evaluation in Reinforcement Learning: A Survey [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (09): : 1926 - 1945
- [10] Marginalized Importance Sampling for Off-Environment Policy Evaluation [J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229