共 50 条
- [31] Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4933 - 4934
- [32] Balanced Off-Policy Evaluation in General Action Spaces INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
- [33] More Robust Doubly Robust Off-policy Evaluation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
- [34] Combining Parametric and Nonparametric Models for Off-Policy Evaluation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [35] Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [36] Accountable Off-Policy Evaluation With Kernel Bellman Statistics 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
- [37] Research on Off-Policy Evaluation in Reinforcement Learning: A Survey Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (09): : 1926 - 1945
- [38] Accountable Off-Policy Evaluation With Kernel Bellman Statistics INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [39] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [40] Off-Policy Proximal Policy Optimization THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9162 - 9170