共 50 条
- [31] Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, VOL 174, 2022, 174 : 397 - 410
- [32] Minimax Value Interval for Off-Policy Evaluation and Policy Optimization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [33] Optimal and Adaptive Off-policy Evaluation in Contextual Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [34] Conformal Off-Policy Evaluation in Markov Decision Processes 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3087 - 3094
- [35] Balanced Off-Policy Evaluation in General Action Spaces INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
- [36] More Robust Doubly Robust Off-policy Evaluation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
- [37] Combining Parametric and Nonparametric Models for Off-Policy Evaluation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [38] Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [39] Accountable Off-Policy Evaluation With Kernel Bellman Statistics 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
- [40] Research on Off-Policy Evaluation in Reinforcement Learning: A Survey Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (09): : 1926 - 1945