共 50 条
- [41] On the Design of Estimators for Bandit Off-Policy Evaluation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [42] Data Poisoning Attacks on Off-Policy Policy Evaluation Methods UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 1264 - 1274
- [43] Off-Policy Evaluation with Policy-Dependent Optimization Response ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [46] Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, VOL 174, 2022, 174 : 397 - 410
- [47] Minimax Value Interval for Off-Policy Evaluation and Policy Optimization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [48] Conformal Off-Policy Evaluation in Markov Decision Processes 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3087 - 3094
- [49] Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 538 - 546
- [50] Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4933 - 4934