共 50 条
- [1] Optimal and Adaptive Off-policy Evaluation in Contextual Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [2] Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 733 - 741
- [3] Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [4] Off-Policy Risk Assessment in Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [5] Conformal Off-Policy Prediction in Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [6] Optimal Baseline Corrections for Off-Policy Contextual Bandits PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 722 - 732
- [7] Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [8] Off-Policy Evaluation via Off-Policy Classification ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [10] Off-policy Bandits with Deficient Support KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 965 - 975