共 50 条
- [1] Optimal Baseline Corrections for Off-Policy Contextual Bandits PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 722 - 732
- [2] Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 2125 - 2135
- [3] Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 733 - 741
- [4] Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [5] Off-Policy Risk Assessment in Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [6] Conformal Off-Policy Prediction in Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [7] Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [9] Off-policy Bandits with Deficient Support KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 965 - 975