50 entries in total
- [41] Debiased Off-Policy Evaluation for Recommendation Systems. 15th ACM Conference on Recommender Systems (RecSys 2021), 2021: 372-379.
- [42] Off-Policy Evaluation in Partially Observable Environments. Thirty-Fourth AAAI Conference on Artificial Intelligence, the Thirty-Second Innovative Applications of Artificial Intelligence Conference and the Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, 2020, 34: 10276-10283.
- [43] On the Design of Estimators for Bandit Off-Policy Evaluation. International Conference on Machine Learning, Vol. 97, 2019.
- [44] Off-Policy Evaluation with Policy-Dependent Optimization Response. Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 2022.
- [45] Bounded Off-Policy Evaluation with Missing Data for Course Recommendation and Curriculum Design. International Conference on Machine Learning, Vol. 48, 2016.
- [46] Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments. International Conference on Artificial Intelligence and Statistics, Vol. 206, 2023.
- [47] Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation. Conference on Health, Inference, and Learning, Vol. 174, 2022: 397-410.
- [48] Minimax Value Interval for Off-Policy Evaluation and Policy Optimization. Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020.
- [49] Interpretable Off-Policy Learning via Hyperbox Search. International Conference on Machine Learning, Vol. 162, 2022.
- [50] Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation. Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021.