共 50 条
- [11] Representation Balancing MDPs for Off-Policy Policy Evaluation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [12] Off-Policy Evaluation via the Regularized Lagrangian ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [13] Consistent On-Line Off-Policy Evaluation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [15] Offline RL Without Off-Policy Evaluation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [16] Learning Action Embeddings for Off-Policy Evaluation ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 108 - 122
- [18] A perspective on off-policy evaluation in reinforcement learning Frontiers of Computer Science, 2019, 13 : 911 - 912
- [20] Distributional Off-Policy Evaluation for Slate Recommendations THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 8265 - 8273