共 50 条
- [2] Reliable Off-Policy Evaluation for Reinforcement Learning [J]. OPERATIONS RESEARCH, 2024, 72 (02) : 699 - 716
- [3] Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [4] Research on Off-Policy Evaluation in Reinforcement Learning: A Survey [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (09): : 1926 - 1945
- [5] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [6] Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
- [7] Off-policy evaluation for tabular reinforcement learning with synthetic trajectories [J]. Statistics and Computing, 2024, 34
- [8] Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [10] Doubly Robust Off-policy Value Evaluation for Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48