共 50 条
- [22] Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [23] Research on Off-Policy Evaluation in Reinforcement Learning: A Survey Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (09): : 1926 - 1945
- [25] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [26] Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3647 - 3655
- [27] Safe Off-policy Reinforcement Learning Using Barrier Functions 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 2176 - 2181
- [28] Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
- [29] Rethinking Population-assisted Off-policy Reinforcement Learning PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 624 - 632