50 items in total
- [22] Flexible Data Augmentation in Off-Policy Reinforcement Learning. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2021), PT I, 2021, 12854: 224-235
- [23] Off-Policy Deep Reinforcement Learning without Exploration. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [25] Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [26] Online Off-Policy Reinforcement Learning for Optimal Control of Unknown Nonlinear Systems Using Neural Networks. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (08): 5112-5122
- [28] Research on Off-Policy Evaluation in Reinforcement Learning: A Survey. Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (09): 1926-1945
- [29] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [30] Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019: 3647-3655