50 entries in total
- [41] Off-Policy Deep Reinforcement Learning without Exploration. International Conference on Machine Learning, Vol. 97, 2019
- [43] Research on Off-Policy Evaluation in Reinforcement Learning: A Survey. Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (09): 1926-1945
- [44] Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective. Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023
- [45] Hyperparameter Tuning of an Off-Policy Reinforcement Learning Algorithm for H∞ Tracking Control. Learning for Dynamics and Control Conference, Vol. 211, 2023
- [48] Off-policy synchronous iteration IRL method for multi-player zero-sum games with input constraints. Neurocomputing, 2021, 378: 413-421
- [49] Distributed off-Policy Actor-Critic Reinforcement Learning with Policy Consensus. 2019 IEEE 58th Conference on Decision and Control (CDC), 2019: 4674-4679