共 50 条
- [1] Offline Reinforcement Learning with On-Policy Q-Function Regularization MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 455 - 471
- [3] Offline Reinforcement Learning as Anti-exploration THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8106 - 8114
- [4] Learning Optimal Q-Function Using Deep Boltzmann Machine for Reliable Trading of Cryptocurrency INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I, 2018, 11314 : 468 - 480
- [5] Adaptable Conservative Q-Learning for Offline Reinforcement Learning PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 200 - 212
- [6] Mildly Conservative Q-Learning for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [9] Learning Q-Function Approximations for Hybrid Control Problems IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 1364 - 1369
- [10] Reinforcement Learning with an Ensemble of Binary Action Deep Q-Networks Computer Systems Science and Engineering, 2023, 46 (03): : 2651 - 2666