共 50 条
- [42] Mutual Information Regularized Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [43] Revisiting the Minimalist Approach to Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [44] Bellman Residual Orthogonalization for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [46] Supported Value Regularization for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [47] Supported Policy Optimization for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [48] Offline Reinforcement Learning for Automated Stock Trading IEEE ACCESS, 2023, 11 : 112577 - 112589
- [49] On the Role of Discount Factor in Offline Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [50] Offline Evaluation of Online Reinforcement Learning Algorithms THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1926 - 1933