共 50 条
- [1] Offline Reinforcement Learning via Policy Regularization and Ensemble Q-Functions [J]. 2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1167 - 1174
- [5] ORAD: a new framework of offline Reinforcement Learning with Q-value regularization [J]. Evolutionary Intelligence, 2024, 17 : 339 - 347
- [6] Off-policy and on-policy reinforcement learning with the Tsetlin machine [J]. Applied Intelligence, 2023, 53 : 8596 - 8613
- [7] Tabu search exploration for on-policy reinforcement learning [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 2910 - 2915
- [10] Supported Value Regularization for Offline Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,