50 entries in total
- [1] Model-Based Offline Adaptive Policy Optimization with Episodic Memory [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 50 - 62
- [2] Model-Based Offline Reinforcement Learning with Uncertainty Estimation and Policy Constraint [J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (12): 1 - 13
- [3] MOReL: Model-Based Offline Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [4] Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [5] MOPO: Model-based Offline Policy Optimization [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [6] Model-Based Reinforcement Learning via Proximal Policy Optimization [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019: 4736 - 4740
- [7] Offline Model-based Adaptable Policy Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [8] Model-Based Offline Reinforcement Learning with Local Misspecification [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023: 7423 - 7431
- [9] Offline Reinforcement Learning with Reverse Model-based Imagination [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [10] Offline Model-Based Reinforcement Learning for Tokamak Control [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211