共 50 条
- [1] MOPO: Model-based Offline Policy Optimization [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [2] Model-based Policy Optimization with Unsupervised Model Adaptation [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [3] Parallel-mentoring for Offline Model-based Optimization [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [4] COMBO: Conservative Offline Model-Based Policy Optimization [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [6] Model-Based Offline Adaptive Policy Optimization with Episodic Memory [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 50 - 62
- [7] Conservative Objective Models for Effective Offline Model-Based Optimization [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7368 - 7378
- [8] ROMO: Retrieval-enhanced Offline Model-based Optimization [J]. 2023 5TH INTERNATIONAL CONFERENCE ON DISTRIBUTED ARTIFICIAL INTELLIGENCE, DAI 2023, 2023,
- [9] Model-Based Offline Policy Optimization with Distribution Correcting Regularization [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 174 - 189
- [10] Bidirectional Learning for Offline Infinite-width Model-based Optimization [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,