共 50 条
- [1] Implicit and Explicit Policy Constraints for Offline Reinforcement Learning [J]. CAUSAL LEARNING AND REASONING, VOL 236, 2024, 236 : 499 - 513
- [2] Supported Policy Optimization for Offline Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [3] Constrained Offline Policy Optimization [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [4] Constrained Variational Policy Optimization for Safe Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [6] Learning Behavior of Offline Reinforcement Learning Agents [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
- [9] Density Constrained Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [10] BiES: Adaptive Policy Optimization for Model-Based Offline Reinforcement Learning [J]. AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13151 : 570 - 581