共 50 条
- [21] Provable Benefit of Multitask Representation Learning in Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [23] Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7667 - 7674
- [24] Scalable Multitask Policy Gradient Reinforcement Learning [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1847 - 1853
- [25] Evolutionary computation on multitask reinforcement learning problems [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING, AND CONTROL, VOLS 1 AND 2, 2007, : 685 - 688
- [26] Distributed Multitask Reinforcement Learning with Quadratic Convergence [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [27] Modular Multitask Reinforcement Learning with Policy Sketches [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [28] RAAM: The Benefits of Robustness in Approximating Aggregated MDPs in Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
- [29] Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [30] Reinforcement learning methods to handle actions with differing costs in MDPs [J]. KNOWLEDGE-BASED INTELLIGNET INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2003, 2774 : 553 - 560