50 entries in total
- [1] Skill Reward for Safe Deep Reinforcement Learning [J]. UBIQUITOUS SECURITY, 2022, 1557 : 203 - 213
- [2] Reward Redistribution for Reinforcement Learning of Dynamic Nonprehensile Manipulation [J]. 2021 7TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2021 : 326 - 331
- [5] Combined Optimization and Reinforcement Learning for Manipulation Skills [J]. ROBOTICS: SCIENCE AND SYSTEMS XII, 2016
- [6] Reward of Reinforcement Learning of Test Optimization for Continuous Integration [J]. Ruan Jian Xue Bao/Journal of Software, 2019, 30 (05) : 1438 - 1449
- [7] Convergent Policy Optimization for Safe Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [8] Reducing Safety Interventions in Provably Safe Reinforcement Learning [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023 : 7515 - 7522
- [9] Full Gradient Deep Reinforcement Learning for Average-Reward Criterion [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211