Safety reinforcement learning control via transfer learning

被引:1
|
作者
Zhang, Quanqi [1 ]
Wu, Chengwei [1 ]
Tian, Haoyu [1 ]
Gao, Yabin [1 ]
Yao, Weiran [1 ]
Wu, Ligang [1 ]
机构
[1] Harbin Inst Technol, Dept Control Sci & Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning control; Safety; Stability; Transfer learning; LYAPUNOV FUNCTIONS;
D O I
10.1016/j.automatica.2024.111714
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) has emerged as a promising approach for modern control systems. However, its success in real-world applications has been limited due to the lack of safety guarantees. To address this issue, the authors present a novel transfer learning framework that facilitates policy training in a non-dangerous environment, followed by transfer of the trained policy to the original dangerous environment. The transferred policy is theoretically proven to stabilize the original system while maintaining safety. Additionally, we propose an uncertainty learning algorithm incorporated in RL that overcomes natural data cascading and data evolution problems in RL to enhance learning accuracy. The transfer learning framework avoids trial-and-error in unsafe environments, ensuring not only after-learning safety but, more importantly, addressing the challenging problem of safe exploration during learning. Simulation results demonstrate the promise of the transfer learning framework for RL safety control on the task of vehicle lateral stability control with safety constraints. (c) 2024 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Joint Space Control via Deep Reinforcement Learning
    Kumar, Visak
    Hoeller, David
    Sundaralingam, Balakumar
    Tremblay, Jonathan
    Birchfield, Stan
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3619 - 3626
  • [22] Online reinforcement learning control via discontinuous gradient
    Arellano-Muro, Carlos A.
    Castillo-Toledo, Bernardino
    Di Gennaro, Stefano
    Loukianov, Alexander G.
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2024, 38 (05) : 1762 - 1776
  • [23] Motion control for laser machining via reinforcement learning
    Xie, Yunhui
    Praeger, Matthew
    Grant-Jacob, James A.
    Eason, Robert W.
    Mills, Ben
    OPTICS EXPRESS, 2022, 30 (12) : 20963 - 20979
  • [24] Safe HVAC Control via Batch Reinforcement Learning
    Liu, Hsin-Yu
    Balaji, Bharathan
    Gao, Sicun
    Gupta, Rajesh
    Hong, Dezhi
    2022 13TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2022), 2022, : 181 - 192
  • [25] Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
    Gamrian, Shani
    Goldberg, Yoav
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [26] Model-based Reinforcement Learning with Provable Safety Guarantees via Control Barrier Functions
    Zhang, Hongchao
    Li, Zhouchi
    Clark, Andrew
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 792 - 798
  • [27] Lateral Transfer Learning for Multiagent Reinforcement Learning
    Shi, Haobin
    Li, Jingchen
    Mao, Jiahui
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1699 - 1711
  • [28] Transfer Learning in Deep Reinforcement Learning: A Survey
    Zhu, Zhuangdi
    Lin, Kaixiang
    Jain, Anil K.
    Zhou, Jiayu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13344 - 13362
  • [29] Visual Transfer for Reinforcement Learning via Wasserstein Domain Confusion
    Roy, Josh
    Konidaris, George
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9454 - 9462
  • [30] Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
    Yang, Tianpei
    Hao, Jianye
    Meng, Zhaopeng
    Zhang, Zongzhang
    Hu, Yujing
    Chen, Yingfeng
    Fan, Changjie
    Wang, Weixun
    Liu, Wulong
    Wang, Zhaodong
    Peng, Jiajie
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3094 - 3100