Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm

被引:0
|
作者
Li, Ning [1 ,3 ,4 ]
Tang, Jichuan [1 ,2 ]
Li, Zhong-Xian [1 ,3 ,4 ]
Gao, Xiuyu [5 ]
机构
[1] Tianjin Univ, Sch Civil Engn, Tianjin 300350, Peoples R China
[2] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore, Singapore
[3] Tianjin Univ, Minist Educ, Key Lab Coast Civil Struct Safety, Tianjin, Peoples R China
[4] Tianjin Univ, Key Lab Earthquake Engn Simulat & Seism Resilienc, China Earthquake Adm, Tianjin, Peoples R China
[5] MTS Corp, Eden Prairie, MN USA
来源
基金
国家重点研发计划;
关键词
deep deterministic policy gradient algorithm; hybrid control; real-time hybrid simulation; reinforcement learning; underwater shaking table; ACTUATOR CONTROL; COMPENSATION; PERFORMANCE;
D O I
10.1002/stc.3035
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The tracking performance of an actuation transfer system in a real-time hybrid simulation (RTHS) frequently faces accuracy and robustness challenges under constraints and complicated environments with uncertainties. This study proposes a novel control approach based on the deep deterministic policy gradient algorithm in reinforcement learning (RL) combined with feedforward (FF) compensation, which emphasizes the implementation of shaking table control and substructure RTHS. The proposed method first describes the control plant within the RL environment. Then, the agent is trained offline to develop optimized control policies for interaction with the environment. A series of validation tests were conducted to assess the performance of the proposed method, starting with the dynamic testing of underwater shaking table control and then a virtual RTHS benchmark problem. For complex systems, such as controlling the underwater shaking table, the proposed algorithm, FF, and adaptive time series (ATS) compensation methods are compared under various water depths and motions. The results show better performance and wider broadband frequency applicability under different shaking table dynamic-coupling effects. Next, a controller based on the proposed method was designed by extending the virtual RTHS via the configuration of the control plant and substructure division, as provided in the RTHS benchmark problem. The proposed RL controller also improved the tracking accuracy and robustness of conventional FF compensators against unmodeled dynamics and perturbation uncertainties. This controller can be extended to further advanced control strategies as a component of model-based control methods.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm
    Li, Ning
    Tang, Jichuan
    Li, Zhong-Xian
    Gao, Xiuyu
    [J]. Structural Control and Health Monitoring, 2022, 29 (10)
  • [2] Reinforcement Learning Control with Deep Deterministic Policy Gradient Algorithm for Multivariable pH Process
    Panjapornpon, Chanin
    Chinchalongporn, Patcharapol
    Bardeeniz, Santi
    Makkayatorn, Ratthanita
    Wongpunnawat, Witchaya
    [J]. PROCESSES, 2022, 10 (12)
  • [3] Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm
    Wu, Junta
    Li, Huiyun
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [4] Real-Time Visual Object Tracking Based on Reinforcement Learning with Twin Delayed Deep Deterministic Algorithm
    Zheng, Shengjie
    Wang, Huan
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 165 - 177
  • [5] Robot grasping method optimization using improved deep deterministic policy gradient algorithm of deep reinforcement learning
    Zhang, Hongxu
    Wang, Fei
    Wang, Jianhui
    Cui, Ben
    [J]. REVIEW OF SCIENTIFIC INSTRUMENTS, 2021, 92 (02):
  • [6] A Self-Adaptive Vibration Reduction Method Based on Deep Deterministic Policy Gradient (DDPG) Reinforcement Learning Algorithm
    Jin, Xin
    Ma, Hongbao
    Kang, Yihua
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [7] Real-time planning and collision avoidance control method based on deep reinforcement learning
    Xu, Xinli
    Cai, Peng
    Cao, Yunlong
    Chu, Zhenzhong
    Zhu, Wenbo
    Zhang, Weidong
    [J]. OCEAN ENGINEERING, 2023, 281
  • [8] Real-time energy management of a smart home based on deep deterministic policy gradient
    Zenginis, Ioannis
    Vardakas, John
    Ramantas, Kostas
    Verikoukis, Christos
    [J]. 2021 21ST IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2021 5TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC/I&CPS EUROPE), 2021,
  • [9] Deterministic Policy Gradient based Reinforcement Learning for Current Control of Hybrid Active Power Filter
    Gong, Cheng
    Leong, Chio-Hong
    Lam, Chi-Seng
    [J]. 2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [10] A Method of Attitude Control Based on Deep Deterministic Policy Gradient
    Zhang, Jian
    Wu, Fengge
    Zhao, Junsuo
    Xu, Fanjiang
    [J]. COGNITIVE SYSTEMS AND SIGNAL PROCESSING, PT II, 2019, 1006 : 197 - 207