Optimal Incremental-containment Control of Two-order Swarm System Based on Reinforcement Learning

被引:0
|
作者
Chen, Haipeng [1 ]
Fu, Wenxing [1 ]
Liu, Junmin [2 ]
Yu, Dengxiu [3 ]
Chen, Kang [1 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian, Peoples R China
[2] Inner Mongolia North Heavy Ind Grp Co Ltd, Baotou, Peoples R China
[3] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Backstepping; Lypunov function; optimal incremental-containment control; reinforcement learning; swarm system; MULTIAGENT SYSTEMS;
D O I
10.1007/s12555-022-0710-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the optimal incremental-containment control of two-order swarm system based on reinforcement learning (RL) is proposed to avoid the dilemma that the number of agents in a swarm system is immutable, which is essential for a swarm system that cannot meet the containment demands and need more agents to expand the containment range. Notably, the number of agents in a swarm system with a traditional containment controller is immutable, which limits the containment range that the swarm system can achieve. Besides, in traditional optimal control theory, it is obtained by solving the Hamilton-Jacobi-Bellman (HJB) equation, which is difficult to solve due to the unknown nonlinearity. To overcome these problems, several contributions are made in this paper. Firstly, in order to overcome the dilemma that the number of agents in the swarm system is immutable, the incremental-containment control is proposed. Secondly, considering the error and control input as the optimization goal, the optimal cost function is introduced and the optimal incremental-containment control is proposed to reduce resource waste and increase hardware service life. Furthermore, based on the proposed optimal incremental-containment control, the controller is designed by a new RL based on the backstepping method. The Lyapunov function is used to prove the stability of controller. The simulation show the efficiency of the proposed controller.
引用
收藏
页码:3443 / 3455
页数:13
相关论文
共 50 条
  • [1] Optimal Incremental-containment Control of Two-order Swarm System Based on Reinforcement Learning
    Haipeng Chen
    Wenxing Fu
    Junmin Liu
    Dengxiu Yu
    Kang Chen
    [J]. International Journal of Control, Automation and Systems, 2023, 21 : 3443 - 3455
  • [2] Two-order cooperative optimization of swarm control based on reinforcement learning
    Yu, Dengxiu
    Qin, Zhenhao
    Chen, Kang
    Cheong, Kang Hao
    Chen, C. L. Philip
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (01): : 125 - 136
  • [3] Heterogeneous unmanned swarm formation containment control based on reinforcement learning
    Yang, Jiaxiu
    Zhang, Hongli
    Wang, Hao
    Li, Xinkai
    Wang, Cong
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150
  • [4] Deep reinforcement learning to control an unmanned swarm system
    Liang, Hongtao
    Wang, Yaonan
    Hua, Hean
    Zhong, Hang
    Zheng, Chenghong
    Zeng, Junhao
    Liang, Jiacheng
    Li, Zhengchen
    [J]. Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2024, 46 (09): : 1521 - 1534
  • [5] Scalable and cohesive swarm control based on reinforcement learning
    Blais, Marc-Andrė
    Akhloufi, Moulay A.
    [J]. Cognitive Robotics, 2024, 4 : 88 - 103
  • [6] Optimal Containment Control of a Quadrotor Team With Active Leaders via Reinforcement Learning
    Cheng, Ming
    Liu, Hao
    Gao, Qing
    Lu, Jinhu
    Xia, Xiaohua
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (08) : 4502 - 4512
  • [7] Integral Reinforcement-Learning-Based Optimal Containment Control for Partially Unknown Nonlinear Multiagent Systems
    Wu, Qiuye
    Wu, Yongheng
    Wang, Yonghua
    [J]. ENTROPY, 2023, 25 (02)
  • [8] A New Optimal Design of Stable Feedback Control of Two-Wheel System Based on Reinforcement Learning
    Yu, Zhenghong
    Zhu, Xuebin
    [J]. SAE International journal of Sustainable Transportation, Energy, Environment, and Policy, 2023, 5 (01):
  • [9] Based on the two-order oscillation particle swarm optimization algorithm of the regional logistics demand forecasting research
    Jun, Wang
    Qiang, Huang
    Li, Zhang
    [J]. 2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 875 - 880
  • [10] Optimal Control of Two-Dimensional Roesser Model: Solution Based on Reinforcement Learning
    Ye, Linwei
    Zhao, Zhonggai
    Liu, Fei
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (08) : 5424 - 5430