Data-driven adaptive consensus control for heterogeneous nonlinear Multi online reinforcement

被引:0
|
作者
Ji, Xiaoqiang [1 ,2 ]
Zhang, Xicheng
Zhu, Shaoqing [1 ,2 ]
Deng, Fuqin [3 ]
Zhu, Bin [4 ]
机构
[1] Chinese Univ Hong Kong, Sch Sci & Engn, Shenzhen, Peoples R China
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Peoples R China
[3] Wuyi Univ, Sch Intelligent Mfg, Jiangmen, Peoples R China
[4] Tsinghua Univ, Dept Automat, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
Multi-agent system; Data-driven control; Actor-Critic network; Dynamic linearization; PREDICTIVE CONTROL; SYSTEMS; TRACKING; DESIGN;
D O I
10.1016/j.neucom.2024.127818
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a distributed control algorithm based on a data -driven approach is developed to solve the consensus control of heterogeneous nonlinear Multi -Agent Systems (MAS). The consensus obtained from the solution of the Hamilton-Jacobi-Bellman (HJB) equation is challenging for unknown nonlinear systems. address this issue, improved online reinforcement learning (RL) is employed to generate an approximate solution for each agent to achieve consensus. Unlike model -based RL and traditional algorithms, this method leverages I/O data to guide the learning of policies without any prior knowledge of agent dynamics. Furthermore, the adaptability of algorithm to heterogeneous nonlinear agents is enhanced by implementing online updates to the control strategy and dynamic linearization (DL). The convergence analysis of the algorithm is provided, along with the impact of the learning rate parameters on the consensus of MAS. comparing with other data -driven methods, simulations are conducted to verify the stability and adaptability of the proposed algorithm.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Data-Driven Adaptive Iterative Learning Bipartite Consensus for Heterogeneous Nonlinear Cooperation-Antagonism Networks
    Hui, Yu
    Chi, Ronghu
    Huang, Biao
    Hou, Zhongsheng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8262 - 8270
  • [2] Online adaptive data-driven control for unknown nonlinear systems with constrained-input
    Xi'an University of Architecture and Technology, Xi'an, China
    [J]. Int. Conf. Cyber-Energy Syst. Intell. Energy, ICCSIE, 1600,
  • [3] A Data-Driven Distributed Adaptive Control Approach for Nonlinear Multi-Agent Systems
    Yu, Xian
    Jin, Shangtai
    Liu, Genfeng
    Lei, Ting
    Ren, Ye
    [J]. IEEE ACCESS, 2020, 8 : 207884 - 207893
  • [4] Data-driven control of nonlinear systems: An online sequential approach
    Vu, Minh
    Huang, Yunshen
    Zeng, Shen
    [J]. Systems and Control Letters, 2024, 193
  • [5] Improved adaptive data-driven control for discrete nonlinear systems
    Dong, Na
    Wu, Ai-Guo
    Chen, Zeng-Qiang
    [J]. Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2013, 30 (10): : 1309 - 1314
  • [6] Data-Driven Nonlinear Adaptive Optimal Control of Connected Vehicles
    Gao, Weinan
    Jiang, Zhong-Ping
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT VI, 2017, 10639 : 122 - 129
  • [7] Online data-driven composite adaptive backstepping control with exact differentiators
    Pan, Yongping
    Liu, Yiqi
    Yu, Haoyong
    [J]. INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2016, 30 (05) : 779 - 789
  • [8] Underactuated MIMO Airship Control Based on Online Data-Driven Reinforcement Learning
    Boase, Derek
    Gueaieb, Wail
    Miah, Md Suruz
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9464 - 9471
  • [9] Online Data-Driven Control of Nonlinear Systems Using Semidefinite Programming
    Bozza, Augusto
    Martin, Tim
    Cavone, Graziana
    Carli, Raffaele
    Dotoli, Mariagrazia
    Allgower, Frank
    [J]. IEEE Control Systems Letters, 2024, 8 : 3189 - 3194
  • [10] Data-driven bipartite consensus control for multi-agent systems with data quantization
    Zhao H.-R.
    Peng L.
    Yu H.-N.
    Shen Y.-H.
    [J]. Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (02): : 336 - 342