Optimal Self-Learning Cooperative Control for Continuous-Time Heterogeneous Multi-Agent Systems

被引:0
|
作者
Wei Qinglai [1 ]
Liu Derong [1 ]
Song Ruizhuo [2 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
关键词
Adaptive Critic Designs; Adaptive Dynamic Programming; Approximate Dynamic Programming; Heterogeneous Multi-Agents; Graphical Games; Policy Iteration; Synchronization; DYNAMIC-PROGRAMMING ALGORITHM; OPTIMAL TRACKING CONTROL; NONLINEAR-SYSTEMS; CONTROL SCHEME; REINFORCEMENT; GAMES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an optimal self-learning cooperative control for heterogeneous multi-agent systems by iterative adaptive dynamic programming (ADP) is developed. The main idea is to design an optimal control law by policy iteration based ADP technique which makes all the agents track a given dynamics and simultaneously makes the iterative performance index function reach the Nash equilibrium. The cooperative policy iteration algorithm for graphical differential games is developed to achieve the optimal control law for the agent of each node. Convergence properties are analyzed which make the performance index functions of heterogeneous multi-agent differential graphical games converge to the Nash equilibrium. Simulation example is given to show the effectiveness of the developed optimal self-learning control scheme.
引用
收藏
页码:3005 / 3010
页数:6
相关论文
共 50 条
  • [41] Stochastic Consensus of a Class of Continuous-time Multi-agent Systems with a Leading Agent
    Zhu, Qiuguo
    Wu, Jun
    Xiong, Rong
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2016, 13
  • [42] A DDPG-based solution for optimal consensus of continuous-time linear multi-agent systems
    LI Ye
    LIU ZhongXin
    LAN Ge
    SADER Malika
    CHEN ZengQiang
    Science China Technological Sciences, 2023, (08) : 2441 - 2453
  • [43] A DDPG-based solution for optimal consensus of continuous-time linear multi-agent systems
    LI Ye
    LIU ZhongXin
    LAN Ge
    SADER Malika
    CHEN ZengQiang
    Science China(Technological Sciences), 2023, 66 (08) : 2441 - 2453
  • [44] An optimal iterative learning control for continuous-time systems
    Nasiri, Mohammad Reza
    IECON 2006 - 32ND ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS, VOLS 1-11, 2006, : 675 - 680
  • [45] Self-learning Governance of Black-Box Multi-Agent Systems
    Oesterle, Michael
    Bartelt, Christian
    Luedtke, Stefan
    Stuckenschmidt, Heiner
    COORDINATION, ORGANIZATIONS, INSTITUTIONS, NORMS, AND ETHICS FOR GOVERNANCE OF MULTI-AGENT SYSTEMS XV, 2022, 13549 : 73 - 91
  • [46] Cooperative optimal preview tracking control of discrete-time multi-agent systems
    Lu Y.-R.
    Liao F.-C.
    Ren J.-M.
    Fu H.-L.
    Sheng C.-Y.
    Liao, Fu-Cheng (fcliao@ustb.edu.cn), 2018, Science Press (40): : 241 - 251
  • [47] Collaborative Optimal Formation Control for Heterogeneous Multi-Agent Systems
    Li, Yandong
    Liu, Meichen
    Lian, Jiya
    Guo, Yuan
    ENTROPY, 2022, 24 (10)
  • [48] Optimal robust formation control for heterogeneous multi-agent systems based on reinforcement learning
    Yan, Bing
    Shi, Peng
    Lim, Cheng-Chew
    Shi, Zhiyuan
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (05) : 2683 - 2704
  • [49] Optimal containment control of continuous-time multi-agent systems with unknown disturbances using data-driven approach
    Zhinan PENG
    Jiefu ZHANG
    Jiangping HU
    Rui HUANG
    Bijoy Kumar GHOSH
    Science China(Information Sciences), 2020, 63 (10) : 270 - 272
  • [50] Optimal containment control of continuous-time multi-agent systems with unknown disturbances using data-driven approach
    Peng, Zhinan
    Zhang, Jiefu
    Hu, Jiangping
    Huang, Rui
    Ghosh, Bijoy Kumar
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (10)