Adaptive Optimal Output-Feedback Consensus Tracking Control of Nonlinear Multiagent Systems Using Two-Player Stackelberg Game

被引:0
|
作者
Yan, Lei [1 ,2 ]
Liu, Junhe [1 ]
Lai, Guanyu [1 ]
Chen, C. L. Philip [3 ]
Wu, Zongze [1 ]
Liu, Zhi [1 ]
机构
[1] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
[2] Nanyang Inst Technol, Sch Intelligent Mfg, Nanyang 473004, Henan, Peoples R China
[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Observers; Games; Decision making; Reinforcement learning; Optimal control; Numerical simulation; Multi-agent systems; Adaptive optimal consensus; integral reinforcement learning (IRL); output feedback; stackelberg game; CONTROL DESIGN; INPUT; PERFORMANCE;
D O I
10.1109/TSMC.2024.3404147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article investigates the adaptive optimal output-feedback consensus tracking problem for nonlinear multiagent systems (MASs). Although adaptive optimal output-feedback control schemes for nonlinear systems have been developed recently, most results do not consider the two-way interaction between the state observer and its associated subsystem. To address this issue, we formulate the state-observer and the subsystem as a two-player Stackelberg game framework, where the state-observer acts as the follower-player and the subsystem acts as the leader-player. Such a framework helps us to reveal the two-way interaction between the subobserver and the subsystem. Based on this, we design the optimal auxiliary input of the state-observer and the optimal subsystem controller. We implement the optimal policy pair using integral reinforcement learning (IRL) and adaptive critic learning, which provides a critic-only structure. We prove that the Stackelberg-Nash equilibrium is reached and that the closed-loop signals are ultimately uniformly bounded (UUB). We demonstrate the effectiveness of the proposed scheme using a numerical simulation example.
引用
收藏
页码:5377 / 5387
页数:11
相关论文
共 50 条
  • [1] Adaptive Fuzzy Output-Feedback Consensus Tracking Control of Nonlinear Multiagent Systems in Prescribed Performance
    Zhang, Lili
    Che, Wei-Wei
    Chen, Bing
    Lin, Chong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1932 - 1943
  • [2] Two-player Ad hoc Output-feedback Cumulant Game Control
    Aduba, Chukwuemeka
    Won, Chang-Hee
    [J]. ICIMCO 2015 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL. 1, 2015, : 53 - 59
  • [3] Optimal Control of Two-Player Systems With Output Feedback
    Lessard, Laurent
    Lall, Sanjay
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (08) : 2129 - 2144
  • [4] Adaptive output-feedback tracking control for uncertain nonlinear systems
    Ruan, RY
    Pan, RL
    Yang, CL
    [J]. 2004 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-3, 2004, : 24 - 29
  • [5] Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games
    Zhao, Jun
    Lv, Yongfeng
    Zhao, Ziliang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1437 - 1441
  • [6] Practical Predefined-Time Output-Feedback Consensus Tracking Control for Multiagent Systems
    Zou, An-Min
    Liu, Yangyang
    Hou, Zeng-Guang
    Hu, Zhipei
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (08) : 5311 - 5322
  • [7] Adaptive Output-Feedback Fuzzy Tracking Control for a Class of Nonlinear Systems
    Zhou, Qi
    Shi, Peng
    Lu, Jinjun
    Xu, Shengyuan
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2011, 19 (05) : 972 - 982
  • [8] Adaptive output-feedback tracking of stochastic nonlinear systems
    Ji, HB
    Xi, HS
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (02) : 355 - 360
  • [9] Adaptive output feedback optimal tracking control for nonlinear systems
    [J]. Sun, Xiuxia, 1600, Xi'an Jiaotong University (51):
  • [10] Distributed adaptive consensus control of nonlinear output-feedback systems on directed graphs
    Ding, Zhengtao
    Li, Zhongkui
    [J]. AUTOMATICA, 2016, 72 : 46 - 52