Autonomous Cooperative Hunting with Rule-Based and Self-Learning Control for Multiagent Systems

被引：0

作者：

Luo, Jiaxiang ^{[1
,2
]}

Xu, Bozhe ^{[1
]}

Li, Xiangyang ^{[1
,3
]}

Yao, Zhannan ^{[1
]}

机构：

[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China

[2] Minist Educ, Engn Ctr Precis Elect Mfg Equipment, Guangzhou, Peoples R China

[3] Minist Educ, Key Lab Autonomous Syst & Networked Control, Guangzhou, Peoples R China

来源：

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS | 2024年 / 110卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Multiagent system; Cooperative control; Reinforcement learning; Imitation learning; Collision avoidance; GROUP-SIZE; PURSUIT; SUCCESS;

D O I：

10.1007/s10846-024-02177-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper considers the problem of autonomous cooperative hunting in an unknown dynamic environment, where a group of mobile agents collaborate to capture a moving target. Due to the decentralized decision-making nature of multi-agent systems and the presence of real-world constraints, it is a challenging task. To solve this problem, an artificial rule based hunting algorithm (AR-HA) is firstly developed based on the principles of attraction and repulsion with heading adjustment, and each agent is controlled by the designed rules. Then, to further enhance the stability of cooperative hunting, a self-learning algorithm based on Twin Delayed Deep Deterministic policy gradient (SL-TD3) is proposed. Each agent is governed by its own SL-TD3 controller and learns independently from its interaction with the environment, taking advantage of the reward function designed based on the control rules of AR-HA. Besides, in order to improve training efficiency, imitation learning is employed to initialize the actor network. Experiments on both virtual and real robots demonstrate the effectiveness of the proposed algorithms for autonomous cooperative hunting.

引用

页数：20

共 50 条

[31] NEURAL NETWORKS FOR SELF-LEARNING CONTROL-SYSTEMS
NGUYEN, DH
WIDROW, B
INTERNATIONAL JOURNAL OF CONTROL, 1991, 54 (06) : 1439 - 1451
[32] A self-learning human-machine cooperative control method based on driver intention recognition
Jiang, Yan
Ding, Yuyan
Zhang, Xinglong
Xu, Xin
Huang, Junwen
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (05) : 1101 - 1115
[33] Towards design of complete rule-based control systems
Ligeza, A
ARTIFICIAL INTELLIGENCE IN REAL-TIME CONTROL 1995 (AIRTC'95), 1996, : 167 - 172
[34] Optimal Self-Learning Cooperative Control for Continuous-Time Heterogeneous Multi-Agent Systems
Wei Qinglai
Liu Derong
Song Ruizhuo
2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3005 - 3010
[35] Rule-based price control for bike sharing systems
Ruch, Claudio
Warrington, Joseph
Morari, Manfred
2014 EUROPEAN CONTROL CONFERENCE (ECC), 2014, : 708 - 713
[36] Cooperative avoidance control for multiagent systems
Stipanovic, Dusan M.
Hokayem, Peter F.
Spong, Mark W.
Siljak, Dragoslav D.
JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2007, 129 (05): : 699 - 707
[37] Multiobjective Rule-Based Cooperative Continuous Ant Colony Optimized Fuzzy Systems With a Robot Control Application
Juang, Chia-Feng
Lin, Chan-Hung
Bui, Trong Bac
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (02) : 650 - 663
[38] Convex Temporal Convolutional Network-Based Distributed Cooperative Learning Control for Multiagent Systems
Chen, Shaofeng
Kang, Yu
Di, Jian
Li, Pengfei
Cao, Yang
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 5234 - 5243
[39] Effects of self-assessment on retention in rule-based learning
Hassmen, P
Hunt, DP
Dybeck, C
PERCEPTUAL AND MOTOR SKILLS, 2002, 94 (01) : 296 - 306
[40] Theoretical and practical basis for self learning rule-based sensors
Vaughan, M.M., 1600, (05):

← 1 2 3 4 5 →