Model-Based Self-Advising for Multi-Agent Learning

被引：12

作者：

Ye, Dayong ^{[1
,2
]}

Zhu, Tianqing ^{[1
,2
]}

Zhu, Congcong ^{[1
,2
]}

Zhou, Wanlei ^{[3
]}

Yu, Philip S. ^{[4
]}

机构：

[1] Univ Technol Sydney, Ctr Cyber Secur & Privacy, Ultimo, NSW 2007, Australia

[2] Univ Technol Sydney, Sch Comp Sci, Ultimo, NSW 2007, Australia

[3] City Univ Macau, Macau, Peoples R China

[4] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023年 / 34卷 / 10期

基金：

澳大利亚研究理事会;

关键词：

Task analysis; Urban areas; Knowledge transfer; Current measurement; Computer science; Autonomous vehicles; Training; Agent advising; deep neural network; multiagent learning; NEURAL-NETWORKS;

D O I：

10.1109/TNNLS.2022.3147221

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In multiagent learning, one of the main ways to improve learning performance is to ask for advice from another agent. Contemporary advising methods share a common limitation that a teacher agent can only advise a student agent if the teacher has experience with an identical state. However, in highly complex learning scenarios, such as autonomous driving, it is rare for two agents to experience exactly the same state, which makes the advice less of a learning aid and more of a one-time instruction. In these scenarios, with contemporary methods, agents do not really help each other learn, and the main outcome of their back and forth requests for advice is an exorbitant communications' overhead. In human interactions, teachers are often asked for advice on what to do in situations that students are personally unfamiliar with. In these, we generally draw from similar experiences to formulate advice. This inspired us to provide agents with the same ability when asked for advice on an unfamiliar state. Hence, we propose a model-based self-advising method that allows agents to train a model based on states similar to the state in question to inform its response. As a result, the advice given can not only be used to resolve the current dilemma but also many other similar situations that the student may come across in the future via self-advising. Compared with contemporary methods, our method brings a significant improvement in learning performance with much lower communication overheads.

引用

页码：7934 / 7945

页数：12

共 50 条

[21] Multi-agent learning model with bargaining
Qiao, Haiyan
Rozenblit, Jerzy
Szidarovszky, Ferenc
Yang, Lizhi
PROCEEDINGS OF THE 2006 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2006, : 934 - +
[22] An integrated fuzzy and learning approach to performance improvement of model-based multi-agent robotic control systems
Yang, Erfu
Gu, Dongbing
2007 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS I-V, CONFERENCE PROCEEDINGS, 2007, : 1417 - 1422
[23] MULTI-MODEL FEDERATED LEARNING OPTIMIZATION BASED ON MULTI-AGENT REINFORCEMENT LEARNING
Atapour, S. Kaveh
Seyedmohammadi, S. Jamal
Sheikholeslami, S. Mohammad
Abouei, Jamshid
Mohammadi, Arash
Plataniotis, Konstantinos N.
2023 IEEE 9TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING, CAMSAP, 2023, : 151 - 155
[24] A multi-agent learning model based on dynamic fuzzy logic
Xie, LP
Li, FZ
2005 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2005, : 310 - 313
[25] Multi-agent crowdsourcing model based on Q-learning
Fang, Xin
Guo, Yongan
2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2019,
[26] Model-based event/self-triggered fixed-time consensus of nonlinear multi-agent systems
Lv, Mingbiao
Gao, Jinfeng
Liu, Peter X.
Zhang, Yuqing
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2024, 238 (04) : 744 - 754
[27] Model-based development of a multi-agent system for controlling material flow systems
Fischer, Juliane
Marcos, Marga
Vogel-Heuser, Birgit
AT-AUTOMATISIERUNGSTECHNIK, 2018, 66 (05) : 438 - 448
[28] A multi-agent system for model-based real-time fault diagnosis
Elektrotech Informationstech E&I, 1 (06):
[29] Adaptive multi-agent smart academic advising framework
Abdelhamid, Abdelaziz A.
Alotaibi, Sultan R.
IET SOFTWARE, 2021, 15 (05) : 293 - 307
[30] HiSOMA: A hierarchical multi-agent model integrating self-organizing neural networks with multi-agent deep reinforcement learning
Geng, Minghong
Pateria, Shubham
Subagdja, Budhitama
Tan, Ah-Hwee
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252

← 1 2 3 4 5 →