Model-Based Self-Advising for Multi-Agent Learning

Cited by: 12

Authors:

Ye, Dayong [1,2]

Zhu, Tianqing [1,2]

Zhu, Congcong [1,2]

Zhou, Wanlei [3]

Yu, Philip S. [4]

Affiliations:

[1] Univ Technol Sydney, Ctr Cyber Secur & Privacy, Ultimo, NSW 2007, Australia

[2] Univ Technol Sydney, Sch Comp Sci, Ultimo, NSW 2007, Australia

[3] City Univ Macau, Macau, Peoples R China

[4] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023, Vol. 34, No. 10

Funding:

Australian Research Council

Keywords:

Task analysis; Urban areas; Knowledge transfer; Current measurement; Computer science; Autonomous vehicles; Training; Agent advising; Deep neural network; Multiagent learning; Neural networks

DOI:

10.1109/TNNLS.2022.3147221

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

In multiagent learning, one of the main ways to improve learning performance is to ask another agent for advice. Contemporary advising methods share a common limitation: a teacher agent can only advise a student agent if the teacher has experience with an identical state. However, in highly complex learning scenarios, such as autonomous driving, it is rare for two agents to experience exactly the same state, which makes the advice less of a learning aid and more of a one-time instruction. In these scenarios, with contemporary methods, agents do not really help each other learn, and the main outcome of their back-and-forth requests for advice is an exorbitant communication overhead. In human interactions, teachers are often asked for advice on what to do in situations that students are personally unfamiliar with. In these cases, we generally draw on similar experiences to formulate advice. This inspired us to provide agents with the same ability when asked for advice on an unfamiliar state. Hence, we propose a model-based self-advising method that allows an agent to train a model based on states similar to the state in question to inform its response. As a result, the advice given can be used to resolve not only the current dilemma but also, via self-advising, many other similar situations that the student may come across in the future. Compared with contemporary methods, our method brings a significant improvement in learning performance with much lower communication overhead.
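The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (the abstract and keywords point to a deep neural network model); it swaps in a distance-weighted nearest-neighbor vote as the "model", purely to show the flow: the teacher builds a small model from its states most similar to the query, and the student reuses that model for later similar states instead of asking again. All names here (`LocalAdviceModel`, `Teacher`, `Student`, `k`, `radius`) and the toy driving data are hypothetical.

```python
import math
from collections import defaultdict


def distance(a, b):
    """Euclidean distance between two state vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class LocalAdviceModel:
    """Hypothetical stand-in for the trained model: a distance-weighted
    vote over the teacher's states that were similar to the query."""

    def __init__(self, samples, radius):
        self.samples = samples  # list of (state, action) pairs
        self.radius = radius    # how far from a sample the model is trusted

    def covers(self, state):
        # The model is considered applicable near any of its samples.
        return any(distance(state, s) <= self.radius for s, _ in self.samples)

    def advise(self, state):
        votes = defaultdict(float)
        for s, action in self.samples:
            votes[action] += 1.0 / (1e-6 + distance(state, s))
        return max(votes, key=votes.get)


class Teacher:
    def __init__(self, experience):
        self.experience = experience  # list of (state, action) pairs

    def build_model(self, query, k=3, radius=1.0):
        # Assumption: "similar states" means the k nearest by distance.
        nearest = sorted(self.experience,
                         key=lambda e: distance(query, e[0]))[:k]
        return LocalAdviceModel(nearest, radius)


class Student:
    def __init__(self, teacher):
        self.teacher = teacher
        self.models = []    # models received so far
        self.requests = 0   # number of advice requests sent to the teacher

    def act(self, state):
        # Self-advising: reuse a stored model if one covers this state.
        for model in self.models:
            if model.covers(state):
                return model.advise(state)
        # Otherwise ask the teacher once and keep the returned model.
        self.requests += 1
        model = self.teacher.build_model(state)
        self.models.append(model)
        return model.advise(state)


# Toy data: two clusters of teacher experience with different actions.
experience = [
    ((0.0, 0.0), "brake"), ((0.5, 0.0), "brake"), ((0.0, 0.5), "brake"),
    ((5.0, 5.0), "accelerate"), ((5.5, 5.0), "accelerate"),
    ((5.0, 5.5), "accelerate"),
]
student = Student(Teacher(experience))
for state in [(0.2, 0.2), (0.3, 0.1), (5.1, 5.1)]:
    print(state, student.act(state), "requests so far:", student.requests)
```

In this sketch the second query is answered from the stored model, so the student contacts the teacher only when it enters a region no stored model covers. That is the source of the lower communication overhead the abstract describes, under the assumptions stated above.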

Pages: 7934-7945

Page count: 12
