Model-Based Self-Advising for Multi-Agent Learning

被引:12
|
作者
Ye, Dayong [1 ,2 ]
Zhu, Tianqing [1 ,2 ]
Zhu, Congcong [1 ,2 ]
Zhou, Wanlei [3 ]
Yu, Philip S. [4 ]
机构
[1] Univ Technol Sydney, Ctr Cyber Secur & Privacy, Ultimo, NSW 2007, Australia
[2] Univ Technol Sydney, Sch Comp Sci, Ultimo, NSW 2007, Australia
[3] City Univ Macau, Macau, Peoples R China
[4] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
基金
澳大利亚研究理事会;
关键词
Task analysis; Urban areas; Knowledge transfer; Current measurement; Computer science; Autonomous vehicles; Training; Agent advising; deep neural network; multiagent learning; NEURAL-NETWORKS;
D O I
10.1109/TNNLS.2022.3147221
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multiagent learning, one of the main ways to improve learning performance is to ask for advice from another agent. Contemporary advising methods share a common limitation that a teacher agent can only advise a student agent if the teacher has experience with an identical state. However, in highly complex learning scenarios, such as autonomous driving, it is rare for two agents to experience exactly the same state, which makes the advice less of a learning aid and more of a one-time instruction. In these scenarios, with contemporary methods, agents do not really help each other learn, and the main outcome of their back and forth requests for advice is an exorbitant communications' overhead. In human interactions, teachers are often asked for advice on what to do in situations that students are personally unfamiliar with. In these, we generally draw from similar experiences to formulate advice. This inspired us to provide agents with the same ability when asked for advice on an unfamiliar state. Hence, we propose a model-based self-advising method that allows agents to train a model based on states similar to the state in question to inform its response. As a result, the advice given can not only be used to resolve the current dilemma but also many other similar situations that the student may come across in the future via self-advising. Compared with contemporary methods, our method brings a significant improvement in learning performance with much lower communication overheads.
引用
收藏
页码:7934 / 7945
页数:12
相关论文
共 50 条
  • [1] Filtered Observations for Model-Based Multi-agent Reinforcement Learning
    Meng, Linghui
    Xiong, Xuantang
    Zang, Yifan
    Zhang, Xi
    Li, Guoqi
    Xing, Dengpeng
    Xu, Bo
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 540 - 555
  • [2] Model-based learning of interaction strategies in multi-agent systems
    Carmel, D
    Markovitch, S
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 1998, 10 (03) : 309 - 332
  • [3] Exploration Strategies for Model-based Learning in Multi-agent Systems
    Carmel D.
    Markovitch S.
    [J]. Autonomous Agents and Multi-Agent Systems, 1999, 2 (2) : 141 - 172
  • [4] Explainable Action Advising for Multi-Agent Reinforcement Learning
    Guo, Yue
    Campbell, Joseph
    Stepputtis, Simon
    Li, Ruiyu
    Hughes, Dana
    Fang, Fei
    Sycara, Katia
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 5515 - 5521
  • [5] Multi-agent based university advising system
    Attia, Mohamed
    Badawy, Osama
    Kosba, Essam
    [J]. 24th International Conference on Computer Theory and Applications, ICCTA 2014 - Proceedings, 2014, : 95 - 101
  • [6] Model-Based Reinforcement Learning using Model Mediator in Dynamic Multi-Agent Environment
    Imai S.
    Iwasawa Y.
    Matsuo Y.
    [J]. Transactions of the Japanese Society for Artificial Intelligence, 2023, 38 (05)
  • [7] Model-Based Diagnosis of Multi-Agent Systems: A Survey
    Kalech, Meir
    Natan, Avraham
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 12334 - 12341
  • [8] Multi-agent systems for model-based fault diagnosis
    Ren, X
    Hargrave, SM
    Thompson, HA
    Fleming, PJ
    [J]. NEW TECHNOLOGIES FOR COMPUTER CONTROL 2001, 2002, : 95 - 100
  • [9] Collaborative learning based on multi-agent model
    Wang, DZ
    Shen, RM
    Shen, LP
    [J]. Web-based Learning: Men & Machines, 2002, : 107 - 114
  • [10] Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation
    Sessa, Pier Giuseppe
    Kamgarpour, Maryam
    Krause, Andreas
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 19580 - 19597