Model-Based Self-Advising for Multi-Agent Learning

被引:12
|
作者
Ye, Dayong [1 ,2 ]
Zhu, Tianqing [1 ,2 ]
Zhu, Congcong [1 ,2 ]
Zhou, Wanlei [3 ]
Yu, Philip S. [4 ]
机构
[1] Univ Technol Sydney, Ctr Cyber Secur & Privacy, Ultimo, NSW 2007, Australia
[2] Univ Technol Sydney, Sch Comp Sci, Ultimo, NSW 2007, Australia
[3] City Univ Macau, Macau, Peoples R China
[4] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
基金
澳大利亚研究理事会;
关键词
Task analysis; Urban areas; Knowledge transfer; Current measurement; Computer science; Autonomous vehicles; Training; Agent advising; deep neural network; multiagent learning; NEURAL-NETWORKS;
D O I
10.1109/TNNLS.2022.3147221
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multiagent learning, one of the main ways to improve learning performance is to ask for advice from another agent. Contemporary advising methods share a common limitation that a teacher agent can only advise a student agent if the teacher has experience with an identical state. However, in highly complex learning scenarios, such as autonomous driving, it is rare for two agents to experience exactly the same state, which makes the advice less of a learning aid and more of a one-time instruction. In these scenarios, with contemporary methods, agents do not really help each other learn, and the main outcome of their back and forth requests for advice is an exorbitant communications' overhead. In human interactions, teachers are often asked for advice on what to do in situations that students are personally unfamiliar with. In these, we generally draw from similar experiences to formulate advice. This inspired us to provide agents with the same ability when asked for advice on an unfamiliar state. Hence, we propose a model-based self-advising method that allows agents to train a model based on states similar to the state in question to inform its response. As a result, the advice given can not only be used to resolve the current dilemma but also many other similar situations that the student may come across in the future via self-advising. Compared with contemporary methods, our method brings a significant improvement in learning performance with much lower communication overheads.
引用
收藏
页码:7934 / 7945
页数:12
相关论文
共 50 条
  • [21] Multi-agent learning model with bargaining
    Qiao, Haiyan
    Rozenblit, Jerzy
    Szidarovszky, Ferenc
    Yang, Lizhi
    PROCEEDINGS OF THE 2006 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2006, : 934 - +
  • [22] An integrated fuzzy and learning approach to performance improvement of model-based multi-agent robotic control systems
    Yang, Erfu
    Gu, Dongbing
    2007 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS I-V, CONFERENCE PROCEEDINGS, 2007, : 1417 - 1422
  • [23] MULTI-MODEL FEDERATED LEARNING OPTIMIZATION BASED ON MULTI-AGENT REINFORCEMENT LEARNING
    Atapour, S. Kaveh
    Seyedmohammadi, S. Jamal
    Sheikholeslami, S. Mohammad
    Abouei, Jamshid
    Mohammadi, Arash
    Plataniotis, Konstantinos N.
    2023 IEEE 9TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING, CAMSAP, 2023, : 151 - 155
  • [24] A multi-agent learning model based on dynamic fuzzy logic
    Xie, LP
    Li, FZ
    2005 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2005, : 310 - 313
  • [25] Multi-agent crowdsourcing model based on Q-learning
    Fang, Xin
    Guo, Yongan
    2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2019,
  • [26] Model-based event/self-triggered fixed-time consensus of nonlinear multi-agent systems
    Lv, Mingbiao
    Gao, Jinfeng
    Liu, Peter X.
    Zhang, Yuqing
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2024, 238 (04) : 744 - 754
  • [27] Model-based development of a multi-agent system for controlling material flow systems
    Fischer, Juliane
    Marcos, Marga
    Vogel-Heuser, Birgit
    AT-AUTOMATISIERUNGSTECHNIK, 2018, 66 (05) : 438 - 448
  • [28] A multi-agent system for model-based real-time fault diagnosis
    Elektrotech Informationstech E&I, 1 (06):
  • [29] Adaptive multi-agent smart academic advising framework
    Abdelhamid, Abdelaziz A.
    Alotaibi, Sultan R.
    IET SOFTWARE, 2021, 15 (05) : 293 - 307
  • [30] HiSOMA: A hierarchical multi-agent model integrating self-organizing neural networks with multi-agent deep reinforcement learning
    Geng, Minghong
    Pateria, Shubham
    Subagdja, Budhitama
    Tan, Ah-Hwee
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252