A GNN-based teacher-student framework with multi-advice

被引:1
|
作者
Lei, Yunjiao [1 ]
Ye, Dayong [1 ]
Zhu, Congcong [2 ]
Shen, Sheng [3 ]
Zhou, Wanlei [2 ]
Zhu, Tianqing [2 ]
机构
[1] Univ Technol Sydney, Sch Comp Sci, Sydney, NSW 2007, Australia
[2] City Univ Macau, Fac Data Sci, Taipa 999078, Macao, Peoples R China
[3] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
基金
澳大利亚研究理事会;
关键词
Reinforcement learning; Teacher-student framework; Multi-agent; Graph neural network (GNN); Transfer learning (TL);
D O I
10.1016/j.eswa.2024.123887
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi -agent learning involves the interaction of multiple agents with the environment to learn an optimal policy. To enhance learning performance, a commonly used approach is the teacher-student framework, which enables agents to seek advice from other agents. However, the current literature suffers from a common limitation, wherein a student agent can only receive advice from a single teacher agent at each time step, thus constraining the knowledge acquired. Although some methods allow advice from multiple teachers, they require pre -training of the teachers, which is impractical when agents concurrently learn from scratch simultaneously. Additionally, the importance of the agents' connection structures is often disregarded, despite its critical role in the advice -giving process. To overcome these limitations, we propose a novel advising approach utilizing graph neural networks (GNNs). This method models the agents' connection structures, learns the weight of advice, and aggregates inputs from multiple teachers to generate refined advice. Our experiments show that this proposed approach has superior learning performance compared to advising baseline methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Academic evaluation based on the teacher-student relationship
    Ribeiro, RMS
    Ribeiro, FS
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1996, 31 (3-4) : 45480 - 45480
  • [22] ABSLearn: a GNN-based framework for aliasing and buffer-size information retrieval
    Ke Liang
    Jim Tan
    Dongrui Zeng
    Yongzhe Huang
    Xiaolei Huang
    Gang Tan
    Pattern Analysis and Applications, 2023, 26 (3) : 1171 - 1189
  • [23] Mobility-Aware MEC Planning With a GNN-Based Graph Partitioning Framework
    Liu, Jiayi
    Xu, Zhongyi
    Wang, Chen
    Liu, Xuefang
    Xie, Xuemei
    Shi, Guangming
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (04): : 4383 - 4395
  • [24] Motivation and engagement in mathematics: a qualitative framework for teacher-student interactions
    Durksen T.L.
    Way J.
    Bobis J.
    Anderson J.
    Skilling K.
    Martin A.J.
    Mathematics Education Research Journal, 2017, 29 (2) : 163 - 181
  • [25] ABSLearn: a GNN-based framework for aliasing and buffer-size information retrieval
    Liang, Ke
    Tan, Jim
    Zeng, Dongrui
    Huang, Yongzhe
    Huang, Xiaolei
    Tan, Gang
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 1171 - 1189
  • [26] A Multitask Teacher-Student Framework for Perceptual Audio Quality Assessment
    Wu, Chih-Wei
    Williams, Phillip A.
    Wolcott, William
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 396 - 400
  • [27] A federated advisory teacher-student framework with simultaneous learning agents
    Lei, Yunjiao
    Ye, Dayong
    Zhu, Tianqing
    Zhou, Wanlei
    KNOWLEDGE-BASED SYSTEMS, 2024, 305
  • [28] A GNN-Based Supervised Learning Framework for Resource Allocation in Wireless IoT Networks
    Chen, Tianrui
    Zhang, Xinruo
    You, Minglei
    Zheng, Gan
    Lambotharan, Sangarapillai
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (03) : 1712 - 1724
  • [29] Learning by reusing previous advice: a memory-based teacher–student framework
    Changxi Zhu
    Yi Cai
    Shuyue Hu
    Ho-fung Leung
    Dickson K. W. Chiu
    Autonomous Agents and Multi-Agent Systems, 2023, 37
  • [30] A Teacher-Student Markov Decision Process-based Framework for Online Correctional Learning
    Lourenco, Ines
    Winqvist, Rebecka
    Rojas, Cristian R.
    Wahlberg, Bo
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3456 - 3461