Multi-Agent Learning with Heterogeneous Linear Contextual Bandits

被引:0
|
作者
Anh Do [1 ]
Thanh Nguyen-Tang [1 ]
Arora, Raman [1 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As trained intelligent systems become increasingly pervasive, multi-agent learning has emerged as a popular framework for studying complex interactions between autonomous agents. Yet, a formal understanding of how and when learners in heterogeneous environments benefit from sharing their respective experiences is still in its infancy. In this paper, we seek answers to these questions in the context of linear contextual bandits. We present a novel distributed learning algorithm based on the upper confidence bound (UCB) algorithm, which we refer to as H-LINUCB, wherein agents cooperatively minimize the group regret under the coordination of a central server. In the setting where the level of heterogeneity or dissimilarity across the environments is known to the agents, we show that H-LINUCB is provably optimal in regimes where the tasks are highly similar or highly dissimilar.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Multi-agent Heterogeneous Stochastic Linear Bandits
    Ghosh, Avishek
    Sankararaman, Abishek
    Ramchandran, Kannan
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 300 - 316
  • [2] Kernel Methods for Cooperative Multi-Agent Contextual Bandits
    Dubey, Abhimanyu
    Pentland, Alex
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [3] Kernel Methods for Cooperative Multi-Agent Contextual Bandits
    Dubey, Abhimanyu
    Pentland, Alex
    [J]. 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [4] Collaborative Multi-agent Stochastic Linear Bandits
    Moradipari, Ahmadreza
    Ghavamzadeh, Mohammad
    Alizadeh, Mahnoosh
    [J]. 2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2761 - 2766
  • [5] Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits
    Chawla, Ronshee
    Vial, Daniel
    Shakkottai, Sanjay
    Srikant, R.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [6] Budget Allocation as a Multi-Agent System of Contextual & Continuous Bandits
    Han, Benjamin
    Arndt, Carl
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 2937 - 2945
  • [7] Decentralized Multi-Agent Linear Bandits with Safety Constraints
    Amani, Sanae
    Thrampoulidis, Christos
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6627 - 6635
  • [8] Distributed learning control for heterogeneous linear multi-agent networks
    Meng, Deyuan
    Zhang, Jingyao
    [J]. AUTOMATICA, 2024, 169
  • [9] Federated Linear Contextual Bandits with Heterogeneous Clients
    Blaser, Ethan
    Li, Chuanhao
    Wang, Hongning
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [10] Heterogeneous Skill Learning for Multi-agent Tasks
    Liu, Yuntao
    Li, Yuan
    Xu, Xinhai
    Dou, Yong
    Liu, Donghong
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,