Kernel Methods for Cooperative Multi-Agent Contextual Bandits

被引:0
|
作者
Dubey, Abhimanyu [1 ,2 ]
Pentland, Alex [1 ,2 ]
机构
[1] MIT, Media Lab, Cambridge, MA 02139 USA
[2] MIT, Inst Data Syst & Soc, Cambridge, MA 02139 USA
关键词
MULTIARMED BANDIT;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cooperative multi-agent decision making involves a group of agents cooperatively solving learning problems while communicating over a network with delays. In this paper, we consider the kernelised contextual bandit problem, where the reward obtained by an agent is an arbitrary linear function of the contexts' images in the related reproducing kernel Hilbert space (RKHS), and a group of agents must cooperate to collectively solve their unique decision problems. For this problem, we propose COOP-KERNELUCB, an algorithm that provides near-optimal bounds on the per-agent regret, and is both computationally and communicatively efficient. For special cases of the cooperative problem, we also provide variants of COOP-KERNELUCB that provides optimal peragent regret. In addition, our algorithm generalizes several existing results in the multi-agent bandit setting. Finally, on a series of both synthetic and real-world multi-agent network benchmarks, we demonstrate that our algorithm significantly outperforms existing benchmarks.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Multi-agent service infrastructure for cooperative management
    Ray, P
    Paramesh, N
    GLOBECOM'02: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-3, CONFERENCE RECORDS: THE WORLD CONVERGES, 2002, : 2999 - 3003
  • [42] Modular Cooperative Tasking for Multi-agent Systems
    Karimadini, Mohammad
    Karimoddini, Ali
    Lin, Hai
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2018, : 618 - 623
  • [43] Cooperative multi-agent mapping and exploration in Webots®
    Scott, Adele F.
    Yu, Changbin
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOTS AND AGENTS, 2009, : 237 - 242
  • [44] FMAP: Distributed cooperative multi-agent planning
    Alejandro Torreño
    Eva Onaindia
    Óscar Sapena
    Applied Intelligence, 2014, 41 : 606 - 626
  • [45] Cooperative negotiation strategy in multi-agent system
    Tian, YJ
    Liu, Y
    Shimohara, K
    Sawaragi, T
    42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 2549 - 2554
  • [46] Cooperative Output Regulation of Multi-Agent Systems
    Huang, Jie
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 1 - 5
  • [47] A Cooperative Switching Algorithm for Multi-Agent Foraging
    Zedadra, Ouarda
    Seridi, Hamid
    Jouandeau, Nicolas
    Fortino, Giancarlo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 50 : 302 - 319
  • [48] Multi-agent robot cooperative assembly system
    Wang, Yuechao
    Tan, Dalong
    Huang, Shan
    Luan, Tian
    Zhao, Yiwen
    Ruan Jian Xue Bao/Journal of Software, 1998, 9 (06): : 6 - 10
  • [49] Multi-Agent Tensor Fusion for Contextual Trajectory Prediction
    Zhao, Tianyang
    Xu, Yifei
    Monfort, Mathew
    Choi, Wongun
    Baker, Chris
    Zhao, Yibiao
    Wang, Yizhou
    Wu, Ying Nian
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12118 - 12126
  • [50] Multi-Agent Prototyping for a Cooperative Carrying Task
    Djebrani, Salima
    Abdessemed, Foudil
    2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2009), VOLS 1-4, 2009, : 1421 - +