Collective Online Learning of Gaussian Processes in Massive Multi-Agent Systems

被引:0
|
作者
Trong Nghia Hoang [1 ]
Quang Minh Hoang [2 ]
Low, Kian Hsiang [3 ]
How, Jonathan [4 ]
机构
[1] MIT IBM Watson AI Lab, Cambridge, MA 02142 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Natl Univ Singapore, Singapore, Singapore
[4] MIT, Cambridge, MA 02139 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel Collective Online Learning of Gaussian Processes (COOL-GP) framework for enabling a massive number of GP inference agents to simultaneously perform (a) efficient online updates of their GP models using their local streaming data with varying correlation structures and (b) decentralized fusion of their resulting online GP models with different learned hyperparameter settings and inducing inputs. To realize this, we exploit the notion of a common encoding structure to encapsulate the local streaming data gathered by any GP inference agent into summary statistics based on our proposed representation, which is amenable to both an efficient online update via an importance sampling trick as well as multi-agent model fusion via decentralized message passing that can exploit sparse connectivity among agents for improving efficiency and enhance the robustness of our framework against transmission loss. We provide a rigorous theoretical analysis of the approximation loss arising from our proposed representation to achieve efficient online updates and model fusion. Empirical evaluations show that COOL-GP is highly effective in model fusion, resilient to information disparity between agents, robust to transmission loss, and can scale to thousands of agents.
引用
收藏
页码:7850 / 7857
页数:8
相关论文
共 50 条
  • [1] Online Learning-based Formation Control of Multi-Agent Systems with Gaussian Processes
    Beckers, Thomas
    Hirche, Sandra
    Colombo, Leonardo
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2197 - 2202
  • [2] Decentralized Multi-Agent Exploration with Online-Learning of Gaussian Processes
    Viseras, Alberto
    Wiedemann, Thomas
    Manss, Christoph
    Magel, Lukas
    Mueller, Joachim
    Shutin, Dmitriy
    Merino, Luis
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 4222 - 4229
  • [3] Online Learning for Markov Decision Processes Applied to Multi-Agent Systems
    El Chamie, Mahmoud
    Acikmese, Behcet
    Mesbahi, Mehran
    [J]. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [4] Multi-agent systems and role games: collective learning processes for ecosystem management
    Bousquet, F
    Barreteau, O
    d'Aquino, P
    Etienne, M
    Boissau, S
    Aubert, S
    Le Page, C
    Babin, D
    Castella, JC
    [J]. COMPLEXITY AND ECOSYSTEM MANAGEMENT: THE THEORY AND PRACTICE OF MULTI-AGENT SYSTEMS, 2002, : 248 - 285
  • [5] Distributed Learning Consensus Control for Unknown Nonlinear Multi-Agent Systems based on Gaussian Processes
    Yang, Zewen
    Sosnowski, Stefan
    Liu, Qingchen
    Jiao, Junjie
    Lederer, Armin
    Hirche, Sandra
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 4406 - 4411
  • [6] Collective decisions in multi-agent systems
    Schweitzer, Frank
    [J]. Advancing Social Simulation: The First World Congress, 2007, : 7 - 12
  • [7] Distributed Experiment Design and Control for Multi-agent Systems with Gaussian Processes
    Viet-Anh Le
    Nghiem, Truong X.
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2226 - 2231
  • [8] Massive multi-agent systems control
    Campagne, JC
    Cardon, A
    Collomb, E
    Nishida, T
    [J]. FORMAL APPROACHES TO AGENT-BASED SYSTEMS, 2005, 3228 : 275 - 280
  • [9] Online Reinforcement Learning in Multi-Agent Systems for Distributed Energy Systems
    Menon, Bharat R.
    Menon, Sangeetha B.
    Srinivasan, Dipti
    Jain, Lakhmi
    [J]. 2014 IEEE INNOVATIVE SMART GRID TECHNOLOGIES - ASIA (ISGT ASIA), 2014, : 791 - 796
  • [10] Multi-Agent Safe Planning with Gaussian Processes
    Zhu, Zheqing
    Biyik, Erdem
    Sadigh, Dorsa
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 6260 - 6267