Collective Online Learning of Gaussian Processes in Massive Multi-Agent Systems

Cited by: 0

Authors:

Trong Nghia Hoang [1]

Quang Minh Hoang [2]

Low, Kian Hsiang [3]

How, Jonathan [4]

Affiliations:

[1] MIT IBM Watson AI Lab, Cambridge, MA 02142 USA

[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[3] Natl Univ Singapore, Singapore, Singapore

[4] MIT, Cambridge, MA 02139 USA

Source:

THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper presents a novel Collective Online Learning of Gaussian Processes (COOL-GP) framework that enables a massive number of GP inference agents to simultaneously perform (a) efficient online updates of their GP models using their local streaming data with varying correlation structures and (b) decentralized fusion of their resulting online GP models with different learned hyperparameter settings and inducing inputs. To realize this, we exploit the notion of a common encoding structure to encapsulate the local streaming data gathered by any GP inference agent into summary statistics based on our proposed representation. This representation is amenable both to an efficient online update via an importance sampling trick and to multi-agent model fusion via decentralized message passing, which can exploit sparse connectivity among agents to improve efficiency and enhance the robustness of our framework against transmission loss. We provide a rigorous theoretical analysis of the approximation loss arising from our proposed representation to achieve efficient online updates and model fusion. Empirical evaluations show that COOL-GP is highly effective in model fusion, resilient to information disparity between agents, robust to transmission loss, and can scale to thousands of agents.


Pages: 7850-7857

Number of pages: 8

