Neural Regret-Matching for Distributed Constraint Optimization Problems

被引：0

作者：

Deng, Yanchen ^{[1
]}

Yu, Runshen ^{[1
]}

Wang, Xinrun ^{[1
]}

An, Bo ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore

来源：

PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021 | 2021年

基金：

新加坡国家研究基金会;

关键词：

ALGORITHMS; ADOPT;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Distributed constraint optimization problems (DCOPs) are a powerful model for multi-agent coordination and optimization, where information and controls are distributed among multiple agents by nature. However, most of incomplete algorithms for DCOPs are context-free, i.e., agents make a decision purely based on the state of their neighbors, which makes them prone to get trapped in poor local convergence. On the other hand, context-based algorithms use tables to exactly store all the information (e.g., costs, confidence bounds), which limits their scalability. This paper tackles the limitation by incorporating deep neural networks in solving DCOPs for the first time and presents a neural context-based sampling scheme built upon regret-matching. In the algorithm, each agent trains a neural network to approximate the regret related to its local problem under current context and performs sampling according to the estimated regret. Furthermore, to ensure exploration, we propose a regret rounding scheme that rounds small regret values to positive numbers. We theoretically show the regret bound of our algorithm and extensive evaluations indicate that our algorithm can scale up to large-scale DCOPs and significantly outperform the state-of-the-art methods.

引用

页码：146 / 153

页数：8

共 50 条

[1] Markets, correlation, and regret-matching
Hart, Sergiu
Mas-Colell, Andreu
[J]. GAMES AND ECONOMIC BEHAVIOR, 2015, 93 : 42 - 58
[2] Predictive Regret-Matching for Cooperating Interceptors to Defeat an Advanced Threat
Rajagopalan, Arvind
Duong Duc Nguyen
Kim, Jijoong
[J]. AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 28 - 40
[3] Simple Adaptive Strategies: From Regret-Matching to Uncoupled Dynamics
Levine, David K.
[J]. GAMES AND ECONOMIC BEHAVIOR, 2014, 87 : 652 - 653
[4] Distributed Interference Mitigation in Two-Tier Wireless Networks Using Correlated Equilibrium and Regret-Matching Learning
Sroka, Pawel
Kliks, Adrian
[J]. 2014 EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS (EUCNC), 2014,
[5] Proactive Distributed Constraint Optimization Problems
Hoang, Khoi
[J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2411 - 2413
[6] Asymmetric Distributed Constraint Optimization Problems
Grinshpoun, Tal
Grubshtein, Alon
Zivan, Roie
Netzer, Arnon
Meisels, Amnon
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 : 613 - 647
[7] Proactive Dynamic Distributed Constraint Optimization Problems
Hoang, Khoi D.
Fioretto, Ferdinando
Hou, Ping
Yeoh, William
Yokoo, Makoto
Zivan, Roie
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 179 - 225
[8] Distributed constraint optimization problems and applications: A survey
Fioretto, Ferdinando
Pontelli, Enrico
Yeoh, William
[J]. Journal of Artificial Intelligence Research, 2018, 61 : 623 - 698
[9] Exploiting the Structure of Distributed Constraint Optimization Problems
Fioretto, Ferdinando
[J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 4233 - 4234
[10] Exploiting the Structure of Distributed Constraint Optimization Problems
Fioretto, Ferdinando
[J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 2007 - 2008

← 1 2 3 4 5 →