Neural Regret-Matching for Distributed Constraint Optimization Problems

被引:0
|
作者
Deng, Yanchen [1 ]
Yu, Runshen [1 ]
Wang, Xinrun [1 ]
An, Bo [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
ALGORITHMS; ADOPT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distributed constraint optimization problems (DCOPs) are a powerful model for multi-agent coordination and optimization, where information and controls are distributed among multiple agents by nature. However, most of incomplete algorithms for DCOPs are context-free, i.e., agents make a decision purely based on the state of their neighbors, which makes them prone to get trapped in poor local convergence. On the other hand, context-based algorithms use tables to exactly store all the information (e.g., costs, confidence bounds), which limits their scalability. This paper tackles the limitation by incorporating deep neural networks in solving DCOPs for the first time and presents a neural context-based sampling scheme built upon regret-matching. In the algorithm, each agent trains a neural network to approximate the regret related to its local problem under current context and performs sampling according to the estimated regret. Furthermore, to ensure exploration, we propose a regret rounding scheme that rounds small regret values to positive numbers. We theoretically show the regret bound of our algorithm and extensive evaluations indicate that our algorithm can scale up to large-scale DCOPs and significantly outperform the state-of-the-art methods.
引用
收藏
页码:146 / 153
页数:8
相关论文
共 50 条
  • [1] Markets, correlation, and regret-matching
    Hart, Sergiu
    Mas-Colell, Andreu
    [J]. GAMES AND ECONOMIC BEHAVIOR, 2015, 93 : 42 - 58
  • [2] Predictive Regret-Matching for Cooperating Interceptors to Defeat an Advanced Threat
    Rajagopalan, Arvind
    Duong Duc Nguyen
    Kim, Jijoong
    [J]. AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 28 - 40
  • [3] Simple Adaptive Strategies: From Regret-Matching to Uncoupled Dynamics
    Levine, David K.
    [J]. GAMES AND ECONOMIC BEHAVIOR, 2014, 87 : 652 - 653
  • [4] Distributed Interference Mitigation in Two-Tier Wireless Networks Using Correlated Equilibrium and Regret-Matching Learning
    Sroka, Pawel
    Kliks, Adrian
    [J]. 2014 EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS (EUCNC), 2014,
  • [5] Proactive Distributed Constraint Optimization Problems
    Hoang, Khoi
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2411 - 2413
  • [6] Asymmetric Distributed Constraint Optimization Problems
    Grinshpoun, Tal
    Grubshtein, Alon
    Zivan, Roie
    Netzer, Arnon
    Meisels, Amnon
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 : 613 - 647
  • [7] Proactive Dynamic Distributed Constraint Optimization Problems
    Hoang, Khoi D.
    Fioretto, Ferdinando
    Hou, Ping
    Yeoh, William
    Yokoo, Makoto
    Zivan, Roie
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 179 - 225
  • [8] Distributed constraint optimization problems and applications: A survey
    Fioretto, Ferdinando
    Pontelli, Enrico
    Yeoh, William
    [J]. Journal of Artificial Intelligence Research, 2018, 61 : 623 - 698
  • [9] Exploiting the Structure of Distributed Constraint Optimization Problems
    Fioretto, Ferdinando
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 4233 - 4234
  • [10] Exploiting the Structure of Distributed Constraint Optimization Problems
    Fioretto, Ferdinando
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 2007 - 2008