Predicting and Preventing Coordination Problems in Cooperative Q-learning Systems

被引:0
|
作者
Fulda, Nancy [1 ]
Ventura, Dan [1 ]
机构
[1] Brigham Young Univ, Dept Comp Sci, Provo, UT 84602 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a conceptual framework for creating Q-learning-based algorithms that converge to optimal equilibria in cooperative multiagent settings. This framework includes a set of conditions that are sufficient to guarantee optimal system performance. We demonstrate the efficacy of the framework by using it to analyze several well-known multi-agent learning algorithms and conclude by employing it as a design tool to construct a simple, novel multiagent learning algorithm.
引用
收藏
页码:780 / 785
页数:6
相关论文
共 50 条
  • [41] Cooperative Output Regulation By Q-learning For Discrete Multi-agent Systems In Finite-time
    Wei, Wenjun
    Tang, Jingyuan
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2022, 26 (06): : 853 - 864
  • [42] Cooperative Q-Learning for Multiple Secondary Users in Dynamic Spectrum Access
    Venkatraman, Pavithra
    Hamdaoui, Bechir
    2011 7TH INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2011, : 238 - 242
  • [43] Cooperative Spectrum Sensing for Cognitive Radios using Distributed Q-Learning
    van den Biggelaar, Olivier
    Dricot, Jean-Michel
    De Doncker, Philippe
    Horlin, Francois
    2011 IEEE VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2011,
  • [44] Cooperative Q-learning based channel selection for cognitive radio networks
    Feten Slimeni
    Zied Chtourou
    Bart Scheers
    Vincent Le Nir
    Rabah Attia
    Wireless Networks, 2019, 25 : 4161 - 4171
  • [45] Distributed Cooperative Q-learning for Power Allocation in Cognitive Femtocell Networks
    Saad, Hussein
    Mohamed, Amr
    ElBatt, Tamer
    2012 IEEE VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2012,
  • [46] Cooperative Path Planning for Single Leader Using Q-learning Method
    Zhang, Lichuan
    Wu, Dongwei
    Ren, Ranzhen
    Xing, Runfa
    GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [47] Cooperative pursuit with multiple pursuers based on Deep Minimax Q-learning
    Ji, Mengda
    Xu, Genjiu
    Duan, Zekun
    Wang, Liying
    Li, Zesheng
    Ge, Jianjun
    Li, Mingqiang
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 146
  • [48] A theoretical analysis of cooperative behaviorin multi-agent Q-learning
    Waltman, Ludo
    Kaymak, Uzay
    2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 84 - +
  • [49] Communication-Less Cooperative Q-Learning Agents in Maze Problem
    Uwano, Fumito
    Takadama, Keiki
    INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES 2016, 2017, 8 : 453 - 467
  • [50] A Cooperative Q-learning Approach for Online Power Allocation in Femtocell Networks
    Saad, Hussein
    Mohamed, Amr
    ElBatt, Tamer
    2013 IEEE 78TH VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2013,