Predicting and Preventing Coordination Problems in Cooperative Q-learning Systems

被引:0
|
作者
Fulda, Nancy [1 ]
Ventura, Dan [1 ]
机构
[1] Brigham Young Univ, Dept Comp Sci, Provo, UT 84602 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a conceptual framework for creating Q-learning-based algorithms that converge to optimal equilibria in cooperative multiagent settings. This framework includes a set of conditions that are sufficient to guarantee optimal system performance. We demonstrate the efficacy of the framework by using it to analyze several well-known multi-agent learning algorithms and conclude by employing it as a design tool to construct a simple, novel multiagent learning algorithm.
引用
收藏
页码:780 / 785
页数:6
相关论文
共 50 条
  • [21] Learning Automata Based Q-Learning for Content Placement in Cooperative Caching
    Yang, Zhong
    Liu, Yuanwei
    Chen, Yue
    Jiao, Lei
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (06) : 3667 - 3680
  • [22] Satisficing Q-learning: Efficient learning in problems with dichotomous attributes
    Goodrich, MA
    Quigley, M
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA'04), 2004, : 65 - 72
  • [23] A study on expertise of agents and its effects on cooperative Q-learning
    Araabi, Babak Nadjar
    Mastoureshgh, Sahar
    Ahmadabadi, Majid Nili
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02): : 398 - 409
  • [24] Multi-criteria expertness based cooperative Q-learning
    Esmat Pakizeh
    Maziar Palhang
    Mir Mohsen Pedram
    Applied Intelligence, 2013, 39 : 28 - 40
  • [25] Multi-criteria expertness based cooperative Q-learning
    Pakizeh, Esmat
    Palhang, Maziar
    Pedram, Mir Mohsen
    APPLIED INTELLIGENCE, 2013, 39 (01) : 28 - 40
  • [26] Sequential Q-Learning With Kalman Filtering for Multirobot Cooperative Transportation
    Wang, Ying
    de Silva, Clarence W.
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2010, 15 (02) : 261 - 268
  • [27] Evaluating cooperative-competitive dynamics with deep Q-learning
    Kopacz, Aniko
    Csato, Lehel
    Chira, Camelia
    NEUROCOMPUTING, 2023, 550
  • [28] Multi-robot Cooperative Planning by Consensus Q-learning
    Sadhu, Arup Kumar
    Konar, Amit
    Banerjee, Bonny
    Nagar, Atulya K.
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 4158 - 4164
  • [29] A Multi-Step Joint Q-learning Cooperative Algorithm for Regional Interconnected Power Systems
    Xiong, Li
    Li, Ling
    Liu, Wei
    Liang, Zhencheng
    Ling, Wuneng
    Zhang, Ye
    2024 IEEE 2ND INTERNATIONAL CONFERENCE ON POWER SCIENCE AND TECHNOLOGY, ICPST 2024, 2024, : 2250 - 2255
  • [30] Cooperative Spectrum Sensing Using Q-Learning with Experimental Validation
    Chen, Zhe
    Qiu, Robert C.
    IEEE SOUTHEASTCON 2011: BUILDING GLOBAL ENGINEERS, 2011, : 405 - 408