Predicting and Preventing Coordination Problems in Cooperative Q-learning Systems

Cited by: 0

Authors:

Fulda, Nancy [1]

Ventura, Dan [1]

Affiliations:

[1] Brigham Young Univ, Dept Comp Sci, Provo, UT 84602 USA

Source:

20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2007

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We present a conceptual framework for creating Q-learning-based algorithms that converge to optimal equilibria in cooperative multiagent settings. This framework includes a set of conditions that are sufficient to guarantee optimal system performance. We demonstrate the efficacy of the framework by using it to analyze several well-known multi-agent learning algorithms and conclude by employing it as a design tool to construct a simple, novel multiagent learning algorithm.

Pages: 780-785

Page count: 6
