Study on Statistics Based Q-learning Algorithm for Multi-Agent System

被引:0
|
作者
Xie Ya [1 ]
Huang Zhonghua [1 ]
机构
[1] Hunan Inst Engn, Xiangtan 411104, Hunan, Peoples R China
关键词
Q-Learning; Statistics; Multi-Agent; RoboCup;
D O I
10.1109/ISDEA.2013.541
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes statistic learning based Q-learning algorithm for Multi-Agent System, the agent can learn other agents' action policies through observing and counting the joint action, a concise but useful hypothesis is adopted to denote the optimal policies of other agents, the full joint probability of policies distribution guarantees the learning agent to choose optimal action. The algorithm can improve the learning speed because it cut conventional Q-learning space from exponential one to linear one. The convergence of the algorithm is proved, the successful application of this algorithm in the RoboCup shows its good learning performance.
引用
收藏
页码:595 / 600
页数:6
相关论文
共 50 条
  • [1] A novel multi-agent Q-learning algorithm in cooperative multi-agent system
    Ou, HT
    Zhang, WD
    Zhang, WY
    Xu, XM
    [J]. PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 272 - 276
  • [2] Q-learning in Multi-Agent Cooperation
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Lin, Tzung-Feng
    [J]. 2008 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS, 2008, : 239 - 244
  • [3] Multi-Agent Advisor Q-Learning
    Subramanian, Sriram Ganapathi
    Taylor, Matthew E.
    Larson, Kate
    Crowley, Mark
    [J]. Journal of Artificial Intelligence Research, 2022, 74 : 1 - 74
  • [4] A distributed Q-learning algorithm for multi-agent team coordination
    Huang, J
    Yang, B
    Liu, DY
    [J]. Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 108 - 113
  • [5] Multi-Agent Advisor Q-Learning
    Subramanian, Sriram Ganapathi
    Taylor, Matthew E.
    Larson, Kate
    Crowley, Mark
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6884 - 6889
  • [6] Q-learning Algorithm Based Multi-Agent Coordinated Control Method for Microgrids
    Xi, Yuanyuan
    Chang, Liuchen
    Mao, Meiqin
    Jin, Peng
    Hatziargyriou, Nikos
    Xu, Haibo
    [J]. 2015 9TH INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND ECCE ASIA (ICPE-ECCE ASIA), 2015, : 1497 - 1504
  • [7] Multi-Agent Advisor Q-Learning
    Subramanian, Sriram Ganapathi
    Taylor, Matthew E.
    Larson, Kate
    Crowley, Mark
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 1 - 74
  • [8] Multi-Agent Cooperation Q-Learning Algorithm Based on Constrained Markov Game
    Ge, Yangyang
    Zhu, Fei
    Huang, Wei
    Zhao, Peiyao
    Liu, Quan
    [J]. COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2020, 17 (02) : 647 - 664
  • [9] Multi-agent crowdsourcing model based on Q-learning
    Fang, Xin
    Guo, Yongan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2019,
  • [10] Research of Q-learning Algorithm of Multi-agent in Micro-grid Control System Based on Probability
    Li Xiangna
    Yi Nan
    Li Mengao
    Xu Weifeng
    [J]. PROCEEDINGS OF THE 2016 5TH INTERNATIONAL CONFERENCE ON ENERGY AND ENVIRONMENTAL PROTECTION (ICEEP 2016), 2016, 98 : 317 - 321