Multiagent coordination utilising Q-learning

被引:0
|
作者
Patnaik, Srikanta [1 ]
Mahalik, N. P. [2 ]
机构
[1] FM Univ, Dept Informat & Commun Technol, Balasore 756019, Orissa, India
[2] Calif State Univ Fresno, Dept Ind Technol, Fresno, CA 93740 USA
关键词
autonomous system; Q-learning; situation calculus; spatio-temporal; Petri net-model;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem 'Coordination' among multiple agents is an active and open-ended problem of robotics. The coordination problem, considered in this paper, has been divided into three categories: (1) Spatial coordination; (2) Temporal coordination and (3) Spatio-temporal coordination. The principles of 'Situation Calculus' have been extended to model the spatial coordination problem of robot. The concept of temporal coordination has been introduced with Situated Automata and Q-learning. The collective learning behaviour of a multiagent system has been improved by the principles of Q-learning. The spatio-temporal coordination, that deals with the coordination problem involving both space and time has been modelled using the timed Petri nets. The special emphasis has been given to the behavioural model of eye and hand coordination of a mobile robot. One typical application of the multiagent coordination in the soccer playing robot has been proposed at the end of this paper.
引用
收藏
页码:361 / 379
页数:19
相关论文
共 50 条
  • [21] Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems
    Li, Husheng
    [J]. EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2010,
  • [22] Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning
    Ohnishi, Shota
    Uchibe, Eiji
    Yamaguchi, Yotaro
    Nakanishi, Kosuke
    Yasui, Yuji
    Ishii, Shin
    [J]. FRONTIERS IN NEUROROBOTICS, 2019, 13
  • [23] Learning rates for Q-learning
    Even-Dar, E
    Mansour, Y
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 5 : 1 - 25
  • [24] Learning rates for Q-Learning
    Even-Dar, E
    Mansour, Y
    [J]. COMPUTATIONAL LEARNING THEORY, PROCEEDINGS, 2001, 2111 : 589 - 604
  • [25] Multi-Agent Coordination Method Based on Fuzzy Q-Learning
    Peng, Jun
    Liu, Miao
    Wu, Min
    Zhang, Xiaoyong
    Lin, Kuo-Chi
    [J]. 2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5411 - +
  • [26] A distributed Q-learning algorithm for multi-agent team coordination
    Huang, J
    Yang, B
    Liu, DY
    [J]. Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 108 - 113
  • [27] Adaptive Individual Q-Learning-A Multiagent Reinforcement Learning Method for Coordination Optimization
    Zhang, Zhen
    Wang, Dongqing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [28] Contextual Q-Learning
    Pinto, Tiago
    Vale, Zita
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2927 - 2928
  • [29] Bayesian Q-learning
    Dearden, R
    Friedman, N
    Russell, S
    [J]. FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 761 - 768
  • [30] Zap Q-Learning
    Devraj, Adithya M.
    Meyn, Sean P.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30