Coordinated Reinforcement Learning Agents in a Multi-Agent Virtual Environment

Cited by: 6
Author
Sause, William [1]
Affiliation
[1] Nova SE Univ, Grad Sch Comp & Informat Sci, Ft Lauderdale, FL 33314 USA
Keywords
Reinforcement learning; virtual environments; intelligent agents
DOI
10.1109/ICMLA.2013.46
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This research presents a framework for coordinating multiple intelligent agents within a single virtual environment. Coordination is accomplished via a "next available agent" scheme, while learning is achieved with the Q-learning and Sarsa temporal-difference reinforcement learning algorithms. To assess the effectiveness of each learning algorithm, experiments measured an agent's ability to learn tasks in both static and dynamic environments, using either a fixed (FEP) or a variable (VEP) epsilon-greedy probability rate. Results show that Sarsa, on average, outperformed Q-learning in almost all experiments. Overall, VEP produced higher percentages of successes and optimal successes than FEP, and showed convergence to the optimal policy as measured by the average number of time steps per episode.
Pages: 227-230
Number of pages: 4
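The abstract compares Q-learning with Sarsa under fixed and variable epsilon-greedy exploration. The sketch below shows the standard textbook form of the two update rules and of epsilon-greedy action selection; it is not taken from the paper. All function names, the dictionary-based Q-table, and the parameters `alpha` and `gamma` are illustrative assumptions. The abstract does not say how the variable rate (VEP) changes over time, so the exponential decay in `variable_epsilon` is a hypothetical schedule chosen only for illustration.

```python
import random


def epsilon_greedy(q, state, actions, epsilon, rng):
    """With probability epsilon explore at random, otherwise act greedily."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q.get((state, a), 0.0))


def q_learning_update(q, s, a, r, s_next, actions, alpha, gamma):
    """Off-policy TD update: bootstrap from the best action available in s_next."""
    best_next = max(q.get((s_next, b), 0.0) for b in actions)
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + alpha * (r + gamma * best_next - old)


def sarsa_update(q, s, a, r, s_next, a_next, alpha, gamma):
    """On-policy TD update: bootstrap from the action actually chosen in s_next."""
    old = q.get((s, a), 0.0)
    next_value = q.get((s_next, a_next), 0.0)
    q[(s, a)] = old + alpha * (r + gamma * next_value - old)


def fixed_epsilon(episode, epsilon=0.1):
    """FEP: the exploration rate is the same in every episode."""
    return epsilon


def variable_epsilon(episode, start=1.0, end=0.01, decay=0.99):
    """VEP (assumed schedule): exponential decay from start toward a floor of end."""
    return max(end, start * decay ** episode)
```

The only difference between the two updates is the bootstrap term: Q-learning uses the maximum estimated value in the next state, whereas Sarsa uses the value of the action the agent will actually take, so Sarsa's estimates reflect the exploration policy itself. State-action pairs that have not been visited default to a value of 0.0 in this sketch.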