Coordinated Reinforcement Learning Agents in a Multi-Agent Virtual Environment

被引：6

作者：

Sause, William ^{[1
]}

机构：

[1] Nova SE Univ, Grad Sch Comp & Informat Sci, Ft Lauderdale, FL 33314 USA

来源：

2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1 | 2013年

关键词：

Reinforcement learning; virtual environments; intelligent agents;

D O I：

10.1109/ICMLA.2013.46

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This research presents a framework for coordinating multiple intelligent agents within a single virtual environment. Coordination is accomplished via a "next available agent" scheme while learning is achieved through the use of the Q-learning and Sarsa temporal difference reinforcement learning algorithms. To assess the effectiveness of each learning algorithm, experiments were conducted that measured an agent's ability to learn tasks in a static and dynamic environment while using both a fixed (FEP) and variable (VEP) epsilon-greedy probability rate. Results show that Sarsa, on average, outperformed Q-learning in almost all experiments. Overall, VEP resulted in higher percentages of successes and optimal successes than FEP, and showed convergence to the optimal policy when measuring the average number of time steps per episode.

引用

页码：227 / 230

页数：4

共 50 条

[21] Multi-Agent Reinforcement Learning
Stankovic, Milos
2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
[22] Explaining the Behaviour of Reinforcement Learning Agents in a Multi-Agent Cooperative Environment Using Policy Graphs
Vila, Marc
Gnatyshak, Dmitry
Tormos, Adrian
Gimenez-Abalos, Victor
Alvarez-Napagao, Sergio
ELECTRONICS, 2024, 13 (03)
[23] OFCOURSE: A Multi-Agent Reinforcement Learning Environment for Order Fulfillment
Zhu, Yiheng
Zhan, Yang
Huang, Xuankun
Chen, Yuwei
Chen, Yujie
Wei, Jiangwen
Feng, Wei
Zhou, Yinzhi
Hu, Haoyuan
Ye, Jieping
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[24] Cooperative Multi-Agent Reinforcement Learning in a Large Stationary Environment
Zemzem, Wiem
Tagina, Moncef
2017 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2017), 2017, : 365 - 371
[25] TraCo: Learning Virtual Traffic Coordinator for Cooperation with Multi-Agent Reinforcement Learning
Liu, Weiwei
Jing, Wei
Gao, Lingping
Guo, Ke
Xu, Gang
Liu, Yong
CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
[26] Coordinated Multi-Agent Imitation Learning
Le, Hoang M.
Yue, Yisong
Carr, Peter
Lucey, Patrick
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
[27] Decomposing Temporal Equilibrium Strategy for Coordinated Distributed Multi-Agent Reinforcement Learning
Zhu, Chenyang
Si, Wen
Zhu, Jinyu
Jiang, Zhihao
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17618 - 17627
[28] Coordinated Slicing and Admission Control Using Multi-Agent Deep Reinforcement Learning
Sulaiman, Muhammad
Moayyedi, Arash
Ahmadi, Mahdieh
Salahuddin, Mohammad A.
Boutaba, Raouf
Saleh, Aladdin
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (02): : 1110 - 1124
[29] Learning of defaults by agents in a distributed multi-agent system environment
1600, Springer Science and Business Media Deutschland GmbH (13):
[30] Decentralized multi-agent reinforcement learning with networked agents: recent advances
Zhang, Kaiqing
Yang, Zhuoran
Basar, Tamer
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2021, 22 (06) : 802 - 814

← 1 2 3 4 5 →