Elevator group control using multiple reinforcement learning agents

被引:135
|
作者
Crites, RH
Barto, AG
机构
[1] Unica Technol Inc, Lincoln, MA 01773 USA
[2] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
关键词
reinforcement learning; multiple agents; teams; elevator group control; discrete event dynamic systems;
D O I
10.1023/A:1007518724497
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithms have appeared that approximate dynamic programming on an incremental basis. They can be trained on the basis of real or simulated experiences, focusing their computation on areas of state space that are actually visited during control, making them computationally tractable on very large problems. If each member of a team of agents employs one of these algorithms, a new collective learning algorithm emerges for the team as a whole. In this paper we demonstrate that such collective RL algorithms can be powerful heuristic methods for addressing large-scale control problems. Elevator group control serves as our testbed. It is a difficult domain posing a combination of challenges not seen in most multi-agent learning research to date. We use a team of RL agents, each of which is responsible for controlling one elevator car. The team receives a global reward signal which appears noisy to each agent due to the effects of the actions of the other agents, the random nature of the arrivals and the incomplete observation of the state. In spite of these complications, we show results that in simulation surpass the best of the heuristic elevator control algorithms of which we are aware. These results demonstrate the power of multi-agent RL on a very large scale stochastic dynamic optimization problem of practical utility.
引用
收藏
页码:235 / 262
页数:28
相关论文
共 50 条
  • [1] Elevator Group Control Using Multiple Reinforcement Learning Agents
    Robert H. Crites
    Andrew G. Barto
    [J]. Machine Learning, 1998, 33 : 235 - 262
  • [2] Elevator group control using reinforcement learning with CMAC
    Gao, Yang
    Hu, Jing-Kai
    Wang, Ben-Nian
    Wang, Dong-Li
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2007, 35 (02): : 362 - 365
  • [3] Elevator Group Supervisory Control System using Genetic Network Programming with reinforcement learning
    Zhou, J
    Eguchi, T
    Hirasawa, K
    Hu, JL
    Markon, S
    [J]. 2005 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-3, PROCEEDINGS, 2005, : 336 - 342
  • [4] Improving elevator performance using reinforcement learning
    Crites, RH
    Barto, AG
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 1017 - 1023
  • [5] The implementation of reinforcement learning algorithms on the elevator control system
    Li, Hang
    [J]. PROCEEDINGS OF 2015 IEEE 20TH CONFERENCE ON EMERGING TECHNOLOGIES & FACTORY AUTOMATION (ETFA), 2015,
  • [6] A study of applying genetic network programming with reinforcement learning to elevator group supervisory control system
    Zhou, Jin
    Eguchi, Toru
    Mabu, Shingo
    Hirasawa, Kotaro
    Hu, Jinglu
    Markon, Sandor
    [J]. 2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 3020 - +
  • [7] Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning
    Schaff, Charles
    Yunis, David
    Chakrabarti, Ayan
    Walter, Matthew R.
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 9798 - 9805
  • [8] Research of Elevator Group Scheduling System Based on Reinforcement Learning Algorithm
    Zheng, Liu
    Guang, Shu
    Hui, Dong
    [J]. PROCEEDINGS OF 2013 2ND INTERNATIONAL CONFERENCE ON MEASUREMENT, INFORMATION AND CONTROL (ICMIC 2013), VOLS 1 & 2, 2013, : 606 - 610
  • [9] Urban traffic signal control using reinforcement learning agents
    Balaji, P. G.
    German, X.
    Srinivasan, D.
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (03) : 177 - 188
  • [10] Elevator group control by using talented algorithm
    Dagdelen, Ulvi
    Bagis, Aytekin
    Karaboga, Dervis
    [J]. ARTIFICIAL INTELLIGENCE AND NEURAL NETWORKS, 2006, 3949 : 203 - 210