Deep Reinforcement Learning with Successive Over-Relaxation and its Application in Autoscaling Cloud Resources

被引:2
|
作者
John, Indu [1 ]
Bhatnagar, Shalabh [1 ]
机构
[1] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore, Karnataka, India
关键词
reinforcement learning; deep learning; cloud computing; resource allocation; atari games;
D O I
10.1109/ijcnn48605.2020.9206598
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new deep reinforcement learning algorithm using the technique of successive over-relaxation (SOR) in Deep Q-networks (DQNs). The new algorithm, named SOR-DQN, uses modified targets in the DQN framework with the aim of accelerating training. This work is motivated by the problem of auto-scaling resources for cloud applications, for which existing algorithms suffer from issues such as slow convergence, poor performance during the training phase and non-scalability. For the above problem, SOR-DQN achieves significant improvements over DQN on both synthetic and real datasets. We also study the generalization ability of the algorithm to multiple tasks by using it to train agents playing Atari video games.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Successive Over-Relaxation Q-Learning
    Kamanchi, Chandramouli
    Diddigi, Raghuram Bharadwaj
    Bhatnagar, Shalabh
    IEEE CONTROL SYSTEMS LETTERS, 2020, 4 (01): : 55 - 60
  • [2] NONLINEAR SUCCESSIVE OVER-RELAXATION
    BREWSTER, ME
    KANNAN, R
    NUMERISCHE MATHEMATIK, 1984, 44 (02) : 309 - 315
  • [3] ESTIMATION OF SUCCESSIVE OVER-RELAXATION FACTOR
    RIGLER, AK
    MATHEMATICS OF COMPUTATION, 1965, 19 (90) : 302 - &
  • [4] Enhance Stability of Successive Over-Relaxation Method and Orthogonalized Symmetry Successive Over-Relaxation in a Larger Range of Relaxation Parameter
    Liu, Chein-Shan
    Chang, Chih-Wen
    SYMMETRY-BASEL, 2024, 16 (07):
  • [5] Reinforcement learning-based application Autoscaling in the Cloud: A survey
    Gari, Yisel
    Monge, David A.
    Pacini, Elina
    Mateos, Cristian
    Garcia Garino, Carlos
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 102
  • [6] ON CONVERGENCE CRITERIA FOR METHOD OF SUCCESSIVE OVER-RELAXATION
    BROYDEN, CG
    MATHEMATICS OF COMPUTATION, 1964, 18 (85) : 136 - &
  • [7] The successive over-relaxation method in reconfigurable hardware
    Kasbah, Safaa J.
    Haraty, Ramzi A.
    Damaj, Issarn W.
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 2395 - +
  • [8] Optimizing Cloud Workloads: Autoscaling with Reinforcement Learning
    Mishra, Pratik
    Hans, Sandeep
    Saha, Diptikalyan
    Moogi, Pratibha
    2024 IEEE 17TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD 2024, 2024, : 217 - 222
  • [9] SUCCESSIVE PERIPHERAL OVER-RELAXATION AND OTHER BLOCK METHODS
    BENSON, A
    EVANS, DJ
    JOURNAL OF COMPUTATIONAL PHYSICS, 1976, 21 (01) : 1 - 19
  • [10] Generating efficient parallel code for successive over-relaxation
    Tang, PY
    ICA(3)PP 97 - 1997 3RD INTERNATIONAL CONFERENCE ON ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, 1997, : 503 - 510