Deep Reinforcement Learning with Successive Over-Relaxation and its Application in Autoscaling Cloud Resources

被引：2

作者：

John, Indu ^{[1
]}

Bhatnagar, Shalabh ^{[1
]}

机构：

[1] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore, Karnataka, India

来源：

2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年

关键词：

reinforcement learning; deep learning; cloud computing; resource allocation; atari games;

D O I：

10.1109/ijcnn48605.2020.9206598

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a new deep reinforcement learning algorithm using the technique of successive over-relaxation (SOR) in Deep Q-networks (DQNs). The new algorithm, named SOR-DQN, uses modified targets in the DQN framework with the aim of accelerating training. This work is motivated by the problem of auto-scaling resources for cloud applications, for which existing algorithms suffer from issues such as slow convergence, poor performance during the training phase and non-scalability. For the above problem, SOR-DQN achieves significant improvements over DQN on both synthetic and real datasets. We also study the generalization ability of the algorithm to multiple tasks by using it to train agents playing Atari video games.

引用

页数：6

共 50 条

[1] Successive Over-Relaxation Q-Learning
Kamanchi, Chandramouli
Diddigi, Raghuram Bharadwaj
Bhatnagar, Shalabh
IEEE CONTROL SYSTEMS LETTERS, 2020, 4 (01): : 55 - 60
[2] NONLINEAR SUCCESSIVE OVER-RELAXATION
BREWSTER, ME
KANNAN, R
NUMERISCHE MATHEMATIK, 1984, 44 (02) : 309 - 315
[3] ESTIMATION OF SUCCESSIVE OVER-RELAXATION FACTOR
RIGLER, AK
MATHEMATICS OF COMPUTATION, 1965, 19 (90) : 302 - &
[4] Enhance Stability of Successive Over-Relaxation Method and Orthogonalized Symmetry Successive Over-Relaxation in a Larger Range of Relaxation Parameter
Liu, Chein-Shan
Chang, Chih-Wen
SYMMETRY-BASEL, 2024, 16 (07):
[5] Reinforcement learning-based application Autoscaling in the Cloud: A survey
Gari, Yisel
Monge, David A.
Pacini, Elina
Mateos, Cristian
Garcia Garino, Carlos
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 102
[6] ON CONVERGENCE CRITERIA FOR METHOD OF SUCCESSIVE OVER-RELAXATION
BROYDEN, CG
MATHEMATICS OF COMPUTATION, 1964, 18 (85) : 136 - &
[7] The successive over-relaxation method in reconfigurable hardware
Kasbah, Safaa J.
Haraty, Ramzi A.
Damaj, Issarn W.
IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 2395 - +
[8] Optimizing Cloud Workloads: Autoscaling with Reinforcement Learning
Mishra, Pratik
Hans, Sandeep
Saha, Diptikalyan
Moogi, Pratibha
2024 IEEE 17TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD 2024, 2024, : 217 - 222
[9] SUCCESSIVE PERIPHERAL OVER-RELAXATION AND OTHER BLOCK METHODS
BENSON, A
EVANS, DJ
JOURNAL OF COMPUTATIONAL PHYSICS, 1976, 21 (01) : 1 - 19
[10] Generating efficient parallel code for successive over-relaxation
Tang, PY
ICA(3)PP 97 - 1997 3RD INTERNATIONAL CONFERENCE ON ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, 1997, : 503 - 510

← 1 2 3 4 5 →