A Scalable Parallel Q-Learning Algorithm for Resource Constrained Decentralized Computing Environments

被引：7

作者：

Camelo, Miguel ^{[1
]}

Famaey, Jeroen ^{[1
]}

Latre, Steven ^{[1
]}

机构：

[1] Univ Antwerp, IMEC, Dept Math & Comp Sci, Middelheimlaan 1, B-2020 Antwerp, Belgium

来源：

PROCEEDINGS OF 2016 2ND WORKSHOP ON MACHINE LEARNING IN HPC ENVIRONMENTS (MLHPC) | 2016年

关键词：

D O I：

10.1109/MLHPC.2016.007

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The Internet of Things (IoT) is more and more becoming a platform for mission critical applications with stringent requirements in terms of response time and mobility. Therefore, a centralized High Performance Computing (HPC) environment is often not suitable or simply non-existing. Instead, there is a need for a scalable HPC model that supports the deployment of applications on the decentralized but resource constrained devices of the IoT. Recently, Reinforcement Learning (RL) algorithms have been used for decision making within applications by directly interacting with the environment. However, most RL algorithms are designed for centralized environments and are time and resource consuming. Therefore, they are not applicable to such constrained decentralized computing environments. In this paper, we propose a scalable Parallel Q-Learning (PQL) algorithm for resource constrained environments. By combining a table partition strategy together with a co-allocation of both processing and storage, we can significantly reduce the individual resource cost and, at the same time, guarantee convergence and minimize the communication cost. Experimental results show that our algorithm reduces the required training in proportion of the number of Q-Learning agents and, in terms of execution time, it is up to 24 times faster than several well-known PQL algorithms.

引用

页码：27 / 35

页数：9

共 50 条

[21] A Novel Q-learning Algorithm with Function Approximation for Constrained Markov Decision Processes
Lakshmanan, K.
Bhatnagar, Shalabh
2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 400 - 405
[22] Fuzzy Deep Q-learning Task Offloading in Delay Constrained Vehicular Fog Computing
Do Bao Son
Vu Tri An
Trinh Thu Hai
Binh Minh Nguyen
Nguyen Phi Le
Huynh Thi Thanh Binh
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[23] Constrained Q-Learning for Batch Process Optimization
Pan, Elton
Petsagkourakis, Panagiotis
Mowbray, Max
Zhang, Dongda
del Rio-Chanona, Antonio
IFAC PAPERSONLINE, 2021, 54 (03): : 492 - 497
[24] Federated Learning via Decentralized Dataset Distillation in Resource-Constrained Edge Environments
Song, Rui
Liu, Dai
Chen, Dave Zhenyu
Festag, Andreas
Trinitis, Carsten
Schulz, Martin
Knoll, Alois
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[25] A Request Scheduling Optimization Mechanism Based on Deep Q-Learning in Edge Computing Environments
Zhang, Yaqiang
Li, Rengang
Zhao, Yaqian
Li, Ruyang
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
[26] Q-learning and hyper-heuristic based algorithm recommendation for changing environments
Golcuk, Ilker
Ozsoydan, Fehmi Burcin
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 102
[27] Decentralized Q-Learning with Constant Aspirations in Stochastic Games
Yongacoglu, Bora
Arslan, Gurdal
Yuksel, Serdar
CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 1744 - 1749
[28] Dynamic Switch Migration Algorithm with Q-learning towards Scalable SDN Control Plane
Min, Zhu
Hua, Qu
Zhao Jihong
2017 9TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2017,
[29] Optimized intellectual resource scheduling using deep reinforcement Q-learning in cloud computing
Uma, J.
Vivekanandan, P.
Shankar, S.
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (05):
[30] Q-Learning Algorithm for Joint Computation Offloading and Resource Allocation in Edge Cloud
Dab, Boutheina
Aitsaadi, Nadjib
Langar, Rami
2019 IFIP/IEEE SYMPOSIUM ON INTEGRATED NETWORK AND SERVICE MANAGEMENT (IM), 2019,

← 1 2 3 4 5 →