A Scalable Parallel Q-Learning Algorithm for Resource Constrained Decentralized Computing Environments

被引:7
|
作者
Camelo, Miguel [1 ]
Famaey, Jeroen [1 ]
Latre, Steven [1 ]
机构
[1] Univ Antwerp, IMEC, Dept Math & Comp Sci, Middelheimlaan 1, B-2020 Antwerp, Belgium
关键词
D O I
10.1109/MLHPC.2016.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Internet of Things (IoT) is more and more becoming a platform for mission critical applications with stringent requirements in terms of response time and mobility. Therefore, a centralized High Performance Computing (HPC) environment is often not suitable or simply non-existing. Instead, there is a need for a scalable HPC model that supports the deployment of applications on the decentralized but resource constrained devices of the IoT. Recently, Reinforcement Learning (RL) algorithms have been used for decision making within applications by directly interacting with the environment. However, most RL algorithms are designed for centralized environments and are time and resource consuming. Therefore, they are not applicable to such constrained decentralized computing environments. In this paper, we propose a scalable Parallel Q-Learning (PQL) algorithm for resource constrained environments. By combining a table partition strategy together with a co-allocation of both processing and storage, we can significantly reduce the individual resource cost and, at the same time, guarantee convergence and minimize the communication cost. Experimental results show that our algorithm reduces the required training in proportion of the number of Q-Learning agents and, in terms of execution time, it is up to 24 times faster than several well-known PQL algorithms.
引用
收藏
页码:27 / 35
页数:9
相关论文
共 50 条
  • [21] A Novel Q-learning Algorithm with Function Approximation for Constrained Markov Decision Processes
    Lakshmanan, K.
    Bhatnagar, Shalabh
    2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 400 - 405
  • [22] Fuzzy Deep Q-learning Task Offloading in Delay Constrained Vehicular Fog Computing
    Do Bao Son
    Vu Tri An
    Trinh Thu Hai
    Binh Minh Nguyen
    Nguyen Phi Le
    Huynh Thi Thanh Binh
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [23] Constrained Q-Learning for Batch Process Optimization
    Pan, Elton
    Petsagkourakis, Panagiotis
    Mowbray, Max
    Zhang, Dongda
    del Rio-Chanona, Antonio
    IFAC PAPERSONLINE, 2021, 54 (03): : 492 - 497
  • [24] Federated Learning via Decentralized Dataset Distillation in Resource-Constrained Edge Environments
    Song, Rui
    Liu, Dai
    Chen, Dave Zhenyu
    Festag, Andreas
    Trinitis, Carsten
    Schulz, Martin
    Knoll, Alois
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [25] A Request Scheduling Optimization Mechanism Based on Deep Q-Learning in Edge Computing Environments
    Zhang, Yaqiang
    Li, Rengang
    Zhao, Yaqian
    Li, Ruyang
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
  • [26] Q-learning and hyper-heuristic based algorithm recommendation for changing environments
    Golcuk, Ilker
    Ozsoydan, Fehmi Burcin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 102
  • [27] Decentralized Q-Learning with Constant Aspirations in Stochastic Games
    Yongacoglu, Bora
    Arslan, Gurdal
    Yuksel, Serdar
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 1744 - 1749
  • [28] Dynamic Switch Migration Algorithm with Q-learning towards Scalable SDN Control Plane
    Min, Zhu
    Hua, Qu
    Zhao Jihong
    2017 9TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2017,
  • [29] Optimized intellectual resource scheduling using deep reinforcement Q-learning in cloud computing
    Uma, J.
    Vivekanandan, P.
    Shankar, S.
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (05):
  • [30] Q-Learning Algorithm for Joint Computation Offloading and Resource Allocation in Edge Cloud
    Dab, Boutheina
    Aitsaadi, Nadjib
    Langar, Rami
    2019 IFIP/IEEE SYMPOSIUM ON INTEGRATED NETWORK AND SERVICE MANAGEMENT (IM), 2019,