A Deep Reinforcement Learning-Based Preemptive Approach for Cost-Aware Cloud Job Scheduling

被引:8
|
作者
Cheng, Long [1 ]
Wang, Yue [1 ]
Cheng, Feng [2 ]
Liu, Cheng [3 ]
Zhao, Zhiming [4 ]
Wang, Ying [3 ]
机构
[1] North China Elect Power Univ, Sch Control & Comp Engn, Beijing 102206, Peoples R China
[2] Southwest Jiaotong Univ, Sch Math, Chengdu 610032, Sichuan, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing 100190, Peoples R China
[4] Univ Amsterdam, Res Grp Multiscale Networked Syst, NL-1012WP Amsterdam, Netherlands
来源
关键词
Cloud computing; Processor scheduling; Real-time systems; Costs; Quality of service; Time factors; Dynamic scheduling; DRL; job scheduling; preemptive mechanism; optimization; TASKS;
D O I
10.1109/TSUSC.2023.3303898
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With some specific characteristics such as elastics and scalability, cloud computing has become the most promising technology for online business nowadays. However, how to efficiently perform real-time job scheduling in cloud still poses significant challenges. The reason is that those jobs are highly dynamic and complex, and it is always hard to allocate them to computing resources in an optimal way, such as to meet the requirements from both service providers and users. In recent years, various works demonstrate that deep reinforcement learning (DRL) can handle real-time cloud jobs well in scheduling. However, to our knowledge, none of them has ever considered extra optimization opportunities for the allocated jobs in their scheduling frameworks. Given this fact, in this work, we introduce a novel DRL-based preemptive method for further improve the performance of the current studies. Specifically, we try to improve the training of scheduling policy with effective job preemptive mechanisms, and on that basis to optimize job execution cost while meeting users' expected response time. We introduce the detailed design of our method, and our evaluations demonstrate that our approach can achieve better performance than other scheduling algorithms under different real-time workloads, including the DRL approach.
引用
收藏
页码:422 / 432
页数:11
相关论文
共 50 条
  • [31] Cost-Aware Cloud Service Request Scheduling for SaaS Providers
    Liu, Zhipiao
    Wang, Shangguang
    Sun, Qibo
    Zou, Hua
    Yang, Fangchun
    COMPUTER JOURNAL, 2014, 57 (02): : 291 - 301
  • [32] Deep reinforcement learning-based memetic algorithm for energy-aware flexible job shop scheduling with multi-AGV
    Zhang, Fayong
    Li, Rui
    Gong, Wenyin
    COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 189
  • [33] Deep reinforcement learning-based algorithms selectors for the resource scheduling in hierarchical Cloud computing
    Zhou G.
    Wen R.
    Tian W.
    Buyya R.
    Journal of Network and Computer Applications, 2022, 208
  • [34] Cost-aware DAG scheduling algorithms for minimizing execution cost on cloud resources
    Moïse W. Convolbo
    Jerry Chou
    The Journal of Supercomputing, 2016, 72 : 985 - 1012
  • [35] Cost-aware DAG scheduling algorithms for minimizing execution cost on cloud resources
    Convolbo, Moise W.
    Chou, Jerry
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (03): : 985 - 1012
  • [36] Deadline-constrained cost-aware workflow scheduling in hybrid cloud
    Hussain, Mehboob
    Luo, Ming-Xing
    Hussain, Abid
    Javed, Muhammad Hafeez
    Abbas, Zeeshan
    Wei, Lian-Fu
    SIMULATION MODELLING PRACTICE AND THEORY, 2023, 129
  • [37] A cost-AWARE approach based ON learning automata FOR resource auto-scaling IN cloud computing environment
    Mogoui, Khosro
    Arani, Mostafa Ghobaei
    International Journal of Hybrid Information Technology, 2015, 8 (07): : 389 - 398
  • [38] Cost-aware cloud workflow scheduling using DRL and simulated annealing
    Yan Gu
    Feng Cheng
    Lijie Yang
    Junhui Xu
    Xiaomin Chen
    Long Cheng
    Digital Communications and Networks, 2024, 10 (06) : 1590 - 1599
  • [39] Cost-aware cloud workflow scheduling using DRL and simulated annealing
    Gu, Yan
    Cheng, Feng
    Yang, Lijie
    Xu, Junhui
    Chen, Xiaomin
    Cheng, Long
    Digital Communications and Networks, 2024, 10 (06) : 1590 - 1599
  • [40] Cost-aware Service Placement and Scheduling in the Edge-Cloud Continuum
    Rac, Samuel
    Brorsson, Mats
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, 21 (02)