Data-intensive workflow scheduling strategy based on deep reinforcement learning in multi-clouds

被引:0
|
作者
Zhang, Shuo [1 ,2 ]
Zhao, Zhuofeng [1 ,2 ]
Liu, Chen [1 ,2 ]
Qin, Shenghui [1 ,2 ]
机构
[1] North China Univ Technol, Sch Informat, Beijing 100144, Peoples R China
[2] Beijing Key Lab Large Scale Stream Data Integrat &, Beijing 100144, Peoples R China
关键词
Data-intensive workflow; Deep-Q-network; Multi-objective optimization; Intensive learning; ALGORITHMS;
D O I
10.1186/s13677-023-00504-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the increase development of Internet of Things devices, the data-intensive workflow has emerged as a new kinds of representation for IoT applications. Because most IoT systems are structured in multi-clouds environment and the data-intensive workflow has the characteristics of scattered data sources and distributed execution requirements at the cloud center and edge clouds, it brings many challenges to the scheduling of such workflow, such as data flow control management, data transmission scheduling, etc. Aiming at the execution constraints of business and technology and data transmission optimization of data-intensive workflow, a data-intensive workflow scheduling method based on deep reinforcement learning in multi-clouds is proposed. First, the execution constraints, edge node load and data transmission volume of IoT data workflow are modeled; then the data-intensive workflow is segmented with the consideration of business constraints and the first optimization goal of data transmission; besides, taking the workflow execution time and average load balancing as the secondary optimization goal, the improved DQN algorithm is used to schedule the workflow. Based on the DQN algorithm, the model reward function and action selection are redesigned and improved. The simulation results based on WorkflowSim show that, compared with MOPSO, NSGA-II, GTBGA and DQN, the algorithm proposed in this paper can effectively reduce the execution time of IoT data workflow under the condition of ensuring the execution constraints and load balancing of multi-clouds.
引用
下载
收藏
页数:12
相关论文
共 50 条
  • [1] Data-intensive workflow scheduling strategy based on deep reinforcement learning in multi-clouds
    Shuo Zhang
    Zhuofeng Zhao
    Chen Liu
    Shenghui Qin
    Journal of Cloud Computing, 12
  • [2] A novel scheduling algorithm for data-intensive workflow in virtualised clouds
    Li F.
    International Journal of Networking and Virtual Organisations, 2019, 20 (03) : 284 - 300
  • [3] Hypergraph-Based Data Reduced Scheduling Policy for Data-Intensive Workflow in Clouds
    Hu, Zhigang
    Li, Jia
    Zheng, Meiguang
    Zhang, Xinxin
    Kang, Hui
    Tao, Yong
    Yang, Jiao
    DATA SCIENCE, PT II, 2017, 728 : 335 - 349
  • [4] An incremental reinforcement learning scheduling strategy for data-intensive scientific workflows in the cloud
    Nascimento, Andre
    Silva, Vitor
    Paes, Aline
    de Oliveira, Daniel
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (11):
  • [5] Supporting Data-Intensive Workflows in Software-Defined Federated Multi-Clouds
    Diaz-Montes, Javier
    Diaz-Granados, Manuel
    Zou, Mengsong
    Tao, Shu
    Parashar, Manish
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2018, 6 (01) : 250 - 263
  • [6] Data-intensive workflow management: For clouds and data-intensive and scalable computing environments
    De Oliveira, Daniel C.M.
    Liu, Ji
    Pacitti, Esther
    Synthesis Lectures on Data Management, 2019, 14 (04): : 1 - 179
  • [7] Deadline based scheduling for data-intensive applications in clouds
    Fu Xiong
    Cang Yeliang
    Zhu Lipeng
    Hu Bin
    Deng Song
    Wang Dong
    The Journal of China Universities of Posts and Telecommunications, 2016, 23 (06) : 8 - 15
  • [8] Deadline based scheduling for data-intensive applications in clouds
    Fu Xiong
    Cang Yeliang
    Zhu Lipeng
    Hu Bin
    Deng Song
    Wang Dong
    The Journal of China Universities of Posts and Telecommunications, 2016, (06) : 8 - 15
  • [9] A Multi-Objective Optimization Model for Data-Intensive Workflow Scheduling in Data Grids
    Moghadam, Mahshid Helali
    Babamir, Seyyed Morteza
    Mirabi, Meghdad
    PROCEEDINGS OF THE 2016 IEEE 41ST CONFERENCE ON LOCAL COMPUTER NETWORKS - LCN WORKSHOPS 2016, 2016, : 25 - 33
  • [10] A Data-Intensive Workflow Scheduling Algorithm for Grid Computing
    Xu, Meng
    Cui, Lizhen
    Wang, Haiyang
    Bi, Yanbing
    Bian, Ji
    FOURTH CHINAGRID ANNUAL CONFERENCE, PROCEEDINGS, 2009, : 110 - 115