Efficient Flow-Based Scheduling for Geo-Distributed Simulation Tasks in Collaborative Edge and Cloud Environments

被引:6
|
作者
Zhang Miao [1 ]
Peng Yong [1 ]
Zhu Jiancheng [1 ]
Yin Quanjun [1 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Computational modeling; Data models; Costs; Cloud computing; Optimization; Clustering algorithms; Task scheduling; distributed simulation; minimum cost maximum flow; cloud and edge environment; ALGORITHM; ASSIGNMENT;
D O I
10.1109/TPDS.2022.3155713
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Edge computing is a good complement to cloud computing for deploying large-scale geo-distributed simulation applications, which are very sensitive to the communication delay among different simulation components (also called tasks in this paper) and users. We mainly focus on the efficient scheduling of simulation components in collaborative edge and cloud environments. As components should be deployed jointly with the consideration of capacity constraints of hosts, it is actually an NP-complete multi-dimensional bin packing problem. Meanwhile, dynamic changes of component and host states require the low deployment latency of scheduling algorithms. Unfortunately, most of the existing schedulers for modern clusters are queue-based, in which tasks are scheduled sequentially, thus lacking the ability to process tightly coupled tasks jointly. Other batching-based placement algorithms are usually time-consuming. This paper describes Pond, a novel flow-based scheduler with the awareness of interactions among tasks and users as well as heterogeneous multi-dimensional resources. First, characteristics of distributed simulation tasks are analysed and the scheduling problem is formulated as a min-cost max-flow (MCMF) problem over the flow network by mapping the communication overhead among tasks and users to the costs of arcs in the network. Considering the inherent defects of existing flow-based schedulers in dealing with multi-dimensional resources, a new method based on dominant resource is proposed and some problem specific heuristics are also designed. Extensive simulation experiments based on Alibaba production trace and some random synthetic parameters are conducted. Results show that Pond can reduce the average communication cost for each task significantly in a quite low deployment latency compared with some baselines.
引用
收藏
页码:3442 / 3459
页数:18
相关论文
共 50 条
  • [1] Efficient Dynamic Deployment of Simulation Tasks in Collaborative Cloud and Edge Environments
    Zhang, Miao
    Jiao, Peng
    Peng, Yong
    Yin, Quanjun
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [2] An incremental flow-based task scheduler for simulation tasks in cloud environments
    Zhang, Miao
    Peng, Yong
    Yin, Quanjun
    Chen, Qinglong
    [J]. INTERNATIONAL CONFERENCE ON INTELLIGENT TRAFFIC SYSTEMS AND SMART CITY (ITSSC 2021), 2022, 12165
  • [3] Ensuring Reliability in Geo-Distributed Edge Cloud
    Jonathan, Albert
    Uluyol, Muhammed
    Chandra, Abhishek
    Weissman, Jon
    [J]. 2017 RESILIENCE WEEK (RWS), 2017, : 127 - 132
  • [4] Efficient multi-attribute precedence-based task scheduling for edge computing in geo-distributed cloud environment
    Chunlin Li
    Chaokun Zhang
    Bingbin Ma
    Youlong Luo
    [J]. Knowledge and Information Systems, 2022, 64 : 175 - 205
  • [5] Efficient multi-attribute precedence-based task scheduling for edge computing in geo-distributed cloud environment
    Li, Chunlin
    Zhang, Chaokun
    Ma, Bingbin
    Luo, Youlong
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (01) : 175 - 205
  • [6] Analysis of Control Traffic in a Geo-distributed Collaborative Cloud
    Sciammarella, Tatiana
    Couto, Rodrigo S.
    Rubinstein, Marcelo G.
    Campista, Miguel Elias M.
    Costa, Luis Henrique M. K.
    [J]. 2016 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (IEEE CLOUDNET), 2016, : 224 - 229
  • [7] Scheduling Stream Processing Tasks on Geo-Distributed Heterogeneous Resources
    Janssen, Gerrit
    Verbitskiy, Ilya
    Renner, Thomas
    Thamsen, Lauritz
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 5159 - 5164
  • [8] A Scheduling Framework for Periodic Tasks in Geo-Distributed Data Centers
    Li, Yan
    Zhang, Hong
    Wang, Yong
    Liu, Xinran
    Zhang, Peng
    [J]. 9TH IEEE INTERNATIONAL SYMPOSIUM ON SERVICE-ORIENTED SYSTEM ENGINEERING (SOSE 2015), 2015, : 247 - 252
  • [9] Joint Scheduling of Data and Computation in Geo-distributed Cloud Systems
    Yin, Lingyan
    Sun, Jizhou
    Zhao, Laiping
    Cui, Chenzhou
    Xiao, Jian
    Yu, Ce
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 657 - 666
  • [10] Flutter: Scheduling Tasks Closer to Data Across Geo-Distributed Datacenters
    Hu, Zhiming
    Li, Baochun
    Luo, Jun
    [J]. IEEE INFOCOM 2016 - THE 35TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS, 2016,