Topology-Aware Scheduling Framework for Microservice Applications in Cloud

被引:3
|
作者
Li, Xin [1 ]
Zhou, Junsong [1 ]
Wei, Xin [1 ]
Li, Dawei [2 ]
Qian, Zhuzhong [3 ]
Wu, Jie [4 ]
Qin, Xiaolin [1 ]
Lu, Sanglu [3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Nanjing 211106, Jiangsu, Peoples R China
[2] Montclair State Univ, Dept Comp Sci, Montclair, NJ 07043 USA
[3] Nanjing Univ, Dept Comp Sci & Technol, State Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
[4] Temple Univ, Ctr Networked Comp, Philadelphia 19122, PA USA
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Microservice architectures; Topology; Network topology; Containers; Resource management; Data centers; Virtual machining; Cloud computing; microservice; quality of service; resource scheduling; MULTIOBJECTIVE OPTIMIZATION; ALGORITHM;
D O I
10.1109/TPDS.2023.3238751
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Loosely coupled and highly cohesived microservices running in containers are becoming the new paradigm for application development. Compared with monolithic applications, applications built on microservices architecture can be deployed and scaled independently, which promises to simplify software development and operation. However, the dramatic increase in the scale of microservices and east-west network traffic in the data center have made the cluster management more complex. Not only does the scale of microservices cause a great deal of pressure on cluster management, but also cascading QoS violations present a substantial risk for SLOs (Service Level Objectives). In this paper, we propose a Microservice-Oriented Topology-Aware Scheduling Framework (MOTAS), which effectively utilizes the topologies of microservices and clusters to optimize the network overhead of microservice applications through a heuristic graph mapping algorithm. The proposed framework can also guarantee the cluster resource utilization. To deal with the dynamic environment of microservice, we propose a mechanism based on distributed trace analysis to detect and handle QoS violations in microservice applications. Through real-world experiments, the framework has been proved to be effective in ensuring cluster resource utilization, reducing application end-to-end latency, improving throughput, and handling QoS violations.
引用
收藏
页码:1635 / 1649
页数:15
相关论文
共 50 条
  • [1] Topology-Aware Continuous Experimentation in Microservice-Based Applications
    Schermann, Gerald
    Oliveira, Fabio
    Wittern, Erik
    Leitner, Philipp
    [J]. SERVICE-ORIENTED COMPUTING (ICSOC 2020), 2020, 12571 : 19 - 35
  • [2] A scheduling framework for large-scale, parallel, and topology-aware applications
    Kravtsov, Valentin
    Bar, Pavel
    Carmeli, David
    Schuster, Assaf
    Swain, Martin
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (09) : 983 - 992
  • [3] Topology-Aware GPU Scheduling for Learning Workloads in Cloud Environments
    Amaral, Marcelo
    Polo, Jorda
    Carrera, David
    Seelam, Seetharami
    Steinder, Malgorzata
    [J]. SC'17: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2017,
  • [4] A Topology-Aware Adaptive Deployment Framework for Elastic Applications
    Keller, Matthias
    Peuster, Manuel
    Robbert, Christoph
    Karl, Holger
    [J]. 2013 17TH INTERNATIONAL CONFERENCE ON INTELLIGENCE IN NEXT GENERATION NETWORKS (ICIN), 2013, : 61 - 69
  • [5] Topology-Aware OpenMP Process Scheduling
    Thoman, Peter
    Moritsch, Hans
    Fahringer, Thomas
    [J]. BEYOND LOOP LEVEL PARALLELISM IN OPENMP: ACCELERATORS, TASKING AND MORE, PROCEEDINGS, 2010, 6132 : 96 - 108
  • [6] A Topology-Aware Framework for Graph Traversals
    Meng, Jia
    Cao, Liang
    Yu, Huashan
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2017, 2017, 10393 : 165 - 179
  • [7] Effects of Topology-Aware Allocation Policies on Scheduling Performance
    Antonio Pascual, Jose
    Navaridas, Javier
    Miguel-Alonso, Jose
    [J]. JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 2009, 5798 : 138 - 156
  • [8] Topology-Aware Job Scheduling for Machine Learning Cluster
    Lu, Jingyuan
    Li, Peng
    Wang, Kun
    Feng, Huibin
    Guo, Enting
    Wang, Xiaoyan
    Guo, Song
    [J]. 2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [9] A topology-aware method for scientific application deployment on cloud
    Fan, Pei
    Chen, Zhenbang
    Wang, Ji
    Zheng, Zibin
    Lyu, Michael R.
    [J]. INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2014, 10 (04) : 338 - 370
  • [10] Topology-aware resource management for HPC applications
    Georgiou, Yiannis
    Jeannot, Emmanuel
    Mercier, Guillaume
    Villiermet, Adele
    [J]. 18TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING (ICDCN 2017), 2017,