Topology-Aware Scheduling Framework for Microservice Applications in Cloud

被引：3

作者：

Li, Xin ^{[1
]}

Zhou, Junsong ^{[1
]}

Wei, Xin ^{[1
]}

Li, Dawei ^{[2
]}

Qian, Zhuzhong ^{[3
]}

Wu, Jie ^{[4
]}

Qin, Xiaolin ^{[1
]}

Lu, Sanglu ^{[3
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Nanjing 211106, Jiangsu, Peoples R China

[2] Montclair State Univ, Dept Comp Sci, Montclair, NJ 07043 USA

[3] Nanjing Univ, Dept Comp Sci & Technol, State Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China

[4] Temple Univ, Ctr Networked Comp, Philadelphia 19122, PA USA

来源：

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS | 2023年 / 34卷 / 05期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Microservice architectures; Topology; Network topology; Containers; Resource management; Data centers; Virtual machining; Cloud computing; microservice; quality of service; resource scheduling; MULTIOBJECTIVE OPTIMIZATION; ALGORITHM;

D O I：

10.1109/TPDS.2023.3238751

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Loosely coupled and highly cohesived microservices running in containers are becoming the new paradigm for application development. Compared with monolithic applications, applications built on microservices architecture can be deployed and scaled independently, which promises to simplify software development and operation. However, the dramatic increase in the scale of microservices and east-west network traffic in the data center have made the cluster management more complex. Not only does the scale of microservices cause a great deal of pressure on cluster management, but also cascading QoS violations present a substantial risk for SLOs (Service Level Objectives). In this paper, we propose a Microservice-Oriented Topology-Aware Scheduling Framework (MOTAS), which effectively utilizes the topologies of microservices and clusters to optimize the network overhead of microservice applications through a heuristic graph mapping algorithm. The proposed framework can also guarantee the cluster resource utilization. To deal with the dynamic environment of microservice, we propose a mechanism based on distributed trace analysis to detect and handle QoS violations in microservice applications. Through real-world experiments, the framework has been proved to be effective in ensuring cluster resource utilization, reducing application end-to-end latency, improving throughput, and handling QoS violations.

引用

页码：1635 / 1649

页数：15

共 50 条

[1] Topology-Aware Continuous Experimentation in Microservice-Based Applications
Schermann, Gerald
Oliveira, Fabio
Wittern, Erik
Leitner, Philipp
[J]. SERVICE-ORIENTED COMPUTING (ICSOC 2020), 2020, 12571 : 19 - 35
[2] A scheduling framework for large-scale, parallel, and topology-aware applications
Kravtsov, Valentin
Bar, Pavel
Carmeli, David
Schuster, Assaf
Swain, Martin
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (09) : 983 - 992
[3] Topology-Aware GPU Scheduling for Learning Workloads in Cloud Environments
Amaral, Marcelo
Polo, Jorda
Carrera, David
Seelam, Seetharami
Steinder, Malgorzata
[J]. SC'17: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2017,
[4] A Topology-Aware Adaptive Deployment Framework for Elastic Applications
Keller, Matthias
Peuster, Manuel
Robbert, Christoph
Karl, Holger
[J]. 2013 17TH INTERNATIONAL CONFERENCE ON INTELLIGENCE IN NEXT GENERATION NETWORKS (ICIN), 2013, : 61 - 69
[5] Topology-Aware OpenMP Process Scheduling
Thoman, Peter
Moritsch, Hans
Fahringer, Thomas
[J]. BEYOND LOOP LEVEL PARALLELISM IN OPENMP: ACCELERATORS, TASKING AND MORE, PROCEEDINGS, 2010, 6132 : 96 - 108
[6] A Topology-Aware Framework for Graph Traversals
Meng, Jia
Cao, Liang
Yu, Huashan
[J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2017, 2017, 10393 : 165 - 179
[7] Effects of Topology-Aware Allocation Policies on Scheduling Performance
Antonio Pascual, Jose
Navaridas, Javier
Miguel-Alonso, Jose
[J]. JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 2009, 5798 : 138 - 156
[8] Topology-Aware Job Scheduling for Machine Learning Cluster
Lu, Jingyuan
Li, Peng
Wang, Kun
Feng, Huibin
Guo, Enting
Wang, Xiaoyan
Guo, Song
[J]. 2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[9] A topology-aware method for scientific application deployment on cloud
Fan, Pei
Chen, Zhenbang
Wang, Ji
Zheng, Zibin
Lyu, Michael R.
[J]. INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2014, 10 (04) : 338 - 370
[10] Topology-aware resource management for HPC applications
Georgiou, Yiannis
Jeannot, Emmanuel
Mercier, Guillaume
Villiermet, Adele
[J]. 18TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING (ICDCN 2017), 2017,

← 1 2 3 4 5 →