QoS-Aware and Resource Efficient Microservice Deployment in Cloud-Edge Continuum

被引:26
|
作者
Fu, Kaihua [1 ]
Zhang, Wei [1 ]
Chen, Quan [1 ]
Zeng, Deze [2 ]
Peng, Xin [3 ]
Zheng, Wenli [1 ]
Guo, Minyi [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] China Univ Geosci, Wuhan, Peoples R China
[3] Fudan Univ, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/IPDPS49936.2021.00102
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
User-facing services are now evolving towards the microservice architecture where a service is built by connecting multiple microservice stages. While an entire service is heavy, the microservice architecture shows the opportunity to only offload some microservice stages to the edge devices that are close to the end users. However, emerging techniques often result in the violation of Quality-of-Service (QoS) of microservice-based services in cloud-edge continuum, as they do not consider the communication overhead or the resource contention between microservices. We propose Nautilus, a runtime system that effectively deploys microservice-based user-facing services in cloud-edge continuum. It ensures the QoS of microservice-based user-facing services while minimizing the required computational resources. Nautilus is comprised of a communication-aware microservice mapper, a contention-aware resource manager and a load-aware microservice scheduler. The mapper divides the microservice graph into multiple partitions based on the communication overhead and maps the partitions to the nodes. On each node, the resource manager determines the optimal resource allocation for its microservices based on reinforcement learning that may capture the complex contention behaviors. The microservice scheduler monitors the QoS of the entire service, and migrates microservices from busy nodes to idle ones at runtime. Our experimental results show that Nautilus reduces the computational resource usage by 23.9% and the network bandwidth usage by 53.4%, while achieving the required 99%-ile latency.
引用
下载
收藏
页码:932 / 941
页数:10
相关论文
共 50 条
  • [21] A QoS-aware resource allocation framework in virtualised cloud environments
    Tian Y.
    International Journal of Networking and Virtual Organisations, 2019, 21 (03) : 336 - 350
  • [22] QoS-Aware Resource Placement for LEO Satellite Edge Computing
    Pfandzelter, Tobias
    Bermbach, David
    6TH IEEE INTERNATIONAL CONFERENCE ON FOG AND EDGE COMPUTING (ICFEC 2022), 2022, : 66 - 72
  • [23] QRSF: QoS-aware resource scheduling framework in cloud computing
    Sukhpal Singh
    Inderveer Chana
    The Journal of Supercomputing, 2015, 71 : 241 - 292
  • [24] A resource elasticity framework for QoS-aware execution of cloud applications
    Kaur, Pankaj Deep
    Chana, Inderveer
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 37 : 14 - 25
  • [25] An Adaptive Qos-Aware Cloud
    Zhang Yuchao
    Deng Bo
    Peng Fuyang
    2012 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGIES, APPLICATIONS AND MANAGEMENT (ICCCTAM), 2012, : 160 - 163
  • [26] Latency-Aware Deployment of IoT Services in a Cloud-Edge Environment
    Zhang, Shouli
    Liu, Chen
    Wang, Jianwu
    Yang, Zhongguo
    Han, Yanbo
    Li, Xiaohong
    SERVICE-ORIENTED COMPUTING (ICSOC 2019), 2019, 11895 : 231 - 236
  • [27] A Resource Reservation based Framework for QoS-aware Resource Provision in Cloud Computing
    He, Hong
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (09): : 193 - 204
  • [28] A Cost-Efficient QoS-Aware Model for Cloud IaaS Resource Allocation in Large Datacenters
    Metwally, Khaled
    Jarray, Abdallah
    Karmouch, Ahmed
    2015 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (CLOUDNET), 2015, : 38 - 43
  • [29] Energy-efficient and QoS-aware model based resource consolidation in cloud data centers
    Li, Hongjian
    Zhu, Guofeng
    Zhao, Yuyan
    Dai, Yu
    Tian, Wenhong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (03): : 2793 - 2803
  • [30] Energy-efficient and QoS-aware model based resource consolidation in cloud data centers
    Hongjian Li
    Guofeng Zhu
    Yuyan Zhao
    Yu Dai
    Wenhong Tian
    Cluster Computing, 2017, 20 : 2793 - 2803