Edge Capacity Planning for Real Time Compute-Intensive Applications

被引:4
|
作者
Noreikis, Marius [1 ]
Xiao, Yu [1 ]
Jiang, Yuming [2 ]
机构
[1] Aalto Univ, Dept Commun & Networking, Espoo, Finland
[2] Norwegian Univ Sci & Technol, Dept Informat Secur & Commun Technol, Trondheim, Norway
基金
芬兰科学院; 欧盟地平线“2020”;
关键词
Capacity planning; queuing theory; edge computing; GPU; augmented reality;
D O I
10.1109/ICFC.2019.00029
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cloud computing is a major breakthrough in enabling multi-user scalable web services, process offloading and infrastructure cost savings. However, public clouds impose high network latency which became a bottleneck for real time applications such as mobile augmented reality applications. A widely accepted solution is to move latency sensitive services from the centralized cloud to the edge of the internet, close to service users. An important prerequisite for deploying applications at the edge is determining initial required edge capacity. However, little has been done to provide reliable estimates of required computing capacity under Quality-of-Service (QoS) constraints. Differently from previous works that focus only on applications' CPU usage, in this paper, we propose a novel, queuing theory based edge capacity planning solution for real-time compute-intensive applications that takes into account usage of both CPU and GPU. Our solution satisfies the QoS requirements in terms of response delays while minimizing the number of required edge computing nodes, assuming that the nodes are with fixed CPU/GPU capacity. We demonstrate the applicability and accuracy of our solution through extensive evaluation, including a case study using real-life applications. The results show that our solution maximizes the resource utilization through intelligent combinations of service requests, and can accurately estimate the minimal amount of CPU and GPU capacity required for satisfying the QoS requirements.
引用
收藏
页码:175 / 184
页数:10
相关论文
共 50 条
  • [31] EFFICIENCY AND PROGRAMMABILITY OF PROCESSORS FOR COMPUTE-INTENSIVE VISION PROCESSING SUBSYSTEMS
    Rowen, Chris
    [J]. ELECTRONICS WORLD, 2016, 122 (1960): : 26 - 27
  • [32] A Heterogeneous System Architecture for Low-Power Wireless Sensor Nodes in Compute-intensive Distributed Applications
    Engel, Andreas
    Koch, Andreas
    Siebel, Thomas
    [J]. 2015 IEEE 40TH LOCAL COMPUTER NETWORKS CONFERENCE WORKSHOPS (LCN WORKSHOPS), 2015, : 636 - 644
  • [33] Compute-Intensive Workflow Scheduling in Multi-Cloud Environment
    Gupta, Indrajeet
    Kumar, Madhu Sudan
    Janat, Prasanta K.
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 315 - 321
  • [34] Accelerating compute-intensive image segmentation algorithms using GPUs
    Shehab, Mohammed
    Al-Ayyoub, Mahmoud
    Jararweh, Yaser
    Jarrah, Moath
    [J]. JOURNAL OF SUPERCOMPUTING, 2017, 73 (05): : 1929 - 1951
  • [35] A Coarse-Grained Reconfigurable Architecture for Compute-Intensive MapReduce Acceleration
    Liang, Shuang
    Yin, Shouyi
    Liu, Leibo
    Guo, Yike
    Wei, Shaojun
    [J]. IEEE COMPUTER ARCHITECTURE LETTERS, 2016, 15 (02) : 69 - 72
  • [36] A Multi-Memory Field-Programmable Custom Computing Machine for Accelerating Compute-Intensive Applications
    Jadhav, Shrikant S.
    Gloster, Clay
    Naher, Jannatun
    Doss, Christopher
    Kim, Youngsoo
    [J]. 2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 619 - 628
  • [37] ADVANCED FEATURES OF NVIDIA KEPLER ARCHITECTURE AND PARALLEL COMPUTATION PLATFORM CUDA FOR DEVELOPING SCIENTIFIC COMPUTE-INTENSIVE APPLICATIONS
    Dudnik, V. A.
    Kudryavtsev, V., I
    Us, S. A.
    Shestakov, M., V
    [J]. PROBLEMS OF ATOMIC SCIENCE AND TECHNOLOGY, 2019, (03): : 105 - 108
  • [38] Optimized FPGA Implementation of a Compute-Intensive Oil Reservoir Simulation Algorithm
    Ioannou, Aggelos D.
    Malakonakis, Pavlos
    Georgopoulos, Konstantinos
    Papaefstathiou, Ioannis
    Dollas, Apostolos
    Mavroidis, Iakovos
    [J]. EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2019, 2019, 11733 : 442 - 454
  • [39] Cuckoo: flexible compute-intensive task offloading in mobile cloud computing
    Zhou, Zhigang
    Zhang, Hongli
    Ye, Lin
    Du, Xiaojiang
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2016, 16 (18): : 3256 - 3268
  • [40] Scheduling strategy of compute-intensive task-flow in generalized cluster
    Zhang, Ke-Jia
    Hu, Ya-Nan
    Li, Chun-Sheng
    Fu, Yu
    Li, Pan-Chi
    [J]. Kongzhi yu Juece/Control and Decision, 2019, 34 (12): : 2537 - 2546