Edge Capacity Planning for Real Time Compute-Intensive Applications

被引:4
|
作者
Noreikis, Marius [1 ]
Xiao, Yu [1 ]
Jiang, Yuming [2 ]
机构
[1] Aalto Univ, Dept Commun & Networking, Espoo, Finland
[2] Norwegian Univ Sci & Technol, Dept Informat Secur & Commun Technol, Trondheim, Norway
基金
芬兰科学院; 欧盟地平线“2020”;
关键词
Capacity planning; queuing theory; edge computing; GPU; augmented reality;
D O I
10.1109/ICFC.2019.00029
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cloud computing is a major breakthrough in enabling multi-user scalable web services, process offloading and infrastructure cost savings. However, public clouds impose high network latency which became a bottleneck for real time applications such as mobile augmented reality applications. A widely accepted solution is to move latency sensitive services from the centralized cloud to the edge of the internet, close to service users. An important prerequisite for deploying applications at the edge is determining initial required edge capacity. However, little has been done to provide reliable estimates of required computing capacity under Quality-of-Service (QoS) constraints. Differently from previous works that focus only on applications' CPU usage, in this paper, we propose a novel, queuing theory based edge capacity planning solution for real-time compute-intensive applications that takes into account usage of both CPU and GPU. Our solution satisfies the QoS requirements in terms of response delays while minimizing the number of required edge computing nodes, assuming that the nodes are with fixed CPU/GPU capacity. We demonstrate the applicability and accuracy of our solution through extensive evaluation, including a case study using real-life applications. The results show that our solution maximizes the resource utilization through intelligent combinations of service requests, and can accurately estimate the minimal amount of CPU and GPU capacity required for satisfying the QoS requirements.
引用
收藏
页码:175 / 184
页数:10
相关论文
共 50 条
  • [1] Energy Efficient Task Offloading for Compute-intensive Mobile Edge Applications
    Zhang, Xiaojie
    Debroy, Saptarshi
    [J]. ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [2] Exploiting GPUs for Compute-Intensive Medical Applications
    Jararweh, Yaser
    Jarrah, Moath
    Hariri, Salim
    [J]. 2012 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2012, : 29 - 34
  • [3] Execution of compute-intensive applications into parallel machines
    Houstis, C
    Kapidakis, S
    Markatos, EP
    Gelenbe, E
    [J]. INFORMATION SCIENCES, 1997, 97 (1-2) : 83 - 124
  • [4] Inexpensive computing environments for compute-intensive applications
    Winter, DR
    McGrath, L
    Berger, S
    Rice, DC
    Robinson, N
    Cushing, J
    Thurman, DA
    [J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XVIII, PROCEEDINGS: INFORMATION SYSTEMS, CONCEPTS AND APPLICATIONS OF SYSTEMICS, CYBERNETICS AND INFORMATICS, 2002, : 480 - 483
  • [5] Accelerating compute-intensive applications with GPUs and FPGAs
    Che, Shuai
    Li, Jie
    Sheaffer, Jeremy W.
    Skadron, Kevin
    Lach, John
    [J]. 2008 SYMPOSIUM ON APPLICATION SPECIFIC PROCESSORS, 2008, : 101 - +
  • [6] A parallel arithmetic array for accelerating compute-intensive applications
    Wang, Dong
    Cao, Peng
    Xiao, Yang
    [J]. IEICE ELECTRONICS EXPRESS, 2014, 11 (04):
  • [7] DtCraft: A Distributed Execution Engine for Compute-intensive Applications
    Huang, Tsung-Wei
    Lin, Chun-Xun
    Wong, Martin D. F.
    [J]. 2017 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2017, : 757 - 764
  • [8] Reliable Provisioning of Spot Instances for Compute-intensive Applications
    Voorsluys, William
    Buyya, Rajkumar
    [J]. 2012 IEEE 26TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2012, : 542 - 549
  • [9] Deployment of Run-Time Reconfigurable Hardware Coprocessors Into Compute-Intensive Embedded Applications
    Fons, Francisco
    Fons, Mariano
    Canto, Enrique
    Lopez, Mariano
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2012, 66 (02): : 191 - 221
  • [10] Optimal Offloading for Dynamic Compute-Intensive Applications in Wireless Networks
    Li, Bin
    [J]. 2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,