Edge Capacity Planning for Real Time Compute-Intensive Applications

被引：4

作者：

Noreikis, Marius ^{[1
]}

Xiao, Yu ^{[1
]}

Jiang, Yuming ^{[2
]}

机构：

[1] Aalto Univ, Dept Commun & Networking, Espoo, Finland

[2] Norwegian Univ Sci & Technol, Dept Informat Secur & Commun Technol, Trondheim, Norway

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON FOG COMPUTING (ICFC 2019) | 2019年

基金：

芬兰科学院; 欧盟地平线“2020”;

关键词：

Capacity planning; queuing theory; edge computing; GPU; augmented reality;

D O I：

10.1109/ICFC.2019.00029

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Cloud computing is a major breakthrough in enabling multi-user scalable web services, process offloading and infrastructure cost savings. However, public clouds impose high network latency which became a bottleneck for real time applications such as mobile augmented reality applications. A widely accepted solution is to move latency sensitive services from the centralized cloud to the edge of the internet, close to service users. An important prerequisite for deploying applications at the edge is determining initial required edge capacity. However, little has been done to provide reliable estimates of required computing capacity under Quality-of-Service (QoS) constraints. Differently from previous works that focus only on applications' CPU usage, in this paper, we propose a novel, queuing theory based edge capacity planning solution for real-time compute-intensive applications that takes into account usage of both CPU and GPU. Our solution satisfies the QoS requirements in terms of response delays while minimizing the number of required edge computing nodes, assuming that the nodes are with fixed CPU/GPU capacity. We demonstrate the applicability and accuracy of our solution through extensive evaluation, including a case study using real-life applications. The results show that our solution maximizes the resource utilization through intelligent combinations of service requests, and can accurately estimate the minimal amount of CPU and GPU capacity required for satisfying the QoS requirements.

引用

页码：175 / 184

页数：10

共 50 条

[31] EFFICIENCY AND PROGRAMMABILITY OF PROCESSORS FOR COMPUTE-INTENSIVE VISION PROCESSING SUBSYSTEMS
Rowen, Chris
[J]. ELECTRONICS WORLD, 2016, 122 (1960): : 26 - 27
[32] A Heterogeneous System Architecture for Low-Power Wireless Sensor Nodes in Compute-intensive Distributed Applications
Engel, Andreas
Koch, Andreas
Siebel, Thomas
[J]. 2015 IEEE 40TH LOCAL COMPUTER NETWORKS CONFERENCE WORKSHOPS (LCN WORKSHOPS), 2015, : 636 - 644
[33] Compute-Intensive Workflow Scheduling in Multi-Cloud Environment
Gupta, Indrajeet
Kumar, Madhu Sudan
Janat, Prasanta K.
[J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 315 - 321
[34] Accelerating compute-intensive image segmentation algorithms using GPUs
Shehab, Mohammed
Al-Ayyoub, Mahmoud
Jararweh, Yaser
Jarrah, Moath
[J]. JOURNAL OF SUPERCOMPUTING, 2017, 73 (05): : 1929 - 1951
[35] A Coarse-Grained Reconfigurable Architecture for Compute-Intensive MapReduce Acceleration
Liang, Shuang
Yin, Shouyi
Liu, Leibo
Guo, Yike
Wei, Shaojun
[J]. IEEE COMPUTER ARCHITECTURE LETTERS, 2016, 15 (02) : 69 - 72
[36] A Multi-Memory Field-Programmable Custom Computing Machine for Accelerating Compute-Intensive Applications
Jadhav, Shrikant S.
Gloster, Clay
Naher, Jannatun
Doss, Christopher
Kim, Youngsoo
[J]. 2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 619 - 628
[37] ADVANCED FEATURES OF NVIDIA KEPLER ARCHITECTURE AND PARALLEL COMPUTATION PLATFORM CUDA FOR DEVELOPING SCIENTIFIC COMPUTE-INTENSIVE APPLICATIONS
Dudnik, V. A.
Kudryavtsev, V., I
Us, S. A.
Shestakov, M., V
[J]. PROBLEMS OF ATOMIC SCIENCE AND TECHNOLOGY, 2019, (03): : 105 - 108
[38] Optimized FPGA Implementation of a Compute-Intensive Oil Reservoir Simulation Algorithm
Ioannou, Aggelos D.
Malakonakis, Pavlos
Georgopoulos, Konstantinos
Papaefstathiou, Ioannis
Dollas, Apostolos
Mavroidis, Iakovos
[J]. EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2019, 2019, 11733 : 442 - 454
[39] Cuckoo: flexible compute-intensive task offloading in mobile cloud computing
Zhou, Zhigang
Zhang, Hongli
Ye, Lin
Du, Xiaojiang
[J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2016, 16 (18): : 3256 - 3268
[40] Scheduling strategy of compute-intensive task-flow in generalized cluster
Zhang, Ke-Jia
Hu, Ya-Nan
Li, Chun-Sheng
Fu, Yu
Li, Pan-Chi
[J]. Kongzhi yu Juece/Control and Decision, 2019, 34 (12): : 2537 - 2546

← 1 2 3 4 5 →