Edge Capacity Planning for Real Time Compute-Intensive Applications

被引：4

作者：

Noreikis, Marius ^{[1
]}

Xiao, Yu ^{[1
]}

Jiang, Yuming ^{[2
]}

机构：

[1] Aalto Univ, Dept Commun & Networking, Espoo, Finland

[2] Norwegian Univ Sci & Technol, Dept Informat Secur & Commun Technol, Trondheim, Norway

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON FOG COMPUTING (ICFC 2019) | 2019年

基金：

芬兰科学院; 欧盟地平线“2020”;

关键词：

Capacity planning; queuing theory; edge computing; GPU; augmented reality;

D O I：

10.1109/ICFC.2019.00029

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Cloud computing is a major breakthrough in enabling multi-user scalable web services, process offloading and infrastructure cost savings. However, public clouds impose high network latency which became a bottleneck for real time applications such as mobile augmented reality applications. A widely accepted solution is to move latency sensitive services from the centralized cloud to the edge of the internet, close to service users. An important prerequisite for deploying applications at the edge is determining initial required edge capacity. However, little has been done to provide reliable estimates of required computing capacity under Quality-of-Service (QoS) constraints. Differently from previous works that focus only on applications' CPU usage, in this paper, we propose a novel, queuing theory based edge capacity planning solution for real-time compute-intensive applications that takes into account usage of both CPU and GPU. Our solution satisfies the QoS requirements in terms of response delays while minimizing the number of required edge computing nodes, assuming that the nodes are with fixed CPU/GPU capacity. We demonstrate the applicability and accuracy of our solution through extensive evaluation, including a case study using real-life applications. The results show that our solution maximizes the resource utilization through intelligent combinations of service requests, and can accurately estimate the minimal amount of CPU and GPU capacity required for satisfying the QoS requirements.

引用

页码：175 / 184

页数：10

共 50 条

[21] An Efficient FPGA-Based Memory Architecture for Compute-Intensive Applications on Embedded Devices
Shahrouzi, S. Navid
Perera, Darshika G.
[J]. 2017 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2017,
[22] CampusWare: An Easy-To-Use, Efficient and Portable Grid Middleware for Compute-intensive Applications
Wang, Dong
Jiang, Jinlei
Wu, Yongwei
Yang, Guangwen
[J]. FOURTH CHINAGRID ANNUAL CONFERENCE, PROCEEDINGS, 2009, : 36 - 43
[23] OPTIMAL SCHEDULING OF COMPUTE-INTENSIVE TASKS ON A NETWORK OF WORKSTATIONS
EFE, K
KRISHNAMOORTHY, V
[J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1995, 6 (06) : 668 - 673
[24] Integration of compute-intensive tasks into scientific workflows in BeesyCluster
Czarnul, Pawel
[J]. COMPUTATIONAL SCIENCE - ICCS 2006, PT 3, PROCEEDINGS, 2006, 3993 : 944 - 947
[25] Analyzing Energy-Efficiency of Two Scheduling Policies in Compute-Intensive Applications on Cloud
Kuang, Ping
Guo, Wenxia
Xu, Xiang
Li, Hongjian
Tian, Wenhong
Buyya, Rajkumar
[J]. IEEE ACCESS, 2018, 6 : 45515 - 45526
[26] Principles for designing data-/compute-intensive distributed applications and middleware systems for heterogeneous environments
Kim, Jik-Soo
Andrade, Henrique
Sussman, Alan
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2007, 67 (07) : 755 - 771
[27] MODELS AND ALGORITHMS FOR COSCHEDULING COMPUTE-INTENSIVE TASKS ON A NETWORK OF WORKSTATIONS
ATALLAH, MJ
BLACK, CL
MARINESCU, DC
SIEGEL, HJ
CASAVANT, TL
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1992, 16 (04) : 319 - 327
[28] Accelerating compute-intensive image segmentation algorithms using GPUs
Mohammed Shehab
Mahmoud Al-Ayyoub
Yaser Jararweh
Moath Jarrah
[J]. The Journal of Supercomputing, 2017, 73 : 1929 - 1951
[29] Trends in Energy Estimates for Computing in AI/Machine Learning Accelerators, Supercomputers, and Compute-Intensive Applications
Shankar, Sadasivan
Reuther, Albert
[J]. 2022 IEEE HIGH PERFORMANCE EXTREME COMPUTING VIRTUAL CONFERENCE (HPEC), 2022,
[30] PacketUsher: Exploiting DPDK to accelerate compute-intensive packet processing
Ren, Qingqing
Zhou, Liang
Xu, Zhijun
Zhang, Yujun
Zhang, Lei
[J]. COMPUTER COMMUNICATIONS, 2020, 161 : 324 - 333

← 1 2 3 4 5 →