Edge Capacity Planning for Real Time Compute-Intensive Applications

被引:4
|
作者
Noreikis, Marius [1 ]
Xiao, Yu [1 ]
Jiang, Yuming [2 ]
机构
[1] Aalto Univ, Dept Commun & Networking, Espoo, Finland
[2] Norwegian Univ Sci & Technol, Dept Informat Secur & Commun Technol, Trondheim, Norway
基金
芬兰科学院; 欧盟地平线“2020”;
关键词
Capacity planning; queuing theory; edge computing; GPU; augmented reality;
D O I
10.1109/ICFC.2019.00029
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cloud computing is a major breakthrough in enabling multi-user scalable web services, process offloading and infrastructure cost savings. However, public clouds impose high network latency which became a bottleneck for real time applications such as mobile augmented reality applications. A widely accepted solution is to move latency sensitive services from the centralized cloud to the edge of the internet, close to service users. An important prerequisite for deploying applications at the edge is determining initial required edge capacity. However, little has been done to provide reliable estimates of required computing capacity under Quality-of-Service (QoS) constraints. Differently from previous works that focus only on applications' CPU usage, in this paper, we propose a novel, queuing theory based edge capacity planning solution for real-time compute-intensive applications that takes into account usage of both CPU and GPU. Our solution satisfies the QoS requirements in terms of response delays while minimizing the number of required edge computing nodes, assuming that the nodes are with fixed CPU/GPU capacity. We demonstrate the applicability and accuracy of our solution through extensive evaluation, including a case study using real-life applications. The results show that our solution maximizes the resource utilization through intelligent combinations of service requests, and can accurately estimate the minimal amount of CPU and GPU capacity required for satisfying the QoS requirements.
引用
收藏
页码:175 / 184
页数:10
相关论文
共 50 条
  • [21] An Efficient FPGA-Based Memory Architecture for Compute-Intensive Applications on Embedded Devices
    Shahrouzi, S. Navid
    Perera, Darshika G.
    [J]. 2017 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2017,
  • [22] CampusWare: An Easy-To-Use, Efficient and Portable Grid Middleware for Compute-intensive Applications
    Wang, Dong
    Jiang, Jinlei
    Wu, Yongwei
    Yang, Guangwen
    [J]. FOURTH CHINAGRID ANNUAL CONFERENCE, PROCEEDINGS, 2009, : 36 - 43
  • [23] OPTIMAL SCHEDULING OF COMPUTE-INTENSIVE TASKS ON A NETWORK OF WORKSTATIONS
    EFE, K
    KRISHNAMOORTHY, V
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1995, 6 (06) : 668 - 673
  • [24] Integration of compute-intensive tasks into scientific workflows in BeesyCluster
    Czarnul, Pawel
    [J]. COMPUTATIONAL SCIENCE - ICCS 2006, PT 3, PROCEEDINGS, 2006, 3993 : 944 - 947
  • [25] Analyzing Energy-Efficiency of Two Scheduling Policies in Compute-Intensive Applications on Cloud
    Kuang, Ping
    Guo, Wenxia
    Xu, Xiang
    Li, Hongjian
    Tian, Wenhong
    Buyya, Rajkumar
    [J]. IEEE ACCESS, 2018, 6 : 45515 - 45526
  • [26] Principles for designing data-/compute-intensive distributed applications and middleware systems for heterogeneous environments
    Kim, Jik-Soo
    Andrade, Henrique
    Sussman, Alan
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2007, 67 (07) : 755 - 771
  • [27] MODELS AND ALGORITHMS FOR COSCHEDULING COMPUTE-INTENSIVE TASKS ON A NETWORK OF WORKSTATIONS
    ATALLAH, MJ
    BLACK, CL
    MARINESCU, DC
    SIEGEL, HJ
    CASAVANT, TL
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1992, 16 (04) : 319 - 327
  • [28] Accelerating compute-intensive image segmentation algorithms using GPUs
    Mohammed Shehab
    Mahmoud Al-Ayyoub
    Yaser Jararweh
    Moath Jarrah
    [J]. The Journal of Supercomputing, 2017, 73 : 1929 - 1951
  • [29] Trends in Energy Estimates for Computing in AI/Machine Learning Accelerators, Supercomputers, and Compute-Intensive Applications
    Shankar, Sadasivan
    Reuther, Albert
    [J]. 2022 IEEE HIGH PERFORMANCE EXTREME COMPUTING VIRTUAL CONFERENCE (HPEC), 2022,
  • [30] PacketUsher: Exploiting DPDK to accelerate compute-intensive packet processing
    Ren, Qingqing
    Zhou, Liang
    Xu, Zhijun
    Zhang, Yujun
    Zhang, Lei
    [J]. COMPUTER COMMUNICATIONS, 2020, 161 : 324 - 333