Evaluating the Performance of Deep Learning Inference Service on Edge Platform

被引:0
|
作者
Choi, Hyun-Hwa [1 ]
Cha, Jae-Geun [1 ]
Yun, Seung-Hyun [2 ]
Kim, Dae Won [1 ]
Jang, Sumin [1 ]
Kim, Sun Wook [1 ]
机构
[1] ETRI, Artificial Intelligence Res Lab, Daejeon, South Korea
[2] SoftonNet, Technol Support Team, Seoul, South Korea
关键词
deep learning inference; edge computing; containerization; low-latency; resource configuration;
D O I
10.1109/ICTC52510.2021.9620870
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep learning inference requires tremendous amount of computation and typically is offloaded the cloud for execution. Recently, edge computing, which processes and stores data at the edge of the Internet closest to the mobile devices or sensors, has been considered as new computing paradigm. We have studied the performance of the deep neural network (DNN) inference service based on different configurations of resources assigned to a container. In this work, we measured and analyzed a real-world edge service on containerization platform. An edge service is named A!Eye, an application with various DNN inferences. The edge service has both CPU-friendly and GPU-friendly tasks. CPU tasks account for more than half of the latency of the edge service. Our analyses reveal interesting findings about running the DNN inference service on the container-based execution platform; (a) The latency of DNN inference-based edge services is affected by CPU-based operation performance. (b) Pinning CPUs can reduce the latency of an edge service. (c) In order to improve the performance of an edge service, it is very important to avoid PCIe bottleneck shared by resources like CPUs, GPUs and NICs.
引用
收藏
页码:1789 / 1793
页数:5
相关论文
共 50 条
  • [1] Performance Evaluation of Deep Learning Compilers for Edge Inference
    Verma, Gaurav
    Gupta, Yashi
    Malik, Abid M.
    Chapman, Barbara
    [J]. 2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2021, : 858 - 865
  • [2] Deep Learning Inference Service at Microsoft
    Soifer, Jonathan
    Li, Jason
    Li, Mingqin
    Zhu, Jeffrey
    Li, Yingnan
    He, Yuxiong
    [J]. PROCEEDINGS OF THE 2019 USENIX CONFERENCE ON OPERATIONAL MACHINE LEARNING, 2019, : 15 - 17
  • [3] Evaluating Approximate Inference in Bayesian Deep Learning
    Wilson, Andrew Gordon
    Lotfi, Sanae
    Vikram, Sharad
    Hoffman, Matthew D.
    Gal, Yarin
    Li, Yingzhen
    Pradier, Melanie F.
    Foong, Andrew
    Farquhar, Sebastian
    Izmailov, Pavel
    [J]. NEURIPS 2021 COMPETITIONS AND DEMONSTRATIONS TRACK, VOL 176, 2021, 176 : 113 - 124
  • [4] Optimized IoT Service Chain Implementation in Edge Cloud Platform: A Deep Learning Framework
    Pham, Chuan
    Nguyen, Duong Tuan
    Tran, Nguyen H.
    Nguyen, Kim Khoa
    Cheriet, Mohamed
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (01): : 538 - 551
  • [5] Provisioning Edge Inference as a Service via Online Learning
    Jin, Yibo
    Jiao, Lei
    Qian, Zhuzhong
    Zhang, Sheng
    Chen, Ning
    Lu, Sanglu
    Wang, Xiaoliang
    [J]. 2020 17TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON), 2020,
  • [6] Deep Learning Inference at the Edge for Mobile and Aerial Robotics
    Faniadis, Efstathios
    Amanatiadis, Angelos
    [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR 2020), 2020, : 334 - 340
  • [7] Multimodal Deep Learning with Boosted Trees for Edge Inference
    Chong, Penny
    Wynter, Laura
    Chaudhury, Bharathi
    [J]. 2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 99 - 108
  • [8] A Deep Learning Approach to Sensor Fusion Inference at the Edge
    Becnel, T.
    Gaillardon, P-E
    [J]. PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 1420 - 1425
  • [9] The Case for Hierarchical Deep Learning Inference at the Network Edge
    Al-Atat, Ghina
    Fresa, Andrea
    Behera, Adarsh Prasad
    Moothedath, Vishnu Narayanan
    Gross, James
    Champati, Jaya Prakash
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON NETWORKED AI SYSTEMS, NETAISYS 2023, 2023, : 13 - 18
  • [10] Evaluating Deep Learning Techniques for Natural Language Inference
    Eleftheriadis, Petros
    Perikos, Isidoros
    Hatzilygeroudis, Ioannis
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (04):