Scrooge: A Cost-Effective Deep Learning Inference System

被引:16
|
作者
Hu, Yitao [1 ]
Ghosh, Rajrup [1 ]
Govindan, Ramesh [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
关键词
Cloud computing; deep learning inference; auto-scaling;
D O I
10.1145/3472883.3486993
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Advances in deep learning (DL) have prompted the development of cloud-hosted DL-based media applications that process video and audio streams in real-time. Such applications must satisfy throughput and latency objectives and adapt to novel types of dynamics, while incurring minimal cost. Scrooge, a system that provides media applications as a service, achieves these objectives by packing computations efficiently into GPU-equipped cloud VMs, using an optimization formulation to find the lowest cost VM allocations that meet the performance objectives, and rapidly reacting to variations in input complexity (e.g., changes in participants in a video). Experiments show that Scrooge can save serving cost by 16-32% (which translate to tens of thousands of dollars per year) relative to the state-of-the-art while achieving latency objectives for over 98% under dynamic workloads.
引用
收藏
页码:624 / 638
页数:15
相关论文
共 50 条
  • [1] Cost-Effective Learning for Cost-Effective Care?
    Walsh, Kieran
    [J]. ACADEMIC MEDICINE, 2011, 86 (12) : 1485 - 1486
  • [2] Cost-Effective Active Learning for Deep Image Classification
    Wang, Keze
    Zhang, Dongyu
    Li, Ya
    Zhang, Ruimao
    Lin, Liang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (12) : 2591 - 2600
  • [3] The use of deep learning for a cost-effective tunnel maintenance
    Schneider, Oliver
    Prokopova, Alzbeta
    Modetta, Flavio
    Petschen, Veronika
    [J]. GALLERIE E GRANDI OPERE SOTTERRANEE, 2019, (129): : 39 - 42
  • [4] A Cost-Effective Person-Following System for Assistive Unmanned Vehicles with Deep Learning at the Edge
    Boschi, Anna
    Salvetti, Francesco
    Mazzia, Vittorio
    Chiaberge, Marcello
    [J]. MACHINES, 2020, 8 (03)
  • [5] Cost-effective ensemble models selection using deep reinforcement learning
    Birman, Yoni
    Hindi, Shaked
    Katz, Gilad
    Shabtai, Asaf
    [J]. INFORMATION FUSION, 2022, 77 : 133 - 148
  • [6] Cost-Effective and Interpretable Job Skill Recommendation with Deep Reinforcement Learning
    Sun, Ying
    Zhuang, Fuzhen
    Zhu, Hengshu
    He, Qing
    Xiong, Hui
    [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 3827 - 3838
  • [7] Cost-effective mechanism for environmental toxic reduction using deep learning
    Jun, He
    [J]. SUSTAINABLE ENERGY TECHNOLOGIES AND ASSESSMENTS, 2022, 52
  • [8] Cost-Effective Testing of a Deep Learning Model through Input Reduction
    Zhou, Jianyi
    Li, Feng
    Dong, Jinhao
    Zhang, Hongyu
    Hao, Dan
    [J]. 2020 IEEE 31ST INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2020), 2020, : 289 - 300
  • [9] Cost-Effective SCC for Deep Foundations
    Nehdi, Moncef
    El-Chabib, Hassan
    El Naggar, Hesham
    [J]. Concrete International, 2003, 25 (03) : 95 - 103
  • [10] Automated, Cost-Effective Optical System for Accelerated Antimicrobial Susceptibility Testing (AST) Using Deep Learning
    Brown, Calvin
    Tseng, Derek
    Larkin, Paige M. K.
    Realegeno, Susan
    Mortimer, Leanne
    Subramonian, Arjun
    Di Carlo, Dino
    Garner, Omai B.
    Ozcan, Aydogan
    [J]. ACS PHOTONICS, 2020, 7 (09): : 2527 - 2538