Serving DNNs like Clockwork: Performance Predictability from the Bottom Up

被引:0
|
作者
Gujarati, Arpan [1 ]
Karimi, Reza [2 ]
Alzayat, Safya [1 ]
Hao, Wei [1 ]
Kaufmann, Antoine [1 ]
Vigfusson, Ymir [2 ]
Mace, Jonathan [1 ]
机构
[1] Max Planck Inst Software Syst, Saarbrucken, Germany
[2] Emory Univ, Atlanta, GA 30322 USA
基金
美国国家科学基金会;
关键词
TAIL;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Machine learning inference is becoming a core building block for interactive web applications. As a result, the underlying model serving systems on which these applications depend must consistently meet low latency targets. Existing model serving architectures use well-known reactive techniques to alleviate common-case sources of latency, but cannot effectively curtail tail latency caused by unpredictable execution times. Yet the underlying execution times are not fundamentally unpredictable-on the contrary we observe that inference using Deep Neural Network (DNN) models has deterministic performance. Here, starting with the predictable execution times of individual DNN inferences, we adopt a principled design methodology to successively build a fully distributed model serving system that achieves predictable end-to-end performance. We evaluate our implementation, Clockwork, using production trace workloads, and show that Clockwork can support thousands of models while simultaneously meeting 100 ms latency targets for 99.9999% of requests. We further demonstrate that Clockwork exploits predictable execution times to achieve tight request-level service-level objectives (SLOs) as well as a high degree of request-level performance isolation.
引用
收藏
页码:443 / 462
页数:20
相关论文
共 50 条
  • [21] Topology from the bottom up
    Kamien, RD
    SCIENCE, 2003, 299 (5613) : 1671 - 1673
  • [22] Renewables from the bottom up
    Jari Lyytimäki
    Nature, 2016, 539 : 168 - 168
  • [23] Fear from the bottom up
    Stujenske, Joseph M.
    Likhtik, Ekaterina
    NATURE NEUROSCIENCE, 2017, 20 (06) : 765 - 767
  • [24] Building from the bottom up
    Krull, CE
    Krumlauf, R
    NATURE CELL BIOLOGY, 2001, 3 (06) : E138 - E139
  • [25] Nanoelectronics from the bottom up
    Lu, Wei
    Lieber, CharLes M.
    NATURE MATERIALS, 2007, 6 (11) : 841 - 850
  • [26] THE FLIGHT FROM THE BOTTOM UP
    BARCLAY, JR
    JOURNAL OF LEARNING DISABILITIES, 1981, 14 (07) : 388 - 390
  • [27] Culture from the Bottom Up
    Atkinson, Dwight
    Sohn, Jija
    TESOL QUARTERLY, 2013, 47 (04) : 669 - 693
  • [28] Fear from the bottom up
    Joseph M Stujenske
    Ekaterina Likhtik
    Nature Neuroscience, 2017, 20 : 765 - 767
  • [29] Buckling Up from the Bottom
    Matejcic, Marija
    Trepat, Xavier
    DEVELOPMENTAL CELL, 2020, 54 (05) : 569 - 571
  • [30] REBUILDING From the Bottom Up
    Kean, Sam
    SCIENCE, 2010, 327 (5966) : 638 - 639