DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads

被引:1
|
作者
Kim, Seah [1 ]
Kwon, Hyoukjun [2 ,3 ]
Song, Jinook [3 ]
Jo, Jihyuck [3 ]
Chen, Yu-Hsin [3 ]
Lai, Liangzhen [3 ]
Chandra, Vikas [3 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA USA
[2] UC Irvine, Irvine, CA 92697 USA
[3] Meta, Sunnyvale, CA 94089 USA
关键词
Scheduler; AR/VR; Multi-model ML; Hardware-Software Co-Design; ALGORITHM; PRECEDENCE; DEADLINES; TASKS;
D O I
10.1145/3623278.3624753
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone control involve dynamic behaviors in various granularity; task, model, and layers within a model. Such dynamic behaviors introduce new challenges to the system software in an ML system since the overall system load is not completely predictable, unlike traditional ML workloads. In addition, RTMM workloads require real-time processing, involve highly heterogeneous models, and target resource-constrained devices. Under such circumstances, developing an effective scheduler gains more importance to better utilize underlying hardware considering the unique characteristics of RTMM workloads. Therefore, we propose a new scheduler, DREAM, which effectively handles various dynamicity in RTMM workloads targeting multi-accelerator systems. DREAM quantifies the unique requirements for RTMM workloads and utilizes the quantified scores to drive scheduling decisions, considering the current system load and other inference jobs on different models and input frames. DREAM utilizes tunable parameters that provide fast and effective adaptivity to dynamic workload changes. In our evaluation of five scenarios of RTMM workload, DREAM reduces the overall UXCost, which is an equivalent metric of the energy-delay product (EDP) for RTMM defined in the paper, by 32.2% and 50.0% in the geometric mean (up to 80.8% and 97.6%) compared to state-of-the-art baselines, which shows the efficacy of our scheduling methodology.
引用
收藏
页码:73 / 86
页数:14
相关论文
共 50 条
  • [21] Real-time dynamic wrinkles
    Larboulette, C
    Cani, MP
    COMPUTER GRAPHICS INTERNATIONAL, PROCEEDINGS, 2004, : 522 - 525
  • [22] Real-time model checking of dynamic temporal consistency for multi-process of workflow
    Du Y.
    Fan Y.
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2010, 46 (02): : 185 - 191
  • [23] Task Splitting and Load Balancing of Dynamic Real-Time Workloads for Semi-Partitioned EDF
    Casini, Daniel
    Biondi, Alessandro
    Buttazzo, Giorgio Carlo
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (12) : 2168 - 2181
  • [24] ATLAS: Dynamic real-time multi-satellite planning
    ATLAS: Planification multi-satellite dynamique en temps réel
    1600, Lavoisier (30): : 1 - 2
  • [25] A Holistic Energy-Efficient Real-Time Scheduler for Mixed Stream and Batch Processing Workloads
    Maroulis, Stathis
    Zacheilas, Nikos
    Kalogeraki, Vana
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (12) : 2624 - 2635
  • [26] Real-time multi-model interpolation of range-varying acoustic propagation
    Chin, DC
    Biondo, AC
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOL I, 2002, : 207 - 212
  • [27] Potent Real-Time Recommendations Using Multi-Model Contextual Reinforcement Learning
    Kabra, Anubha
    Agarwal, Anu
    Parihar, Anil Singh
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 9 (02): : 581 - 593
  • [28] Accuracy of real-time multi-model ensemble forecasts for seasonal influenza in the US
    Reich, Nicholas G.
    McGowan, Craig J.
    Yamana, Teresa K.
    Tushar, Abhinav
    Ray, Evan L.
    Osthus, Dave
    Kandula, Sasikiran
    Brooks, Logan C.
    Crawford-Crudell, Willow
    Gibson, Graham Casey
    Moore, Evan
    Silva, Rebecca
    Biggerstaff, Matthew
    Johansson, Michael A.
    Rosenfeld, Roni
    Shaman, Jeffrey
    PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (11)
  • [29] Multi-Model and Learning-Based Framework for Real-Time Trajectory Prediction
    Benterki, Abdelmoudjib
    Judalet, Vincent
    Maaoui, Choubeila
    Boukhnifer, Moussa
    2020 28TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2020, : 776 - 781
  • [30] DPS: A Dynamic Procrastination Scheduler for Multi-core/Multi-processor Hard Real Time Systems
    Gawali, Shubhangi K.
    Raveendran, Biju K.
    2016 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2016, : 286 - 291