Declarative Data Serving: The Future of Machine Learning Inference on the Edge

Cited by: 2
Authors
Shaowang, Ted [1]
Jain, Nilesh [2]
Matthews, Dennis D. [2]
Krishnan, Sanjay [1]
Affiliations
[1] Univ Chicago, Chicago, IL 60637 USA
[2] Intel Corp, Mountain View, CA USA
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021, Vol. 14, No. 11
Keywords
CLOUD; SYSTEMS;
DOI
10.14778/3476249.3476302
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Recent advances in computer architecture and networking have ushered in a new age of edge computing, where computation is placed close to the point of data collection to facilitate low-latency decision making. As the complexity of such deployments grows into networks of interconnected edge devices, getting the necessary data to be in "the right place at the right time" can become a challenge. We envision a future of edge analytics where data flows between edge nodes are declaratively configured through high-level constraints. Using machine learning model-serving as a prototypical task, we illustrate how the heterogeneity and specialization of edge devices can lead to complex, task-specific communication patterns even in relatively simple situations. Without a declarative framework, managing this complexity will be challenging for developers and will lead to brittle systems. We conclude with a research vision for the database community that brings our perspective to the emergent area of edge computing.
Pages: 2555-2562
Page count: 8