Ditto: Efficient Serverless Analytics with Elastic Parallelism

被引:5
|
作者
Jin, Chao [1 ]
Zhang, Zili [1 ]
Xiang, Xingyu [1 ]
Zou, Songyun [1 ]
Huang, Gang [1 ]
Liu, Xuanzhe [1 ]
Jin, Xin [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
关键词
Serverless computing; data analytics; task scheduling;
D O I
10.1145/3603269.3604816
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Serverless computing provides fine-grained resource elasticity for data analytics-a job can flexibly scale its resources for each stage, instead of sticking to a fixed pool of resources throughout its lifetime. Due to different data dependencies and different shuffling overheads caused by intra- and inter-server communication, the best degree of parallelism (DoP) for each stage varies based on runtime conditions. We present Ditto, a job scheduler for serverless analytics that leverages fine-grained resource elasticity to optimize for job completion time (JCT) and cost. The key idea of Ditto is to use a new scheduling granularity-stage group-to decouple parallelism configuration from function placement. Ditto bundles stages into stage groups based on their data dependencies and IO characteristics. It exploits the parallelized time characteristics of the stages to determine the parallelism configuration, and prioritizes the placement of stage groups with large shuffling traffic, so that the stages in these groups can leverage zero-copy intra-server communication for efficient shuffling. We build a system prototype of Ditto and evaluate it with a variety of benchmarking workloads. Experimental results show that Ditto outperforms existing solutions by up to 2.5x on JCT and up to 1.8x on cost.
引用
收藏
页码:406 / 419
页数:14
相关论文
共 50 条
  • [21] Serverless Empowered Video Analytics for Ubiquitous Networked Cameras
    Zhang, Miao
    Wang, Fangxin
    Zhu, Yifei
    Liu, Jiangchuan
    Li, Bo
    IEEE NETWORK, 2021, 35 (06): : 186 - 193
  • [22] PolicyCLOUD: A prototype of a cloud serverless ecosystem for policy analytics
    Biran, Ofer
    Feder, Oshrit
    Moatti, Yosef
    Kiourtis, Athanasios
    Kyriazis, Dimosthenis
    Manias, George
    Mavrogiorgou, Argyro
    Sgouros, Nikitas M.
    Barata, Martim T.
    Oldani, Isabella
    Sanguino, Maria A.
    Kranas, Pavlos
    Baroni, Samuele
    DATA & POLICY, 2022, 4
  • [23] FaaSBatch: Boosting Serverless Efficiency With In-Container Parallelism and Resource Multiplexing
    Wu, Zhaorui
    Deng, Yuhui
    Zhou, Yi
    Li, Jie
    Pang, Shujie
    Qin, Xiao
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (04) : 1071 - 1085
  • [24] Pipeline Parallelism With Elastic Averaging
    Jang, Bongwon
    Yoo, In-Chul
    Yook, Dongsuk
    IEEE ACCESS, 2024, 12 : 5477 - 5489
  • [25] Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings
    Chen, Qian
    Wang, Wen
    Zhang, Qinglin
    Zheng, Siqi
    Deng, Chong
    Yu, Hai
    Liu, Jiaqing
    Ma, Yukun
    Zhang, Chong
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 5868 - 5875
  • [26] Pipeline Parallelism With Elastic Averaging
    Jang, Bongwon
    Yoo, In-Chul
    Yook, Dongsuk
    IEEE Access, 2024, 12 : 5477 - 5489
  • [27] FnSched: An Efficient Scheduler for Serverless Functions
    Suresh, Amoghvarsha
    Gandhi, Anshul
    PROCEEDINGS OF THE 2019 FIFTH INTERNATIONAL WORKSHOP ON SERVERLESS COMPUTING (WOSC '19), 2019, : 19 - 24
  • [28] Netherite: Efficient Execution of Serverless Workflows
    Burckhardt, Sebastian
    Chandramouli, Badrish
    Gillum, Chris
    Justo, David
    Kallas, Konstantinos
    McMahon, Connor
    Meiklejohn, Christopher S.
    Zhu, Xiangfeng
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (08): : 1591 - 1604
  • [29] Netherite: efficient execution of serverless workflows
    Burckhardt, Sebastian
    Chandramouli, Badrish
    Gillum, Chris
    Justo, David
    Kallas, Konstantinos
    Mcmahon, Connor
    Meiklejohn, Christopher S.
    Zhu, Xiangfeng
    VLDB JOURNAL, 2025, 34 (02):
  • [30] Energy Efficient Scheduling for Serverless Systems
    Tsenos, Michail
    Peri, Aristotelis
    Kalogeraki, Vana
    2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS, ACSOS, 2023, : 27 - 36