Pocket: Elastic Ephemeral Storage for Serverless Analytics

被引:0
|
作者
Klimovic, Ana [1 ]
Wang, Yawen [1 ]
Stuedi, Patrick [2 ]
Trivedi, Animesh [2 ]
Pfefferle, Jonas [2 ]
Kozyrakis, Christos [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] IBM Res, Armonk, NY USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Serverless computing is becoming increasingly popular, enabling users to quickly launch thousands of short-lived tasks in the cloud with high elasticity and fine-grain billing. These properties make serverless computing appealing for interactive data analytics. However exchanging intermediate data between execution stages in an analytics job is a key challenge as direct communication between serverless tasks is difficult. The natural approach is to store such ephemeral data in a remote data store. However, existing storage systems are not designed to meet the demands of serverless applications in terms of elasticity, performance, and cost. We present Pocket, an elastic, distributed data store that automatically scales to provide applications with desired performance at low cost. Pocket dynamically rightsizes resources across multiple dimensions (CPU cores, network bandwidth, storage capacity) and leverages multiple storage technologies to minimize cost while ensuring applications are not bottlenecked on I/O. We show that Pocket achieves similar performance to ElastiCache Redis for serverless analytics applications while reducing cost by almost 60%.
引用
收藏
页码:427 / 444
页数:18
相关论文
共 50 条
  • [31] Data-Driven Serverless Functions for Object Storage
    Sampe, Josep
    Sanchez-Artigas, Marc
    Garcia-Lopez, Pedro
    Paris, Gerard
    PROCEEDINGS OF THE 2017 INTERNATIONAL MIDDLEWARE CONFERENCE (MIDDLEWARE'17), 2017, : 121 - 133
  • [32] Lambada: Interactive Data Analytics on Cold Data Using Serverless Cloud Infrastructure
    Muller, Ingo
    Marroquin, Renato
    Alonso, Gustavo
    SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 115 - 130
  • [33] Canopus: A Paradigm Shift Towards Elastic Extreme-Scale Data Analytics on HPC Storage
    Lu, Tao
    Suchyta, Eric
    Pugmire, Dave
    Choi, Jong
    Klasky, Scott
    Liu, Qing
    Podhorszki, Norbert
    Ainsworth, Mark
    Wolf, Matthew
    2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 58 - 69
  • [34] Practical cloud storage auditing using serverless computing
    Fei CHEN
    Jianquan CAI
    Tao XIANG
    Xiaofeng LIAO
    Science China(Information Sciences), 2024, 67 (03) : 33 - 47
  • [35] Astra: Autonomous Serverless Analytics with Cost-Efficiency and QoS-Awareness
    Jarachanthan, Jananie
    Chen, Li
    Xu, Fei
    Li, Bo
    2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 756 - 765
  • [36] Smartpick: Workload Prediction for Serverless-enabled Scalable Data Analytics Systems
    Das Mohapatra, Anshuman
    Oh, Kwangsung
    PROCEEDINGS OF THE 24TH ACM/IFIP INTERNATIONAL MIDDLEWARE CONFERENCE, MIDDLEWARE 2023, 2023, : 29 - 42
  • [37] Practical cloud storage auditing using serverless computing
    Chen, Fei
    Cai, Jianquan
    Xiang, Tao
    Liao, Xiaofeng
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (03)
  • [38] Edge-assisted Adaptive Configuration for Serverless-based Video Analytics
    Wang, Ziyi
    Zhang, Songyu
    Cheng, Jing
    Wu, Zhixiong
    Cao, Zhen
    Cui, Yong
    2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 248 - 258
  • [39] ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning
    Gu, Diandian
    Zhao, Yihao
    Zhong, Yinmin
    Xiong, Yifan
    Han, Zhenhua
    Cheng, Peng
    Yang, Fan
    Huang, Gang
    Jin, Xin
    Liu, Xuanzhe
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, VOL 2, ASPLOS 2023, 2023, : 266 - 280
  • [40] LLAMA: A Heterogeneous & Serverless Framework for Auto-Tuning Video Analytics Pipelines
    Romero, Francisco
    Zhao, Mark
    Yadwadkar, Neeraja J.
    Kozyrakis, Christos
    PROCEEDINGS OF THE 2021 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '21), 2021, : 1 - 17