Pocket: Elastic Ephemeral Storage for Serverless Analytics

被引:0
|
作者
Klimovic, Ana [1 ]
Wang, Yawen [1 ]
Stuedi, Patrick [2 ]
Trivedi, Animesh [2 ]
Pfefferle, Jonas [2 ]
Kozyrakis, Christos [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] IBM Res, Armonk, NY USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Serverless computing is becoming increasingly popular, enabling users to quickly launch thousands of short-lived tasks in the cloud with high elasticity and fine-grain billing. These properties make serverless computing appealing for interactive data analytics. However exchanging intermediate data between execution stages in an analytics job is a key challenge as direct communication between serverless tasks is difficult. The natural approach is to store such ephemeral data in a remote data store. However, existing storage systems are not designed to meet the demands of serverless applications in terms of elasticity, performance, and cost. We present Pocket, an elastic, distributed data store that automatically scales to provide applications with desired performance at low cost. Pocket dynamically rightsizes resources across multiple dimensions (CPU cores, network bandwidth, storage capacity) and leverages multiple storage technologies to minimize cost while ensuring applications are not bottlenecked on I/O. We show that Pocket achieves similar performance to ElastiCache Redis for serverless analytics applications while reducing cost by almost 60%.
引用
收藏
页码:427 / 444
页数:18
相关论文
共 50 条
  • [21] PolicyCLOUD: A prototype of a cloud serverless ecosystem for policy analytics
    Biran, Ofer
    Feder, Oshrit
    Moatti, Yosef
    Kiourtis, Athanasios
    Kyriazis, Dimosthenis
    Manias, George
    Mavrogiorgou, Argyro
    Sgouros, Nikitas M.
    Barata, Martim T.
    Oldani, Isabella
    Sanguino, Maria A.
    Kranas, Pavlos
    Baroni, Samuele
    DATA & POLICY, 2022, 4
  • [22] Commentary: Learning Analytics: Ephemeral Rhetoric or Valuable Approach?
    Parslow, Graham R.
    BIOCHEMISTRY AND MOLECULAR BIOLOGY EDUCATION, 2014, 42 (02) : 178 - 178
  • [23] INFINICACHE: Exploiting Ephemeral Serverless Functions to Build a Cost-Effective Memory Cache
    Wang, Ao
    Zhang, Jingyuan
    Ma, Xiaolong
    Anwar, Ali
    Rupprecht, Lukas
    Skourtis, Dimitrios
    Tarasov, Vasily
    Yan, Feng
    Cheng, Yue
    PROCEEDINGS OF THE 18TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, 2020, : 267 - 281
  • [24] SnapStore: A Snapshot Storage System for Serverless Systems
    Panda, Abhisek
    Sarangi, Smruti R.
    PROCEEDINGS OF THE 24TH ACM/IFIP INTERNATIONAL MIDDLEWARE CONFERENCE, MIDDLEWARE 2023, 2023, : 261 - 274
  • [25] StructMesh: A storage framework for serverless computing continuum
    Carrizales-Espinoza, Diana
    Sanchez-Gallegos, DanteD.
    Gonzalez-Compean, J. L.
    Carretero, Jesus
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 159 : 353 - 369
  • [26] A Serverless Real-Time Data Analytics Platform for Edge Computing
    Nastic, Stefan
    Rausch, Thomas
    Scekic, Ognjen
    Dustdar, Schahram
    Gusev, Marjan
    Koteska, Bojana
    Kostoska, Magdalena
    Jakimovski, Boro
    Ristov, Sasko
    Prodan, Radu
    IEEE INTERNET COMPUTING, 2017, 21 (04) : 64 - 71
  • [27] Pocket Data Analytics: An Introduction to Practical Applications of Course Analytics in Online Learning
    Vivolo, John
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ANALYTICS DRIVEN SOLUTIONS (ICAS 2014), 2014, : 102 - 110
  • [28] Cherry: A Distributed Task-Aware Shuffle Service for Serverless Analytics
    Nikitas, Nikolaos
    Konstantinou, Ioannis
    Kalogeraki, Vana
    Koziris, Nectarios
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 120 - 130
  • [29] SeRSS: a storage mesh architecture to build serverless reliable storage services
    Carrizales-Espinoza, Diana
    Sanchez-Gallegos, Dante D.
    Gonzalez-Compean, J. L.
    Carretero, Jesus
    Marcelin-Jimenez, Ricardo
    30TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2022), 2022, : 88 - 91
  • [30] Proactive Elastic Scheduling for Serverless Ensemble Inference Services
    He, Shikun
    Feng, Binbin
    Ding, Zhijun
    2024 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, ICWS 2024, 2024, : 1025 - 1035