Lambada: Interactive Data Analytics on Cold Data Using Serverless Cloud Infrastructure

被引:60
|
作者
Muller, Ingo [1 ]
Marroquin, Renato [1 ]
Alonso, Gustavo [1 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
关键词
Serverless Computing; Serverless Functions; Cloud Computing; Interactive Analytics; Data Lake; Elasticity;
D O I
10.1145/3318464.3389758
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Serverless computing has recently attracted a lot of attention from research and industry due to its promise of ultimate elasticity and operational simplicity. However, there is no consensus yet on whether or not the approach is suitable for data processing. In this paper, we present Lambada, a serverless distributed data processing framework designed to explore how to perform data analytics on serverless computing. In our analysis, supported with extensive experiments, we show in which scenarios serverless makes sense from an economic and performance perspective. We address several important technical questions that need to be solved to support data analytics and present examples from several domains where serverless offers a cost and performance advantage over existing solutions.
引用
收藏
页码:115 / 130
页数:16
相关论文
共 50 条
  • [1] Serverless Data Analytics in the IBM Cloud
    Sampe, Josep
    Vernik, Gil
    Sanchez-Artigas, Marc
    Garcia-Lopez, Pedro
    [J]. MIDDLEWARE INDUSTRY'18: PROCEEDINGS OF THE 2018 ACM/IFIP/USENIX MIDDLEWARE CONFERENCE (INDUSTRIAL TRACK), 2018, : 1 - 8
  • [2] Enabling Interactive Analytics of Secure Data using Cloud Kotta
    Babuji, Yadu N.
    Chard, Kyle
    Duede, Eamon
    [J]. SCIENCECLOUD'17: PROCEEDINGS OF THE 8TH WORKSHOP ON SCIENTIFIC CLOUD COMPUTING, 2017, : 9 - 15
  • [3] Visual Analytics Framework for Cloud Infrastructure Data
    Kejariwal, Arun
    Lee, Winston
    Vallis, Owen
    Hochenbaum, Jordan
    Yan, Bryce
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 886 - 893
  • [4] Serverless Data Analytics with Flint
    Kim, Youngbin
    Lin, Jimmy
    [J]. PROCEEDINGS 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2018, : 451 - 455
  • [5] Addressing Big Data Analytics Issues and Challenges Using Cloud Infrastructure
    Moosa, Hanna
    Rana, Muhammad Ehsan
    [J]. 2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 61 - 65
  • [6] Data Analytics using Cloud Computing
    Maheshwari, Prakhar
    Singhal, Alankar
    Qadeer, Mohammed A.
    [J]. 2017 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2017, : 82 - 87
  • [7] Big Data Analytics using Public Cloud Infrastructure: Use cases and Cost Economics
    Deshmukh, Sanjay
    Sumeet, Shailja
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 782 - 784
  • [8] A Scalable Cloud Computing Infrastructure for Geospatial Data Analytics for Change Detection
    Jacobsen, Rune Hylsberg
    Jeppesen, Jacob Hoxbroe
    Laursen, Kim Fibiger
    Skovsgaard, John
    Jensen, Henrik Nymann
    Toftegaard, Thomas Skjodeberg
    [J]. 2017 EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2017, : 403 - 410
  • [9] Big Data Infrastructure for Aviation Data Analytics
    Murugan, Anandavel
    Mylaraswamy, Dinkar
    Xu, Brian
    Dietrich, Paul
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING IN EMERGING MARKETS (CCEM), 2014, : 87 - 92
  • [10] Cloud-based interactive analytics for terabytes of genomic variants data
    Pan, Cuiping
    McInnes, Gregory
    Deflaux, Nicole
    Snyder, Michael
    Bingham, Jonathan
    Datta, Somalee
    Tsao, Philip S.
    [J]. BIOINFORMATICS, 2017, 33 (23) : 3709 - 3715