A reference architecture for serverless big data processing

被引:0
|
作者
Werner, Sebastian [1 ]
Tai, Stefan [1 ]
机构
[1] TU Berlin, ISE, Einsteinufer 17, D-10587 Berlin, Germany
关键词
Serverless data processing; Application platform co -design; Serverless reference architecture; Function as a Service; Software engineering; Cloud computing; FRAMEWORK; SUPPORT;
D O I
10.1016/j.future.2024.01.029
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Despite significant advances in data management systems in recent decades, the processing of big data at scale remains very challenging. While cloud computing has been well -accepted as a solution to address scalability needs, cloud configuration and operation complexity persist and often present themselves as entry barriers, especially for novice data analysts. Serverless computing and Function -as -a -Service (FaaS) platforms have been suggested to reduce such entry barriers by shifting configuration and operational responsibilities from the application developer to the FaaS platform provider. Naturally, "serverless data processing (SDP)'', that is, using FaaS for (big) data processing, has received increasing interest in recent years. However, FaaS platforms were never intended to support large data processing tasks primarily. SDP, therefore, manifests itself through workarounds and adaptations on the application level, addressing various quirks and limitations of the FaaS platforms in use for data processing needs. This, in turn, creates tensions between the platforms and the applications using them, again encouraging the constant (re -)design of both. Consequently, we present lessons learned from a series of application and platform re -designs that address these tensions, leading to the development of an SDP reference architecture and a platform instantiation and implementation thereof called CREW. Mitigating the tensions through the process of application platform codesign proves to reduce both entry barriers and costs significantly. In some experiments, CREW outperforms traditional, non -SDP big data processing frameworks by factors.
引用
收藏
页码:179 / 192
页数:14
相关论文
共 50 条
  • [1] Evaluating Serverless Architecture for Big Data Enterprise Applications
    Bhat, Aimer
    Park, Heeki
    Roy, Madhumonti
    [J]. 8TH IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, BDCAT 2021, 2021, : 1 - 8
  • [2] Serverless Big Data Processing using Matrix Multiplication as Example
    Werner, Sebastian
    Kuhlenkamp, Jeorn
    Klems, Markus
    Mueller, Johannes
    Tai, Stefan
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 358 - 365
  • [3] A Reference Architecture for Big Data Systems
    Sang, Go Muan
    Xu, Lai
    de Vrieze, Paul
    [J]. PROCEEDINGS OF 2016 10TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT & APPLICATIONS (SKIMA), 2016, : 370 - 375
  • [4] A Big Data Reference Architecture for Emergency Management
    Iglesias, Carlos A.
    Favenza, Alfredo
    Carrera, Alvaro
    [J]. INFORMATION, 2020, 11 (12) : 1 - 24
  • [5] Research on Big Data Reference Architecture Model
    Luo Xiaofeng
    Luo Jing
    [J]. 2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2020), 2020, : 205 - 209
  • [6] SPSC: Stream Processing Framework Atop Serverless Computing for Industrial Big Data
    Cai, Zinuo
    Chen, Zebin
    Chen, Xinglei
    Ma, Ruhui
    Guan, Haibing
    Buyya, Rajkumar
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024,
  • [7] Design of a Reference Architecture for Serverless IoT Systems
    Shah, Neel Pradip
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS (IEEE COINS 2021), 2021, : 59 - 64
  • [8] Serverless Architecture for Data Processing and Detecting Anomalies with the Mars Express MARSIS Instrument
    Pacios, David
    Vazquez-Poletti, Jose Luis
    Sanchez-Cano, Beatriz
    Moreno-Vozmediano, Rafael
    Schetakis, Nikolaos
    Vazquez, Luis
    Titov, Dmitrij V.
    [J]. ASTRONOMICAL JOURNAL, 2023, 166 (01):
  • [9] Simplifying Big Data Analytics Systems with a Reference Architecture
    Sang, Go Muan
    Xu, Lai
    de Vrieze, Paul
    [J]. COLLABORATION IN A DATA-RICH WORLD, 2017, 506 : 242 - 249
  • [10] NIST Big Data Reference Architecture for Analytics and Beyond
    Chang, Wo
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC' 17), 2017, : 3 - 3