An Event-Driven Serverless ETL Pipeline on AWS

Cited by: 7
Authors
Pogiatzis, Antreas [1]
Samakovitis, Georgios [1]
Affiliations
[1] Univ Greenwich, Sch Comp & Math Sci, Old Royal Naval Coll, Pk Row, London SE10 9LS, England
Source
APPLIED SCIENCES-BASEL | 2021 / Vol. 11 / Issue 01
Keywords
serverless; FaaS; event-driven; distributed; AWS; ETL; architecture
DOI
10.3390/app11010191
Chinese Library Classification (CLC) number
O6 [Chemistry]
Subject classification code
0703
Abstract
This work presents a serverless architecture for an event-driven Extract, Transform, and Load (ETL) pipeline and evaluates its performance over a range of dataflow tasks of varying frequency, velocity, and payload size. We design an experiment using generated tabular data across varying data volumes, event frequencies, and processing power in order to measure: (i) the consistency of pipeline executions; (ii) the reliability of data delivery; (iii) the maximum payload size per pipeline; and (iv) economic scalability (the cost of chargeable tasks). We run 92 parameterised experiments on a simple AWS architecture, avoiding any AWS-enhanced platform features, to allow for an unbiased assessment of our model's performance. Our results indicate that our reference architecture can achieve time-consistent data processing of event payloads of more than 100 MB, with a throughput of 750 KB/s across four event frequencies. We also observe that, although using an SQS queue for data transfer enables easy concurrency control and data slicing, it becomes a bottleneck for large event payloads. Finally, we develop and discuss a candidate pricing model for usage of our reference architecture.
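The abstract mentions that routing data through a queue allows data slicing. The paper's own implementation is not reproduced in this record, so the following is only a minimal sketch of the general idea, under stated assumptions: tabular rows are CSV-encoded and grouped into chunks that each stay under a per-message size cap. The function name `slice_rows` and the constant `MAX_MESSAGE_BYTES` are hypothetical, and the 256 KB default is an illustrative value rather than a figure taken from the abstract; the actual queue limit should be checked against current AWS documentation.

```python
import csv
import io
from typing import Iterable, Iterator, List

# Assumption: illustrative per-message size cap in bytes (not from the abstract).
MAX_MESSAGE_BYTES = 256 * 1024


def slice_rows(
    rows: Iterable[List[str]], max_bytes: int = MAX_MESSAGE_BYTES
) -> Iterator[str]:
    """Yield CSV-encoded chunks whose UTF-8 size never exceeds max_bytes."""
    chunk: List[str] = []
    size = 0
    for row in rows:
        # Encode one row as a single CSV line so its byte size can be measured.
        buf = io.StringIO()
        csv.writer(buf, lineterminator="\n").writerow(row)
        line = buf.getvalue()
        line_bytes = len(line.encode("utf-8"))
        if line_bytes > max_bytes:
            raise ValueError("a single row exceeds the message size cap")
        # Close the current chunk before it would overflow the cap.
        if chunk and size + line_bytes > max_bytes:
            yield "".join(chunk)
            chunk, size = [], 0
        chunk.append(line)
        size += line_bytes
    if chunk:
        yield "".join(chunk)
```

Each yielded string could then be sent as one queue message and processed independently by a downstream function. Because the number of messages grows with payload size, a sketch like this also hints at why a queue can become a bottleneck for large payloads, as the abstract observes.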
Pages: 1 / 13
Page count: 13
Related papers
50 in total
  • [1] Formalizing Event-Driven Behavior of Serverless Applications
    Obetz, Matthew
    Das, Anirban
    Castiglia, Timothy
    Patterson, Stacy
    Milanova, Ana
    SERVICE-ORIENTED AND CLOUD COMPUTING (ESOCC 2020), 2020, 12054 : 19 - 29
  • [2] An Event-Driven Approach to Serverless Seismic Imaging in the Cloud
    Witte, Philipp A.
    Louboutin, Mathias
    Modzelewski, Henryk
    Jones, Charles
    Selvage, James
    Herrmann, Felix J.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (09) : 2032 - 2049
  • [3] Event-Driven Serverless Pipelines for Video Coding and Quality Metrics
    Wilmer Moina-Rivera
    Miguel Garcia-Pineda
    Jose M. Claver
    Juan Gutiérrez-Aguado
    Journal of Grid Computing, 2023, 21
  • [4] Event-Driven Serverless Pipelines for Video Coding and Quality Metrics
    Moina-Rivera, Wilmer
    Garcia-Pineda, Miguel
    Claver, Jose M.
    Gutierrez-Aguado, Juan
    JOURNAL OF GRID COMPUTING, 2023, 21 (02)
  • [5] On-premises Serverless Computing for Event-Driven Data Processing Applications
    Perez, Alfonso
    Risco, Sebastian
    Naranjo, Diana Maria
    Caballer, Miguel
    Molto, German
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019 : 414 - 421
  • [6] A serverless gateway for event-driven machine learning inference in multiple clouds
    Naranjo, Diana M.
    Risco, Sebastian
    Molto, German
    Blanquer, Ignacio
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (18)
  • [7] NMSTREAM: A SCALABLE EVENT-DRIVEN ETL FRAMEWORK FOR PROCESSING HETEROGENEOUS STREAMING DATA
    Xiao, Fei
    Li, Chengming
    Wu, Zheng
    Wu, Yinghao
    ISPRS TC IV MID-TERM SYMPOSIUM 3D SPATIAL INFORMATION SCIENCE - THE ENGINE OF CHANGE, 2018, 4-4 : 243 - 246
  • [8] Event-driven Image Sensor Application : Event-driven Image Segmentation
    Darwish, Amani
    Abbass, Hassan
    Fesquet, Laurent
    Sicard, Gilles
    2017 3RD INTERNATIONAL CONFERENCE ON EVENT-BASED CONTROL, COMMUNICATION AND SIGNAL PROCESSING (EBCCSP), 2017
  • [9] A UML Profile for Multi-Cloud Service Configuration (UMLPMSC) in Event-driven Serverless Applications
    Samea, Fatima
    Azam, Farooque
    Anwar, Muhammad Waseem
    Khan, Mehreen
    Rashid, Muhammad
    2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE AND COMPUTER APPLICATIONS (ICSCA 2019), 2019 : 431 - 435
  • [10] Event-driven RBAC
    Bonatti, Piero
    Galdi, Clemente
    Torres, Davide
    JOURNAL OF COMPUTER SECURITY, 2015, 23 (06) : 709 - 757