Lambda Architecture for Cost-effective Batch and Speed Big Data processing

被引:0
|
作者
Kiran, Mariam [1 ]
Murphy, Peter [2 ]
Monga, Inder [2 ]
Dugan, Jon [2 ]
Baveja, Sartaj Singh [3 ]
机构
[1] Univ Bradford, Sch Comp Sci, Bradford BD7 1DP, W Yorkshire, England
[2] Energy Sci Network ESnet, Washington, DC USA
[3] Netaji Subhas Inst Technol, New Delhi, India
基金
英国工程与自然科学研究理事会;
关键词
big data processing; lambda architecture; Amazon EC2; sensor data analysis;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sensor and smart phone technologies present opportunities for data explosion, streaming and collecting from heterogeneous devices every second. Analyzing these large datasets can unlock multiple behaviors previously unknown, and help optimize approaches to city wide applications or societal use cases. However, collecting and handling of these massive datasets presents challenges in how to perform optimized online data analysis 'on-the-fly', as current approaches are often limited by capability, expense and resources. This presents a need for developing new methods for data management particularly using public clouds to minimize cost, network resources and on-demand availability. This paper presents an implementation of the lambda architecture design pattern to construct a data-handling backend on Amazon EC2, providing high throughput, dense and intense data demand delivered as services, minimizing the cost of the network maintenance. This paper combines ideas from database management, cost models, query management and cloud computing to present a general architecture that could be applied in any given scenario where affordable online data processing of Big Datasets is needed. The results are presented with a case study of processing router sensor data on the current ESnet network data as a working example of the approach. The results showcase a reduction in cost and argue benefits for performing online analysis and anomaly detection for sensor data.
引用
收藏
页码:2785 / 2792
页数:8
相关论文
共 50 条
  • [1] The VADA Architecture for Cost-Effective Data Wrangling
    Konstantinou, Nikolaos
    Koehler, Martin
    Abel, Edward
    Civili, Cristina
    Neumayr, Bernd
    Sallinger, Emanuel
    Fernandes, Alvaro A. A.
    Gottlob, Georg
    Keane, John A.
    Libkin, Leonid
    Paton, Norman W.
    SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 1599 - 1602
  • [2] Enabling Cost-effective Data Processing with Smart SSD
    Kang, Yangwook
    Kee, Yang-suk
    Miller, Ethan L.
    Park, Chanik
    2013 IEEE 29TH SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST), 2013,
  • [3] Towards Cost-Effective Cloud Downloading with Tencent Big Data
    Li, Zhen-Hua
    Liu, Gang
    Ji, Zhi-Yuan
    Zimmermann, Roger
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (06) : 1163 - 1174
  • [4] Towards Cost-Effective Cloud Downloading with Tencent Big Data
    Zhen-Hua Li
    Gang Liu
    Zhi-Yuan Ji
    Roger Zimmermann
    Journal of Computer Science and Technology, 2015, 30 : 1163 - 1174
  • [5] Cost-effective clustered architecture
    Canal, Ramon
    Parcerisa, Joan-Manuel
    Gonzalez, Antonio
    Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT, 1999, : 160 - 168
  • [6] A Cost-Effective Hybrid Cloud Resource Scaling Framework for Batch Processing Services
    Zhang, Qinzhi
    Pan, Li
    Liu, Shijun
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2025, 12 (01): : 476 - 487
  • [7] Cost-Effective Data Partition for Distributed Stream Processing System
    Wang, Xiaotong
    Fang, Junhua
    Li, Yuming
    Zhang, Rong
    Zhou, Aoying
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), PT II, 2017, 10178 : 623 - 635
  • [8] Cost-effective PVD coatings in batch systems
    Engers, B
    Bauer, HU
    SURFACE & COATINGS TECHNOLOGY, 1999, 116 : 705 - 710
  • [9] A cost-effective morphological filter architecture
    Ong, S
    Sunwoo, MH
    CAMP'97 - FOURTH IEEE INTERNATIONAL WORKSHOP ON COMPUTER ARCHITECTURE FOR MACHINE PERCEPTION, PROCEEDINGS, 1997, : 285 - 289
  • [10] Architecture of Geospatial Big-Data Batch Processing Model Based on Hadoop
    Kim, Sang-Su
    Yu, Sung-Hwan
    2015 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC), 2015, : 964 - 966