Processing and Analytics of Big Data Streams with Yahoo!S4

被引:24
|
作者
Xhafa, Fatos [1 ]
Naranjo, Victor [1 ]
Caballe, Santi [2 ]
机构
[1] Univ Politecn Cataluna, Barcelona, Spain
[2] Open Univ Catalonia, Barcelona, Spain
关键词
Big Data; Stream Processing; Parallel Processing; Data Mining; Yahoo!S4; Scalability; Global Flight Monitoring System;
D O I
10.1109/AINA.2015.194
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Many Internet-based applications generate huge data streams, which are known as Big Data Streams. Such applications comprise IoT-based monitoring systems, data analytics from monitoring online learning workspaces and MOOCs, global flight monitoring systems, etc. Differently from Big Data processing in which the data is available in databases, file systems, etc., before processing, in Big Data Streams the data stream is unbounded and it is to be processed as it becomes available. Besides the challenges of processing huge amount of data, the Big Data Stream processing adds further challenges of coping with scalability and high throughput to enable real time decision taking. While for Big Data processing the MapReduce framework has resulted successful, its batch mode processing shows limitations to process Big Data Streams. Therefore there have been proposed alternative frameworks such as Yahoo!S4, TwitterStorm, etc., to Big Data Stream processing. In this paper we implement and evaluate the Yahoo!S4 for Big Data Stream processing and exemplify through the Big Data Stream from global flight monitoring system.
引用
收藏
页码:263 / 270
页数:8
相关论文
共 50 条
  • [1] Fog Data Processing and Analytics for Agriculture IoT Data Streams
    Islam, Shahidul
    Jamwal, Sanjay
    Mir, Mahmood Hussain
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2022, 13 (03): : 718 - 734
  • [2] Big Data Processing and Analytics for Process Industries
    Sarnovsky, Martin
    2018 IEEE 16TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2018): DEDICATED TO THE MEMORY OF PIONEER OF ROBOTICS ANTAL (TONY) K. BEJCZY, 2018, : 14 - 14
  • [3] Big Data Analytics on High Velocity Streams: A Case Study
    Chardonnens, Thibaud
    Cudre-Mauroux, Philippe
    Grund, Martin
    Perroud, Benoit
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [4] Incremental Query Processing on Big Data Streams
    Fegaras, Leonidas
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (11) : 2998 - 3012
  • [5] Shared Execution Techniques for Business Data Analytics over Big Data Streams
    Uzunbaz, Serkan
    Aref, Walid G.
    PROCEEDINGS OF THE 32TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2020, 2020,
  • [6] A Survey on Big Data Processing Frameworks for Mobility Analytics
    Doulkeridis C.
    Vlachou A.
    Pelekis N.
    Theodoridis Y.
    SIGMOD Record, 2021, 50 (02): : 18 - 29
  • [7] A Survey on Big Data Processing Frameworks for Mobility Analytics
    Doulkeridis, Christos
    Vlachou, Akrivi
    Pelekis, Nikos
    Theodoridis, Yannis
    SIGMOD RECORD, 2021, 50 (02) : 18 - 30
  • [8] Big Data Analytics Using Graph Signal Processing
    Amin, Farhan
    Barukab, Omar M.
    Choi, Gyu Sang
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 489 - 502
  • [9] Boosting Heapsort Performance of Processing Big Data Streams
    Algemili, Usamah
    Alhudhaif, Adi
    SOUTHEASTCON 2016, 2016,
  • [10] An Efficient Framework of Data Mining and its Analytics on Massive Streams of Big Data Repositories
    Disha, D. N.
    Sowmya, B. J.
    Chetan
    Seema, S.
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING, VLSI, ELECTRICAL CIRCUITS AND ROBOTICS (DISCOVER), 2016, : 195 - 200