Big Data Streaming Platforms to Support Real-time Analytics

被引:1
|
作者
Fernandes, Eliana [1 ]
Salgado, Ana Carolina [2 ]
Bernardino, Jorge [1 ,3 ]
机构
[1] Polytech Coimbra, Quinta Nora, ISEC, Rua Pedro Nunes, P-3030199 Coimbra, Portugal
[2] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
[3] Univ Coimbra CISUC, Ctr Informat & Syst, Coimbra, Portugal
关键词
Streaming; Real-time Analytics; Big Data; Fault-Tolerance;
D O I
10.5220/0009817304260433
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In recent years data has grown exponentially due to the evolution of technology. The data flow circulates in a very fast and continuous way, so it must be processed in real time. Therefore, several big data streaming platforms have emerged for processing large amounts of data. Nowadays, companies have difficulties in choosing the platform that best suits their needs. In addition, the information about the platforms is scattered and sometimes omitted, making it difficult for the company to choose the right platform. This work focuses on helping companies or organizations to choose a big data streaming platform to analyze and process their data flow. We provide a description of the most popular platforms, such as: Apache Flink, Apache Kafka, Apache Samza, Apache Spark and Apache Storm. To strengthen the knowledge about these platforms, we also approached their architectures, advantages and limitations. Finally, a comparison among big data streaming platforms will be provided, using as attributes the characteristics that companies usually most need.
引用
收藏
页码:426 / 433
页数:8
相关论文
共 50 条
  • [1] Automated Real-Time Analysis of Streaming Big and Dense Data on Reconfigurable Platforms
    Rouhani, Bita Darvish
    Mirhoseini, Azalia
    Songhori, Ebrahim M.
    Koushanfar, Farinaz
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2016, 10 (01)
  • [2] The growing role of integrated and insightful big and real-time data analytics platforms
    Ranganathan, Indrakumari
    Thangamuthu, Poongodi
    Palanimuthu, Suresh
    Balusamy, Balamurugan
    [J]. DIGITAL TWIN PARADIGM FOR SMARTER SYSTEMS AND ENVIRONMENTS: THE INDUSTRY USE CASES, 2020, 117 : 165 - 186
  • [3] Mapping the Big Data Landscape: Technologies, Platforms and Paradigms for Real-Time Analytics of Data Streams
    Dubuc, Timothee
    Stahl, Frederic
    Roesch, Etienne B.
    [J]. IEEE ACCESS, 2021, 9 : 15351 - 15374
  • [4] Real-time processing of streaming big data
    Safaei, Ali A.
    [J]. REAL-TIME SYSTEMS, 2017, 53 (01) : 1 - 44
  • [5] Real-time processing of streaming big data
    Ali A. Safaei
    [J]. Real-Time Systems, 2017, 53 : 1 - 44
  • [6] Using Big Data and Real-Time Analytics to Support Smart City Initiatives
    Souza, Arthur
    Figueredo, Mickael
    Cacho, Nelio
    Araujo, Daniel
    Prolo, Carlos A.
    [J]. IFAC PAPERSONLINE, 2016, 49 (30): : 257 - 262
  • [7] Real-Time Big Data Analytics: Applications and Challenges
    Mohamed, Nader
    Al-Jaroodi, Jameela
    [J]. 2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2014, : 305 - 310
  • [8] Developing a Real-time Data Analytics Framework For Twitter Streaming Data
    Yadranjiaghdam, Babak
    Yasrobi, Seyedfaraz
    Tabrizi, Nasseh
    [J]. 2017 IEEE 6TH INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS 2017), 2017, : 329 - 336
  • [9] Real-time streaming mobility analytics
    Garzo, Andras
    Benczur, Andras A.
    Sidlo, Csaba Istvan
    Tahara, Daniel
    Wyatt, Erik Francis
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [10] A Methodology of Real-Time Data Fusion for Localized Big Data Analytics
    Jabbar, Sohail
    Malik, Kaleem R.
    Ahmad, Mudassar
    Aldabbas, Omar
    Asif, Muhammad
    Khalid, Shehzad
    Han, Kijun
    Ahmed, Syed Hassan
    [J]. IEEE ACCESS, 2018, 6 : 24510 - 24520