Big Data Processing: Batch-based processing and stream-based processing

被引:3
|
作者
Benjelloun, Sarah [1 ]
El Aissi, Mohamed El Mehdi [1 ]
Loukili, Yassine [1 ]
Lakhrissi, Younes [1 ]
Ben Ali, Safae Elhaj [1 ]
Chougrad, Hiba [1 ]
El Boushaki, Abdessamad [1 ]
机构
[1] Sidi Mohamed Ben Abdellah Univ, SIGER Lab, Fes, Morocco
关键词
Big Data; Data Source; Data Insights; Batch-based Processing; Stream-based Processing; DATA ANALYTICS; CHALLENGES;
D O I
10.1109/icds50568.2020.9268684
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the exponential increase of the data amount in the past years, data analytics and data processing became essential to any organization. As Moore's law has been exceeded since several years ago, the excessive data hides indeed highly useful information. The real challenge is to successfully extract the information using an effective process and with a reasonable cost. Therefore, various processing techniques have emerged. Indeed, big data processing methods can be classified into several types like batch based, stream based, Graph based, DAG based, interactive based and visual based. All data processing techniques follow the same cycle: data collection, data preparation, data input, processing, data output/interpretation and data storage. Although having this similarity, these approaches have certainly different use cases, architectures and tools. This paper focuses on two types, namely: Batch-based processing and stream-based processing. After defining these two approaches, a comparative study is conducted and some key features are highlighted.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Implementation of a Distributed Processing Engine for Spatial Big-Data Processing based on Batch and Stream
    Kim, Sang-Su
    Song, Kwaun-Sik
    [J]. 2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 1196 - 1198
  • [2] Holistically Stream-based Processing Xtwig Queries
    Wang, Guoren
    Ning, Bo
    Yu, Ge
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2008, 11 (04): : 407 - 425
  • [3] Holistically Stream-based Processing Xtwig Queries
    Guoren Wang
    Bo Ning
    Ge Yu
    [J]. World Wide Web, 2008, 11
  • [4] Frame-based Programming, Stream-Based Processing for Medical Image Processing Applications
    Joost Hoozemans
    Rob de Jong
    Steven van der Vlugt
    Jeroen Van Straten
    Uttam Kumar Elango
    Zaid Al-Ars
    [J]. Journal of Signal Processing Systems, 2019, 91 : 47 - 59
  • [5] Frame-based Programming, Stream-Based Processing for Medical Image Processing Applications
    Hoozemans, Joost
    de Jong, Rob
    van der Vlugt, Steven
    Van Straten, Jeroen
    Elango, Uttam Kumar
    Al-Ars, Zaid
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 91 (01): : 47 - 59
  • [6] Optimization of data-intensive workflows in stream-based data processing models
    Saima Gulzar Ahmad
    Chee Sun Liew
    M. Mustafa Rafique
    Ehsan Ullah Munir
    [J]. The Journal of Supercomputing, 2017, 73 : 3901 - 3923
  • [7] Optimization of data-intensive workflows in stream-based data processing models
    Ahmad, Saima Gulzar
    Liew, Chee Sun
    Rafique, M. Mustafa
    Munir, Ehsan Ullah
    [J]. JOURNAL OF SUPERCOMPUTING, 2017, 73 (09): : 3901 - 3923
  • [8] GraphTides: A Framework for Evaluating Stream-based Graph Processing Platforms
    Erb, Benjamin
    Meissner, Dominik
    Kargl, Frank
    Steer, Benjamin A.
    Cuadrado, Felix
    Margan, Domagoj
    Pietzuch, Peter
    [J]. GRADES-NDA '18: PROCEEDINGS OF THE 1ST ACM SIGMOD JOINT INTERNATIONAL WORKSHOP ON GRAPH DATA MANAGEMENT EXPERIENCES & SYSTEMS (GRADES) AND NETWORK DATA ANALYTICS (NDA) 2018 (GRADES-NDA 2018), 2018,
  • [9] Authoring Processing Chains for Stream-Based Internet Information Retrieval Systems
    Katz, Philipp
    Feldmann, Marius
    Lunze, Torsten
    Sprenger, Sebastian
    Schill, Alexander
    [J]. BUSINESS INFORMATION SYSTEMS, BIS 2012, 2012, 117 : 189 - 200
  • [10] Feasibility analysis of AsterixDB and Spark streaming with Cassandra for stream-based processing
    Pääkkönen P.
    [J]. Journal of Big Data, 3 (1)