Mapping the Big Data Landscape: Technologies, Platforms and Paradigms for Real-Time Analytics of Data Streams

被引:5
|
作者
Dubuc, Timothee [1 ,2 ]
Stahl, Frederic [3 ]
Roesch, Etienne B. [1 ,2 ]
机构
[1] Univ Reading, Sch Psychol & Clin Language Sci, Reading RG6 6AH, Berks, England
[2] Univ Reading, Ctr Integrat Neurosci & Neurodynam, Reading RG6 6AH, Berks, England
[3] German Res Ctr Artificial Intelligence GmbH DFKI, Lab Niedersachsen, D-26129 Oldenburg, Germany
基金
英国工程与自然科学研究理事会;
关键词
Big Data; Business; Benchmark testing; Tools; Hardware; Task analysis; Measurement; Big data applications; Internet of Things (IoT); edge computing; distributed computing; pipeline processing; CHALLENGES; MOBILE; SYSTEM;
D O I
10.1109/ACCESS.2020.3046132
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The 'Big Data' of yesterday is the 'data' of today. As technology progresses, new challenges arise and new solutions are developed. Due to the emergence of Internet of Things applications within the last decade, the field of Data Mining has been faced with the challenge of processing and analysing data streams in real-time, and under high data throughput conditions. This is often referred to as the Velocity aspect of Big Data. Whereas there are numerous reviews on Data Stream Mining techniques and applications, there is very little work surveying Data Stream processing paradigms and associated technologies, from data collection through to pre-processing and feature processing, from the perspective of the user, not that of the service provider. In this article, we evaluate a particular type of solution, which focuses on streaming data, and processing pipelines that permit online analysis of data streams that cannot be stored as-is on the computing platform. We review foundational computational concepts such as distributed computation, fault-tolerant computing, and computational paradigms/architectures. We then review the available technological solutions, and applications that pertain to data stream mining as case studies of these theoretical concepts. We conclude with a discussion of the field of data stream processing/analytics, future directions and research challenges.
引用
收藏
页码:15351 / 15374
页数:24
相关论文
共 50 条
  • [1] Big Data Streaming Platforms to Support Real-time Analytics
    Fernandes, Eliana
    Salgado, Ana Carolina
    Bernardino, Jorge
    [J]. ICSOFT: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2020, : 426 - 433
  • [2] The growing role of integrated and insightful big and real-time data analytics platforms
    Ranganathan, Indrakumari
    Thangamuthu, Poongodi
    Palanimuthu, Suresh
    Balusamy, Balamurugan
    [J]. DIGITAL TWIN PARADIGM FOR SMARTER SYSTEMS AND ENVIRONMENTS: THE INDUSTRY USE CASES, 2020, 117 : 165 - 186
  • [3] Real-time tweet analytics using hybrid hashtags on twitter big data streams
    Gupta, Vibhuti
    Hewett, Rattikorn
    [J]. Information (Switzerland), 2020, 11 (07):
  • [4] Real-Time Tweet Analytics Using Hybrid Hashtags on Twitter Big Data Streams
    Gupta, Vibhuti
    Hewett, Rattikorn
    [J]. INFORMATION, 2020, 11 (07)
  • [5] Real-Time Big Data Analytics: Applications and Challenges
    Mohamed, Nader
    Al-Jaroodi, Jameela
    [J]. 2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2014, : 305 - 310
  • [6] A Methodology of Real-Time Data Fusion for Localized Big Data Analytics
    Jabbar, Sohail
    Malik, Kaleem R.
    Ahmad, Mudassar
    Aldabbas, Omar
    Asif, Muhammad
    Khalid, Shehzad
    Han, Kijun
    Ahmed, Syed Hassan
    [J]. IEEE ACCESS, 2018, 6 : 24510 - 24520
  • [7] Logical big data integration and near real-time data analytics
    Silva, Bruno
    Moreira, Jose
    Costa, Rogerio Luis de C.
    [J]. DATA & KNOWLEDGE ENGINEERING, 2023, 146
  • [8] Big Data Stream Computing in Healthcare Real-Time Analytics
    Ta, Van-Dai
    Liu, Chuan-Ming
    Nkabinde, Goodwill Wandile
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2016), 2016, : 37 - 42
  • [9] A Survey on Real-time Big Data Analytics: Applications and Tools
    Yadranjiaghdam, Babak
    Pool, Nathan
    Tabrizi, Nasseh
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 404 - 409
  • [10] An incremental approach for real-time Big Data visual analytics
    Garcia, Ignacio
    Casado, Ruben
    Bouchachia, Abdelhamid
    [J]. 2016 IEEE 4TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW), 2016, : 177 - 182