Mapping the Big Data Landscape: Technologies, Platforms and Paradigms for Real-Time Analytics of Data Streams

被引:5
|
作者
Dubuc, Timothee [1 ,2 ]
Stahl, Frederic [3 ]
Roesch, Etienne B. [1 ,2 ]
机构
[1] Univ Reading, Sch Psychol & Clin Language Sci, Reading RG6 6AH, Berks, England
[2] Univ Reading, Ctr Integrat Neurosci & Neurodynam, Reading RG6 6AH, Berks, England
[3] German Res Ctr Artificial Intelligence GmbH DFKI, Lab Niedersachsen, D-26129 Oldenburg, Germany
基金
英国工程与自然科学研究理事会;
关键词
Big Data; Business; Benchmark testing; Tools; Hardware; Task analysis; Measurement; Big data applications; Internet of Things (IoT); edge computing; distributed computing; pipeline processing; CHALLENGES; MOBILE; SYSTEM;
D O I
10.1109/ACCESS.2020.3046132
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The 'Big Data' of yesterday is the 'data' of today. As technology progresses, new challenges arise and new solutions are developed. Due to the emergence of Internet of Things applications within the last decade, the field of Data Mining has been faced with the challenge of processing and analysing data streams in real-time, and under high data throughput conditions. This is often referred to as the Velocity aspect of Big Data. Whereas there are numerous reviews on Data Stream Mining techniques and applications, there is very little work surveying Data Stream processing paradigms and associated technologies, from data collection through to pre-processing and feature processing, from the perspective of the user, not that of the service provider. In this article, we evaluate a particular type of solution, which focuses on streaming data, and processing pipelines that permit online analysis of data streams that cannot be stored as-is on the computing platform. We review foundational computational concepts such as distributed computation, fault-tolerant computing, and computational paradigms/architectures. We then review the available technological solutions, and applications that pertain to data stream mining as case studies of these theoretical concepts. We conclude with a discussion of the field of data stream processing/analytics, future directions and research challenges.
引用
收藏
页码:15351 / 15374
页数:24
相关论文
共 50 条
  • [21] Real-Time Machine Learning Competition on Data Streams at the IEEE Big Data 2019
    Boulegane, Dihia
    Radulovic, Nedeljko
    Bifet, Albert
    Fievet, Ghislain
    Sohn, Jimin
    Nam, Yeonwoo
    Yu, Seojeong
    Choi, Dong-Wan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 3493 - 3497
  • [22] Real-time Big Data Technologies of Energy Internet Platform
    Wang Guilan
    Zhou Guoliang
    Zhao Hongshan
    Liu Hongyang
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY (POWERCON), 2016,
  • [23] Big data analytics on social networks for real-time depression detection
    Angskun, Jitimon
    Tipprasert, Suda
    Angskun, Thara
    [J]. JOURNAL OF BIG DATA, 2022, 9 (01)
  • [24] Engineering Scalable Distributed Services for Real-Time Big Data Analytics
    Jambi, Sahar
    Anderson, Kenneth M.
    [J]. 2017 THIRD IEEE INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2017), 2017, : 131 - 140
  • [25] Big Data Analytics of Geosocial Media for Planning and Real-Time Decisions
    Rathore, M. Mazhar
    Paul, Anand
    Ahmad, Awais
    Imran, Muhammad
    Guizani, Mohsen
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017,
  • [26] Big data analytics on social networks for real-time depression detection
    Jitimon Angskun
    Suda Tipprasert
    Thara Angskun
    [J]. Journal of Big Data, 9
  • [27] Big Data Analytics for Real Time Dispatch
    Mogra, Himanshu
    Segu, SaiNikhil
    DeLong, James
    Canales-Vaschy, Remy
    Ramakrishnan, Srikanth
    Sridharan, Sriram
    Penumutchu, Srikanth
    [J]. 2024 35TH ANNUAL SEMI ADVANCED SEMICONDUCTOR MANUFACTURING CONFERENCE, ASMC, 2024,
  • [28] Distributed Real-Time Sentiment Analysis for Big Data Social Streams
    Rahnama, Amir Hossein Akhavan
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2014, : 789 - 794
  • [29] Survey on Real-time Anomaly Detection Technology for Big Data Streams
    Luo, Yuanvan
    Du, Xuehui
    Sun, Yi
    [J]. PROCEEDINGS OF 2018 12TH IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2018, : 26 - 30
  • [30] Research on real-time outlier detection over big data streams
    Chen, Liangchen
    Gao, Shu
    Cao, Xiufeng
    [J]. International Journal of Computers and Applications, 2020, 42 (01): : 93 - 101