Implementation of Data Stream Classification Neural Network Models Over Big Data Platforms

被引:0
|
作者
Puentes-Marchal, Fernando [1 ]
Dolores Perez-Godoy, Maria [1 ]
Gonzalez, Pedro [1 ]
Jose Del Jesus, Maria [1 ]
机构
[1] Univ Jaen, Jaen, Spain
关键词
Datastream; Classification; Extreme learning machine; Big data; Spark streaming; EXTREME LEARNING-MACHINE;
D O I
10.1007/978-3-030-85099-9_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Streaming is being increasingly demanded because it helps in analyzing data in real-time and in decision making. Over time, the number of existing devices increases continuously, generating a huge amount of data. Processing this data with traditional algorithms is impractical, so it is necessary to apply distributed algorithms in a Big Data context. In this paper, Apache Spark is used to implement some distributed versions of algorithms based on Extreme Learning Machine (ELM). In addition, these algorithms are evaluated with different real and synthetic datasets by performing two experiments. The first one tries to demonstrate that the performance of the distributed algorithms is the same as that of the sequential versions. The second experiment is a study about the behaviour of the algorithms in the presence of concept drift, an important research area within streaming.
引用
收藏
页码:272 / 280
页数:9
相关论文
共 50 条
  • [1] Data stream classification and big data analytics
    Krawczyk, Bartosz
    Wozniak, Michal
    Stefanowski, Jerzy
    [J]. NEUROCOMPUTING, 2015, 150 : 238 - 239
  • [2] Implementation of Data Preprocessing Techniques on Distributed Big Data Platforms
    Celik, Oguz
    Hasanbasoglu, Muruvvet
    Aktas, Mehmet S.
    Kalipsiz, Oya
    Kanli, Alper Nebi
    [J]. 2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 73 - 78
  • [3] Stream processing platforms for analyzing big dynamic data
    Hagedorn, Stefan
    Goetze, Philipp
    Saleh, Omran
    Sattler, Kai-Uwe
    [J]. IT-INFORMATION TECHNOLOGY, 2016, 58 (04): : 195 - 205
  • [4] Stream of Unbalanced Medical Big Data Using Convolutional Neural Network
    Gao, Weiwei
    Chen, Li
    Shang, Tao
    [J]. IEEE ACCESS, 2020, 8 : 81310 - 81319
  • [5] Stream Processing of Scientific Big Data on Heterogeneous Platforms - Image Analytics on Big Data in Motion
    Najmabadi, S. M.
    Klaiber, M.
    Wang, Z.
    Baroud, Y.
    Simon, S.
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 965 - 970
  • [6] Online Classification Algorithm for Uncertain Data Stream in Big Data
    [J]. Lyu, Yan Xia (shaoqilyx@163.com), 1600, Northeast University (37):
  • [7] Evolving Big Data Stream Classification with MapReduce
    Haque, Ahsanul
    Parker, Brandon
    Khan, Latifur
    Thuraisingham, Bhavani
    [J]. 2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 570 - 577
  • [8] Big Data Monetization: Platforms and Business Models
    Monteiro, Domingos S. M. P.
    Meira, Silvio R. L.
    Ferraz, Felipe Silva
    [J]. PROCEEDINGS OF 2021 16TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI'2021), 2021,
  • [9] Neural Network Models in Big Data Analytics and Cyber Security
    Ghimes, Ana-Maria
    Patriciu, Victor-Valeriu
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE - ECAI 2017, 2017,
  • [10] Digital Artificial Neural Network Implementation on a FPGA for Data Classification
    Morales, C.
    Flores, U.
    Adam, M.
    Diaz, M.
    Caballero, J. A.
    Criado, D.
    Pavoni, S.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2015, 13 (10) : 3216 - 3220