ElStream: An Ensemble Learning Approach for Concept Drift Detection in Dynamic Social Big Data Stream Learning

Cited by: 63
Authors
Abbasi, Ahmad [1]
Javed, Abdul Rehman [2]
Chakraborty, Chinmay [3]
Nebhen, Jamel [4]
Zehra, Wisha [1]
Jalil, Zunera [2]
Affiliations
[1] Air Univ, Fac Comp & AI, Islamabad 44000, Pakistan
[2] Air Univ, Dept Cyber Secur, Islamabad 44000, Pakistan
[3] Birla Inst Technol, Dept Elect & Commun Engn, Ranchi 835215, Bihar, India
[4] Prince Sattam Bin Abdulaziz Univ, Coll Comp Sci & Engn, Al Kharj 11942, Saudi Arabia
Keywords
Big Data; Machine learning; Light emitting diodes; Training; Data models; Standards; Licenses; Internet of Things; big data; smart concept drift; social data; online learning; ensemble learning; HETEROGENEOUS ENSEMBLE; ONLINE; CLASSIFIER;
DOI
10.1109/ACCESS.2021.3076264
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
With the rapid growth of communication technologies and smart devices, an enormous surge in data traffic has been observed. A huge amount of data is generated every second by different applications, users, and devices. This rapid generation of data has created the need for solutions that can analyze how data changes over time, often in unforeseen ways, despite resource constraints. Such unforeseeable changes in the underlying distribution of streaming data over time are known as concept drifts. This paper presents a novel approach named ElStream that detects concept drift using ensemble and conventional machine learning techniques, evaluated on both real and artificial data. ElStream uses majority voting and allows only the optimum classifier to vote on the decision. Experiments were conducted to evaluate the performance of the proposed approach. According to the experimental analysis, the ensemble learning approach provides consistent performance on both artificial and real-world data sets. The experiments show that ElStream achieves accuracy gains of 12.49%, 11.98%, 10.06%, 1.2%, and 0.33% on the PokerHand, LED, Random RBF, Electricity, and SEA datasets, respectively, compared with previous state-of-the-art studies and conventional machine learning algorithms.
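The abstract does not spell out how ElStream restricts voting, so the following is only a minimal sketch of majority voting with a filtered voter pool, not the authors' implementation. The function names, the `recent_accuracy` scores, and the `min_accuracy` threshold are all hypothetical; the assumption here is that "optimum" classifiers are those whose recent accuracy meets a threshold.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the most frequent label; ties go to the label seen first."""
    return Counter(predictions).most_common(1)[0][0]


def select_voters(recent_accuracy, min_accuracy):
    """Keep classifiers whose recent accuracy meets the (assumed) threshold."""
    return [name for name, acc in recent_accuracy.items() if acc >= min_accuracy]


def ensemble_predict(predictions_by_model, recent_accuracy, min_accuracy=0.5):
    """Majority vote restricted to the selected classifiers.

    predictions_by_model: dict mapping classifier name -> predicted label
    recent_accuracy: dict mapping classifier name -> accuracy on recent samples
    """
    voters = select_voters(recent_accuracy, min_accuracy)
    if not voters:
        # Fall back to all classifiers if none passes the threshold.
        voters = list(predictions_by_model)
    return majority_vote([predictions_by_model[name] for name in voters])


# Hypothetical predictions and accuracy scores for five classifiers.
preds = {"a": 1, "b": 0, "c": 0, "d": 1, "e": 1}
acc = {"a": 0.9, "b": 0.8, "c": 0.85, "d": 0.3, "e": 0.4}
filtered_label = ensemble_predict(preds, acc, min_accuracy=0.5)
unfiltered_label = majority_vote(list(preds.values()))
```

In this toy input, filtering out the two weak classifiers changes the ensemble decision, which illustrates why restricting the voter pool can matter when some members degrade after a drift.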
Pages: 66408-66419
Number of pages: 12