Improving the Efficiency of Ensemble Classifier Adaptive Random Forest with Meta Level Learning for Real-Time Data Streams

被引:2
|
作者
Arya, Monika [1 ]
Choudhary, Chaitali [1 ]
机构
[1] Univ Petr & Energy Studies, Dehra Dun, Uttarakhand, India
来源
INTELLIGENT COMPUTING AND COMMUNICATION, ICICC 2019 | 2020年 / 1034卷
关键词
Data stream mining; Random forests; Ensemble; Concept drift; Pruning; Forest; Adaptive random forest; Data streams;
D O I
10.1007/978-981-15-1084-7_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
New challenges have emerged in data mining as the traditional techniques have floundered with real-time data streams. The traditional technique needs refurbishing so as to acclimatize with concept drifting data streams. Thus dealing with the concept changes is the most imperative task of stream data mining. Ensemble classifiers have the ability to automatically adapt with the incoming drifts and, therefore, it is the most interesting research area in data stream mining. Bagging, Boosting and Random forest generation are the common ensemble techniques and are the most popular machine learning approaches in the current scenario for static data (Gomes HM, Bifet A, Read J, Barddal JP, Enembreck F, Pfharinger B, Abdessalem T (2017) Adaptive random forests for evolving data stream classification. Mach Learn 106(910):469-1495, [1]). A large number of base classifiers in an ensemble can cause computational overhead. Data mining classifiers for real-time data streams, therefore, need to be updated constantly and retrained with the labeled instances of the newly arrived novel classes in data streams and to cope with concept drift; otherwise, the mining models will become less and less accurate as time passes by. However, for data streams, adaptive random forest algorithms have been widely used for ensemble generation due to its competence to handle different types of drifts. This paper proposes a modified adaptive random forest with meta level learner algorithm and concept adaptive very fast decision tree to overcome the concept drift problem in real-time data streams. The proposed algorithm is experimentally compared with state-of-the-art adaptive random forest algorithm on several real synthetic datasets. Results indicate its efficiency in terms of accuracy and processing time.
引用
收藏
页码:11 / 21
页数:11
相关论文
共 50 条
  • [31] Near Real-Time Burned Area Progression Mapping With Multispectral Data Using Ensemble Learning
    Hu, Xikun
    Wen, Hao
    Zhang, Puzhao
    Yuen, Ka-Veng
    Zhong, Ping
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [32] Evaluating the Effectiveness of Machine Learning Technologies in Improving Real-Time Drilling Data Quality
    Al-Gharbi, Salem
    Al-Majed, Abdulaziz
    Elkatatny, Salaheldin
    Abdulraheem, Abdulazeez
    JOURNAL OF ENERGY RESOURCES TECHNOLOGY-TRANSACTIONS OF THE ASME, 2022, 144 (09):
  • [33] A Fully Embedded Adaptive Real-Time Hand Gesture Classifier Leveraging HD-sEMG and Deep Learning
    Tam, Simon
    Boukadoum, Mounir
    Campeau-Lecours, Alexandre
    Gosselin, Benoit
    IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2020, 14 (02) : 232 - 243
  • [34] Adaptive Reservoir Neural Gas: An Effective Clustering Algorithm for Addressing Concept Drift in Real-Time Data Streams
    Demertzis, Konstantinos
    Iliadis, Lazaros
    Papaleonidas, Antonios
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 152 - 166
  • [35] A Data-Flow Oriented Deep Ensemble Learning Method for Real-Time Surface Defect Inspection
    Liu, Yuekai
    Gao, Hongli
    Guo, Liang
    Qin, Aoping
    Cai, Canyu
    You, Zhichao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (07) : 4681 - 4691
  • [36] Time-Distributed Attention-Layered Convolution Neural Network with Ensemble Learning using Random Forest Classifier for Speech Emotion Recognition
    Bhanusree, Yalamanchili
    Kumar, Samayamantula Srinivas
    Rao, Anne Koteswara
    JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2023, 22 (01): : 49 - 76
  • [37] Random-forest-based real-time contrasts control chart using adaptive breakpoints with symbolic aggregate approximation
    Lee, In-seok
    Park, Seung Hwan
    Baek, Jun-Geol
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 158
  • [38] Online Real-Time Analysis of Data Streams Based on an Incremental High-Order Deep Learning Model
    Li, Yuliang
    Zhang, Min
    Wang, Wei
    IEEE ACCESS, 2018, 6 : 77615 - 77623
  • [39] Real-Time Deep Learning-Based Anomaly Detection Approach for Multivariate Data Streams with Apache Flink
    Ha, Tae Wook
    Kang, Jung Mo
    Kim, Myoung Ho
    ICWE 2021 WORKSHOPS, ICWE 2021 INTERNATIONAL WORKSHOPS, 2022, 1508 : 39 - 49
  • [40] A task-level adaptive Map Reduce framework for real-time streaming data in healthcare applications
    Zhang, Fan
    Cao, Junwei
    Khan, Samee U.
    Li, Keqin
    Hwang, Kai
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 43-44 : 149 - 160