Improving the Efficiency of Ensemble Classifier Adaptive Random Forest with Meta Level Learning for Real-Time Data Streams

被引:2
|
作者
Arya, Monika [1 ]
Choudhary, Chaitali [1 ]
机构
[1] Univ Petr & Energy Studies, Dehra Dun, Uttarakhand, India
关键词
Data stream mining; Random forests; Ensemble; Concept drift; Pruning; Forest; Adaptive random forest; Data streams;
D O I
10.1007/978-981-15-1084-7_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
New challenges have emerged in data mining as the traditional techniques have floundered with real-time data streams. The traditional technique needs refurbishing so as to acclimatize with concept drifting data streams. Thus dealing with the concept changes is the most imperative task of stream data mining. Ensemble classifiers have the ability to automatically adapt with the incoming drifts and, therefore, it is the most interesting research area in data stream mining. Bagging, Boosting and Random forest generation are the common ensemble techniques and are the most popular machine learning approaches in the current scenario for static data (Gomes HM, Bifet A, Read J, Barddal JP, Enembreck F, Pfharinger B, Abdessalem T (2017) Adaptive random forests for evolving data stream classification. Mach Learn 106(910):469-1495, [1]). A large number of base classifiers in an ensemble can cause computational overhead. Data mining classifiers for real-time data streams, therefore, need to be updated constantly and retrained with the labeled instances of the newly arrived novel classes in data streams and to cope with concept drift; otherwise, the mining models will become less and less accurate as time passes by. However, for data streams, adaptive random forest algorithms have been widely used for ensemble generation due to its competence to handle different types of drifts. This paper proposes a modified adaptive random forest with meta level learner algorithm and concept adaptive very fast decision tree to overcome the concept drift problem in real-time data streams. The proposed algorithm is experimentally compared with state-of-the-art adaptive random forest algorithm on several real synthetic datasets. Results indicate its efficiency in terms of accuracy and processing time.
引用
收藏
页码:11 / 21
页数:11
相关论文
共 50 条
  • [1] An adaptive ensemble classification framework for real-time data streams by distributed control systems
    Wang Sufang
    Neural Computing and Applications, 2020, 32 : 4139 - 4149
  • [2] An adaptive ensemble classification framework for real-time data streams by distributed control systems
    Wang Sufang
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (09): : 4139 - 4149
  • [3] Accelerated Real-Time Classification of Evolving Data Streams using Adaptive Random Forests
    Ridder, Frank
    Chen, Kuan-Hsun
    Alachiotis, Nikolaos
    2023 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY, ICFPT, 2023, : 232 - 237
  • [4] Dynamically Evolving Fuzzy Classifier for Real-time Classification of Data Streams
    Baruah, Rashmi Dutta
    Angelov, Plamen
    Baruah, Diganta
    2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2014, : 383 - 389
  • [5] Adaptive load management over real-time data streams
    Li, Xin
    Ma, Li
    Li, Kun
    Wang, Kun
    Wang, Hong-An
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, : 719 - +
  • [6] Real-time adaptive event detection in astronomical data streams
    1600, Institute of Electrical and Electronics Engineers Inc., United States (29):
  • [7] Real-Time Adaptive Event Detection in Astronomical Data Streams
    Thompson, David R.
    Burke-Spolaor, Sarah
    Deller, Adam T.
    Majid, Walid A.
    Palaniswamy, Divya
    Tingay, Steven J.
    Wagstaff, Kiri L.
    Wayth, Randall B.
    IEEE INTELLIGENT SYSTEMS, 2014, 29 (01) : 48 - 55
  • [8] A Novel Online Real-time Classifier for Multi-label Data Streams
    Venkatesan, Rajasekar
    Er, Meng Joo
    Wu, Shiqian
    Pratama, Mahardhika
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1833 - 1840
  • [9] Online real-time learning strategies for data streams for Neurocomputing
    Pratama, Mahardhika
    Lughofer, Edwin
    Wang, Dianhui
    NEUROCOMPUTING, 2017, 262 : 1 - 3
  • [10] Fast Adaptive Real-Time Classification for Data Streams with Concept Drift
    Tennant, Mark
    Stahl, Frederic
    Gomes, Joao Bartolo
    INTERNET AND DISTRIBUTED COMPUTING SYSTEMS, IDCS 2015, 2015, 9258 : 265 - 272