Variance Feedback Drift Detection Method for Evolving Data Streams Mining

被引:0
|
作者
Han, Meng [1 ,2 ]
Meng, Fanxing [1 ]
Li, Chunpeng [1 ]
机构
[1] North Minzu Univ, Sch Comp Sci & Engn, Yinchuan 750021, Peoples R China
[2] North Minzu Univ, Key Lab Images & Graph Intelligent Proc State Ethn, Yinchuan 750021, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 16期
基金
中国国家自然科学基金;
关键词
concept drift; variance; data stream; classification; statistical test; ONLINE;
D O I
10.3390/app14167157
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Learning from changing data streams is one of the important tasks of data mining. The phenomenon of the underlying distribution of data streams changing over time is called concept drift. In classification decision-making, the occurrence of concept drift will greatly affect the classification efficiency of the original classifier, that is, the old decision-making model is not suitable for the new data environment. Therefore, dealing with concept drift from changing data streams is crucial to guarantee classifier performance. Currently, most concept drift detection methods apply the same detection strategy to different data streams, with little attention to the uniqueness of each data stream. This limits the adaptability of drift detectors to different environments. In our research, we designed a unique solution to address this issue. First, we proposed a variance estimation strategy and a variance feedback strategy to characterize the data stream's characteristics through variance. Based on this variance, we developed personalized drift detection schemes for different data streams, thereby enhancing the adaptability of drift detection in various environments. We conducted experiments on data streams with various types of drifts. The experimental results show that our algorithm achieves the best average ranking for accuracy on the synthetic dataset, with an overall ranking 1.12 to 1.5 higher than the next-best algorithm. In comparison with algorithms using the same tests, our method improves the ranking by 3 to 3.5 for the Hoeffding test and by 1.12 to 2.25 for the McDiarmid test. In addition, they achieve a good balance between detection delay and false positive rates. Finally, our algorithm ranks higher than existing drift detection methods across the four key metrics of accuracy, CPU time, false positives, and detection delay, meeting our expectations.
引用
收藏
页数:29
相关论文
共 50 条
  • [21] Reservoir of diverse adaptive learners and stacking fast hoeffding drift detection methods for evolving data streams
    Pesaranghader, Ali
    Viktor, Herna
    Paquet, Eric
    MACHINE LEARNING, 2018, 107 (11) : 1711 - 1743
  • [22] Concept learning using one-class classifiers for implicit drift detection in evolving data streams
    Ömer Gözüaçık
    Fazli Can
    Artificial Intelligence Review, 2021, 54 : 3725 - 3747
  • [23] Drift Detection over Non-stationary Data Streams Using Evolving Spiking Neural Networks
    Lobo, Jesus L.
    Del Ser, Javier
    Lana, Ibai
    Nekane Bilbao, Miren
    Kasabov, Nikola
    INTELLIGENT DISTRIBUTED COMPUTING XII, 2018, 798 : 82 - 94
  • [24] Reservoir of diverse adaptive learners and stacking fast hoeffding drift detection methods for evolving data streams
    Ali Pesaranghader
    Herna Viktor
    Eric Paquet
    Machine Learning, 2018, 107 : 1711 - 1743
  • [25] Batch Weighted Ensemble for Mining Data Streams with Concept Drift
    Deckert, Magdalena
    FOUNDATIONS OF INTELLIGENT SYSTEMS, 2011, 6804 : 290 - 299
  • [26] Bhattacharyya distance based concept drift detection method for evolving data stream
    Baidari, Ishwar
    Honnikoll, Nagaraj
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
  • [27] SPEDS: A Framework for Mining Sequential Patterns in Evolving Data Streams
    Soliman, Amany F.
    Ebrahim, Gamal A.
    Mohammed, Hoda K.
    2011 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2011, : 464 - 469
  • [28] Extremely Fast Decision Tree Mining for Evolving Data Streams
    Bifet, Albert
    Zhang, Jiajin
    Fan, Wei
    He, Cheng
    Zhang, Jianfeng
    Qian, Jianfeng
    Holmes, Geoff
    Pfahringer, Bernhard
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1733 - 1742
  • [29] Handling Concept Drift in Data Streams by Using Drift Detection Methods
    Patil, Malini M.
    DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2018, VOL 2, 2019, 839 : 155 - 166
  • [30] High utility drift detection in quantitative data streams
    Quang-Huy Duong
    Ramampiaro, Heri
    Norvag, Kjetil
    Fournier-Viger, Philippe
    Thu-Lan Dam
    KNOWLEDGE-BASED SYSTEMS, 2018, 157 : 34 - 51