No Free Lunch Theorem for concept drift detection in streaming data classification: A review

Cited by: 57
Authors
Hu, Hanqing [1]
Kantardzic, Mehmed [1]
Sethi, Tegjyot S. [1]
Affiliation
[1] Univ Louisville, CECS Dept, Louisville, KY 40292 USA
Keywords
classification; concept drift; data stream; unlabeled samples; NONSTATIONARY DATA STREAMS; RECURRING CONCEPTS; EVOLVING DATA; ENSEMBLE; MODEL; CLASSIFIERS; ONLINE; SELECTION
DOI
10.1002/widm.1327
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Many real-world data mining applications have to deal with unlabeled streaming data. The data are unlabeled because the sheer volume of the stream makes it impractical to label a significant portion of it. Data streams can evolve over time, and these changes are called concept drifts. Concept drifts have different characteristics, which can be used to categorize them into different types. Many concept drift detection approaches face a trade-off between performance and cost. On the one hand, high-accuracy detection approaches usually require labeled data, and labeling can be costly. On the other hand, a variety of methods have been proposed for concept drift detection with unlabeled data, but these approaches are often best suited to only a subset of the concept drift types. The objective of this survey is to present these methods, categorize them, and give usage recommendations based on their behavior under different types of concept drift.
This article is categorized under: Fundamental Concepts of Data and Knowledge > Data Concepts; Fundamental Concepts of Data and Knowledge > Key Design Issues in Data Mining; Explainable AI > Classification
Pages: 25