Online Feature Screening for Data Streams With Concept Drift

被引:3
|
作者
Wang, Mingyuan [1 ]
Barbu, Adrian [1 ]
机构
[1] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
关键词
Feature extraction; Adaptation models; Fading channels; Data models; Computational modeling; Uncertainty; Indexes; Concept drift; data stream mining; feature screening; feature selection; model adaptation; DYNAMIC FEATURE-SELECTION; GENE-EXPRESSION; CLASSIFICATION;
D O I
10.1109/TKDE.2022.3232752
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Screening feature selection methods are often used as a preprocessing step for reducing the number of variables before training a model. Traditional screening methods only focus on dealing with complete high dimensional datasets. However, modern datasets not only have higher dimensions and larger sample size, but also have properties such as streaming input, sparsity, and concept drift. Therefore a considerable number of online feature selection methods were introduced to handle these kinds of problems in recent years. Online screening methods are one of the categories of online feature selection methods. The methods that we propose in this paper are capable of handling all three situations mentioned above, in classification settings. Our experiments show that the proposed methods can generate the same feature importance as their offline versions with faster speed and less storage requirements. Furthermore, the results show that online screening methods with integrated model adaptation have a higher true feature detection rate than without model adaptation on data streams exhibiting concept drift. Among the three large real datasets that potentially have concept drift, online screening methods with model adaptation show advantages in either saving computation time and space, reducing model complexity, or improving prediction accuracy.
引用
下载
收藏
页码:11693 / 11707
页数:15
相关论文
共 50 条
  • [21] An Active Learning Method for Data Streams with Concept Drift
    Park, Cheong Hee
    Kang, Youngsoon
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 746 - 752
  • [22] Incremental Weighted Ensemble for Data Streams With Concept Drift
    Jiao B.
    Guo Y.
    Yang C.
    Pu J.
    Zheng Z.
    Gong D.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (01): : 92 - 103
  • [23] A comprehensive analysis of concept drift locality in data streams
    Department of Computer Science, Virginia Commonwealth University, Richmond
    VA, United States
    Knowl Based Syst,
  • [24] Accuracy Updated Ensemble for Data Streams with Concept Drift
    Brzezinski, Dariusz
    Stefanowski, Jerzy
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PART II, 2011, 6679 : 155 - 163
  • [25] An online fuzzy model for classification of data streams with drift
    Shahparast, Homeira
    Mansoori, Eghbal G.
    2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2017, : 91 - 95
  • [26] A Framework for Online Process Concept Drift Detection from Event Streams
    Liu, Na
    Huang, Jiwei
    Cui, Lizhen
    2018 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (IEEE SCC 2018), 2018, : 105 - 112
  • [27] Dynamic Ensemble Selection for Imbalanced Data Streams With Concept Drift
    Jiao, Botao
    Guo, Yinan
    Gong, Dunwei
    Chen, Qiuju
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 1278 - 1291
  • [28] On learning guarantees to unsupervised concept drift detection on data streams
    de Mello, Rodrigo F.
    Vaz, Yule
    Grossi, Carlos H.
    Bifet, Albert
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 117 : 90 - 102
  • [29] Diversity in Ensemble Model for Classification of Data Streams with Concept Drift
    Kolarik, Michal
    Sarnovsky, Martin
    Paralic, Jan
    2021 IEEE 19TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2021), 2021, : 355 - 359
  • [30] Adaptive Bagging Methods for Classification of Data Streams with Concept Drift
    Sarnovsky, Martin
    Marcinko, Jan
    ACTA POLYTECHNICA HUNGARICA, 2021, 18 (03) : 47 - 63