Concept drift detection and accelerated convergence of online learning

被引:5
|
作者
Guo, Husheng [1 ,2 ]
Li, Hai [1 ]
Sun, Ni [1 ]
Ren, Qiaoyan [1 ]
Zhang, Aijuan [1 ]
Wang, Wenjian [1 ,2 ]
机构
[1] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Shanxi, Peoples R China
[2] Shanxi Univ, Key Lab Computat Intelligence & Chinese Informat, Minist Educ, Taiyuan 030006, Shanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Streaming data; Concept drift; Authenticity; Model convergence; NEURAL-NETWORKS; DATA STREAMS; ENSEMBLE; CLASSIFICATION; MODELS;
D O I
10.1007/s10115-022-01790-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Streaming data has become an important form in the era of big data, and the concept drift, as one of the most important problem of it, is often studied deeply. However, similar to true concept drift, noise and too small training samples will also lead to the classification performance fluctuation, which is easy to confuse with true concept drift. To solve this problem, an improved concept drift detection method is proposed, and the accelerated convergence of the model after concept drift is also studied. Firstly, the effective fluctuation sites can be obtained by group detection method. Secondly, the authenticity of concept drift can be determined by tracking the testing accuracy of reference sites near the effective fluctuation site. Lastly, in the convergence acceleration stage, the time sequential distance is designed to measure the similarity of these sequential data blocks during different time periods, and the noncritical disturbance data with the largest time sequential distance are removed sequentially to improve the convergence speed of the model after concept drift occurs. The experimental results demonstrate that the proposed method not only produces better identification results in distinguishing true and false concept drift but also improves the convergence speed of the model.
引用
下载
收藏
页码:1005 / 1043
页数:39
相关论文
共 50 条
  • [1] Concept drift detection and accelerated convergence of online learning
    Husheng Guo
    Hai Li
    Ni Sun
    Qiaoyan Ren
    Aijuan Zhang
    Wenjian Wang
    Knowledge and Information Systems, 2023, 65 : 1005 - 1043
  • [2] Learning with Online Drift Detection
    Frias Blanco, Isvani
    del Campo Avila, Jose
    Ramos Jimenez, Gonzalo
    Morales Bueno, Rafael
    Ortiz Diaz, Agustin
    Caballero Mota, Yaile
    COMPUTACION Y SISTEMAS, 2014, 18 (01): : 169 - 183
  • [3] Online Detection of Concept Drift in Visual Tracking
    Liu, Yichen
    Zhou, Yue
    NEURAL INFORMATION PROCESSING, ICONIP 2014, PT III, 2014, 8836 : 159 - 166
  • [4] Big-Data Streaming Applications Scheduling with Online Learning and Concept Drift Detection
    Kanoun, Karim
    van der Schaar, Mihaela
    2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2015, : 1547 - 1550
  • [5] Adaptive online learning for classification under concept drift
    Goel, Kanu
    Batra, Shalini
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2021, 24 (02) : 128 - 135
  • [6] The Impact of Latency on Online Classification Learning with Concept Drift
    Marrs, Gary R.
    Hickey, Ray J.
    Black, Michaela M.
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2010, 6291 : 459 - 469
  • [7] A Method Aware of Concept Drift for Online Botnet Detection
    Schwengber, Bruno Henrique
    Vergutz, Andressa
    Prates, Nelson G., Jr.
    Nogueira, Michele
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [8] Online Federated Learning via Non-Stationary Detection and Adaptation Amidst Concept Drift
    Ganguly, Bhargav
    Aggarwal, Vaneet
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (01) : 643 - 653
  • [9] A Systematic Study of Online Class Imbalance Learning With Concept Drift
    Wang, Shuo
    Minku, Leandro L.
    Yao, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) : 4802 - 4821
  • [10] The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift
    Minku, Leandro L.
    White, Allan P.
    Yao, Xin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (05) : 730 - 742