Concept drift detection and accelerated convergence of online learning

Cited by: 5
Authors
Guo, Husheng [1, 2]
Li, Hai [1]
Sun, Ni [1]
Ren, Qiaoyan [1]
Zhang, Aijuan [1]
Wang, Wenjian [1, 2]
Affiliations
[1] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Shanxi, Peoples R China
[2] Shanxi Univ, Key Lab Computat Intelligence & Chinese Informat, Minist Educ, Taiyuan 030006, Shanxi, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Streaming data; Concept drift; Authenticity; Model convergence; NEURAL-NETWORKS; DATA STREAMS; ENSEMBLE; CLASSIFICATION; MODELS;
DOI
10.1007/s10115-022-01790-6
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Streaming data has become an important form of data in the era of big data, and concept drift, one of its most important problems, has been studied extensively. However, noise and overly small training samples can also cause classification performance to fluctuate, and such fluctuations are easily confused with true concept drift. To address this problem, an improved concept drift detection method is proposed, and accelerating the convergence of the model after concept drift is also studied. First, effective fluctuation sites are obtained by a group detection method. Second, the authenticity of a concept drift is determined by tracking the testing accuracy at reference sites near the effective fluctuation site. Finally, in the convergence acceleration stage, a time sequential distance is designed to measure the similarity of sequential data blocks from different time periods, and the noncritical disturbance data with the largest time sequential distance are removed sequentially to speed up model convergence after concept drift occurs. The experimental results demonstrate that the proposed method not only produces better identification results in distinguishing true from false concept drift but also improves the convergence speed of the model.
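The abstract does not give the detection rule itself, so the following is only a rough sketch of the general idea of separating a true drift from a transient fluctuation by looking at accuracy near a fluctuation site. Everything in it is an assumption made for illustration and not the authors' algorithm: the function names, the thresholds `drop_threshold` and `recover_margin`, the number of reference blocks `n_ref`, and the premise that the per-block accuracies come from a model that is not retrained between blocks.

```python
from statistics import mean


def find_fluctuation_sites(block_acc, drop_threshold=0.1):
    """Return indices of blocks whose accuracy drops by more than
    drop_threshold relative to the previous block (hypothetical rule)."""
    return [
        i
        for i in range(1, len(block_acc))
        if block_acc[i - 1] - block_acc[i] > drop_threshold
    ]


def is_true_drift(block_acc, site, n_ref=3, recover_margin=0.05):
    """Judge a fluctuation site using the blocks that follow it.

    Assumption: under a true drift the accuracy of the unchanged model
    stays depressed on the following reference blocks, whereas a
    fluctuation caused by noise or a small sample recovers quickly.
    """
    before = block_acc[site - 1]
    reference = block_acc[site + 1 : site + 1 + n_ref]
    if not reference:
        return False  # no reference blocks yet, so no evidence of drift
    return before - mean(reference) > recover_margin


# Made-up per-block test accuracies: a one-block dip at index 2 and a
# sustained drop starting at index 6.
acc = [0.90, 0.91, 0.70, 0.89, 0.90, 0.91, 0.65, 0.66, 0.64, 0.67]
sites = find_fluctuation_sites(acc)
labels = {s: is_true_drift(acc, s) for s in sites}
print(sites, labels)
```

With this made-up accuracy series, the dip at block 2 recovers on the next blocks and is treated as a false alarm, while the drop at block 6 persists and is flagged as drift. A real detector would need a statistically grounded threshold and would have to account for the model adapting online.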
Pages: 1005-1043
Number of pages: 39
Related papers
50 in total
  • [31] Machine learning in concept drift detection using statistical measures
    Ali Abdu, Nail Adeeb
    Basulaim, Khaled Omer
    International Journal of Computers and Applications, 2024, 46 (05) : 281 - 291
  • [32] An online ensembles approach for handling concept drift in data streams: diversified online ensembles detection
    Parneeta Sidhu
    M. P. S. Bhatia
    International Journal of Machine Learning and Cybernetics, 2015, 6 : 883 - 909
  • [33] An online ensembles approach for handling concept drift in data streams: diversified online ensembles detection
    Sidhu, Parneeta
    Bhatia, M. P. S.
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) : 883 - 909
  • [34] Concept Drift Detection for Online Class Imbalance Learning
    Wang, Shuo
    Minku, Leandro L.
    Ghezzi, Davide
    Caltabiano, Daniele
    Tino, Peter
    Yao, Xin
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [35] GPU-Accelerated Extreme Learning Machines for Imbalanced Data Streams with Concept Drift
    Krawczyk, Bartosz
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 1692 - 1701
  • [36] FP-ELM: An online sequential learning algorithm for dealing with concept drift
    Liu, Dong
    Wu, YouXi
    Jiang, He
    NEUROCOMPUTING, 2016, 207 : 322 - 334
  • [37] Towards Online Learning and Concept Drift for Offloading Complex Event Processing in the Edge
    Neto, Joao Alexandre
    Fonseca, Jorge C. B.
    Gama, Kiev
    2020 IEEE/ACM SYMPOSIUM ON EDGE COMPUTING (SEC 2020), 2020, : 167 - 169
  • [38] Online Extreme Learning Machine for Handling Concept Drift and Class Imbalance Problem
    Vinayagasundaram, B.
    Aarthi, R. J.
    Abirami, N.
    2017 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2017,
  • [39] Online Anomaly Detection with Concept Drift Adaptation using Recurrent Neural Networks
    Saurav, Sakti
    Malhotra, Pankaj
    Tv, Vishnu
    Gugulothu, Narendhar
    Vig, Lovekesh
    Agarwal, Puneet
    Shroff, Gautam
    PROCEEDINGS OF THE ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MANAGEMENT OF DATA (CODS-COMAD'18), 2018, : 78 - 87
  • [40] Online eigenvector transformation reflecting concept drift for improving network intrusion detection
    Park, Seongchul
    Seo, Sanghyun
    Jeong, Changhoon
    Kim, Juntae
    EXPERT SYSTEMS, 2020, 37 (05)