Concept drift detection and accelerated convergence of online learning

被引:5
|
作者
Guo, Husheng [1 ,2 ]
Li, Hai [1 ]
Sun, Ni [1 ]
Ren, Qiaoyan [1 ]
Zhang, Aijuan [1 ]
Wang, Wenjian [1 ,2 ]
机构
[1] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Shanxi, Peoples R China
[2] Shanxi Univ, Key Lab Computat Intelligence & Chinese Informat, Minist Educ, Taiyuan 030006, Shanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Streaming data; Concept drift; Authenticity; Model convergence; NEURAL-NETWORKS; DATA STREAMS; ENSEMBLE; CLASSIFICATION; MODELS;
D O I
10.1007/s10115-022-01790-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Streaming data has become an important form in the era of big data, and the concept drift, as one of the most important problem of it, is often studied deeply. However, similar to true concept drift, noise and too small training samples will also lead to the classification performance fluctuation, which is easy to confuse with true concept drift. To solve this problem, an improved concept drift detection method is proposed, and the accelerated convergence of the model after concept drift is also studied. Firstly, the effective fluctuation sites can be obtained by group detection method. Secondly, the authenticity of concept drift can be determined by tracking the testing accuracy of reference sites near the effective fluctuation site. Lastly, in the convergence acceleration stage, the time sequential distance is designed to measure the similarity of these sequential data blocks during different time periods, and the noncritical disturbance data with the largest time sequential distance are removed sequentially to improve the convergence speed of the model after concept drift occurs. The experimental results demonstrate that the proposed method not only produces better identification results in distinguishing true and false concept drift but also improves the convergence speed of the model.
引用
下载
收藏
页码:1005 / 1043
页数:39
相关论文
共 50 条
  • [41] Towards Online Concept Drift Detection with Feature Selection for Data Stream Classification
    Hammoodi, Mahmood
    Stahl, Frederic
    Tennant, Mark
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1549 - 1550
  • [42] Network Intrusion Detection through Online Transformation of Eigenvector Reflecting Concept Drift
    Park, Seongchul
    Seo, Sanghyun
    Jeong, Changhoon
    Kim, Juntae
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE, E-LEARNING AND INFORMATION SYSTEMS 2018 (DATA'18), 2018,
  • [43] Online concept evolution detection based on active learning
    Guo, Husheng
    Li, Hai
    Cong, Lu
    Wang, Wenjian
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (04) : 1589 - 1633
  • [44] A Multiscale Concept Drift Detection Method for Learning from Data Streams
    Wang, XueSong
    Kang, Qi
    Zhou, MengChu
    Yao, SiYa
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2018, : 786 - 790
  • [45] A Framework to Monitor Machine Learning Systems Using Concept Drift Detection
    Zhou, Xianzhe
    Lo Faro, Wally
    Zhang, Xiaoying
    Arvapally, Ravi Santosh
    BUSINESS INFORMATION SYSTEMS, PT I, 2019, 353 : 218 - 231
  • [46] A Novel Concept Drift Detection Method for Incremental Learning in Nonstationary Environments
    Yang, Zhe
    Al-Dahidi, Sameer
    Baraldi, Piero
    Zio, Enrico
    Montelatici, Lorenzo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (01) : 309 - 320
  • [47] Machine Learning & Concept Drift based Approach for Malicious Website Detection
    Singhal, Siddharth
    Chawla, Utkarsh
    Shorey, Rajeev
    2020 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2020,
  • [48] Combining active learning with concept drift detection for data stream mining
    Krawczyk, Bartosz
    Pfahringer, Bernhard
    Wozniak, Michal
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2239 - 2244
  • [49] Detection & management of concept drift
    Mak, Lee-Onn
    Krause, Paul
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 3486 - +
  • [50] Unsupervised Concept Drift Detection via Imbalanced Cluster Discriminator Learning
    Zhao, Mingjie
    Zhang, Yiqun
    Ji, Yuzhu
    Lu, Yang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 31 - 43