Analyzing and repairing concept drift adaptation in data stream classification

被引:0
|
作者
Ben Halstead
Yun Sing Koh
Patricia Riddle
Russel Pears
Mykola Pechenizkiy
Albert Bifet
Gustavo Olivares
Guy Coulson
机构
[1] The University of Auckland,School of Computer Science
[2] Auckland University of Technology,undefined
[3] Eindhoven University of Technology,undefined
[4] University of Waikato,undefined
[5] LTCI,undefined
[6] Télécom Paris,undefined
[7] IP-Paris,undefined
[8] National Institute of Water and Atmospheric Research,undefined
来源
Machine Learning | 2022年 / 111卷
关键词
Concept drift; Data stream classification; Recurring concepts;
D O I
暂无
中图分类号
学科分类号
摘要
Data collected over time often exhibit changes in distribution, or concept drift, caused by changes in factors relevant to the classification task, e.g. weather conditions. Incorporating all relevant factors into the model may be able to capture these changes, however, this is usually not practical. Data stream based methods, which instead explicitly detect concept drift, have been shown to retain performance under unknown changing conditions. These methods adapt to concept drift by training a model to classify each distinct data distribution. However, we hypothesize that existing methods do not robustly handle real-world tasks, leading to adaptation errors where context is misidentified. Adaptation errors may cause a system to use a model which does not fit the current data, reducing performance. We propose a novel repair algorithm to identify and correct errors in concept drift adaptation. Evaluation on synthetic data shows that our proposed AiRStream system has higher performance than baseline methods, while is also better at capturing the dynamics of the stream. Evaluation on an air quality inference task shows AiRStream provides increased real-world performance compared to eight baseline methods. A case study shows that AiRStream is able to build a robust model of environmental conditions over this task, allowing the adaptions made to concept drift to be analysed and related to changes in weather. We discovered a strong predictive link between the adaptions made by AiRStream and changes in meteorological conditions.
引用
收藏
页码:3489 / 3523
页数:34
相关论文
共 50 条
  • [1] Analyzing and Repairing Concept Drift Adaptation in Data Stream Classification
    Halstead, Ben
    Koh, Yun Sing
    Riddle, Patricia
    Pears, Russel
    Pechenizkiy, Mykola
    Bifet, Albert
    Olivares, Gustavo
    Coulson, Guy
    2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2021,
  • [2] Analyzing and repairing concept drift adaptation in data stream classification
    Halstead, Ben
    Koh, Yun Sing
    Riddle, Patricia
    Pears, Russel
    Pechenizkiy, Mykola
    Bifet, Albert
    Olivares, Gustavo
    Coulson, Guy
    MACHINE LEARNING, 2022, 111 (10) : 3489 - 3523
  • [3] Uncertain Data Stream Classification with Concept Drift
    Lv Yanxia
    Wang Cuirong
    Wang Cong
    Liu Bingyu
    2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016), 2016, : 265 - +
  • [4] Scalable concept drift adaptation for stream data mining
    Hu, Lisha
    Li, Wenxiu
    Lu, Yaru
    Hu, Chunyu
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (05) : 6725 - 6743
  • [5] Adaptive Classification Algorithm for Concept Drift Data Stream
    Cai H.
    Lu K.
    Wu Q.
    Wu D.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (03): : 633 - 646
  • [6] An ensemble method for data stream classification in the presence of concept drift
    Department of Computer Engineering, University of Zanjan, Zanjan
    45371-38791, Iran
    Front. Inf. Technol. Electr. Eng., 12 (1059-1068):
  • [7] Anensemble method for data stream classification in the presence of concept drift
    Abbaszadeh, Omid
    Amiri, Ali
    Khanteymoori, Ali Reza
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2015, 16 (12) : 1059 - 1068
  • [8] Study on a classification model of data stream based on concept drift
    1600, Science and Engineering Research Support Society (09):
  • [9] An ensemble method for data stream classification in the presence of concept drift
    Omid ABBASZADEH
    Ali AMIRI
    Ali Reza KHANTEYMOORI
    FrontiersofInformationTechnology&ElectronicEngineering, 2015, 16 (12) : 1059 - 1068
  • [10] An ensemble method for data stream classification in the presence of concept drift
    Omid Abbaszadeh
    Ali Amiri
    Ali Reza Khanteymoori
    Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 1059 - 1068