Analyzing and Repairing Concept Drift Adaptation in Data Stream Classification

被引:0
|
作者
Halstead, Ben [1 ]
Koh, Yun Sing [1 ]
Riddle, Patricia [1 ]
Pears, Russel [2 ]
Pechenizkiy, Mykola [3 ]
Bifet, Albert [4 ,5 ]
Olivares, Gustavo [6 ]
Coulson, Guy [6 ]
机构
[1] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
[2] Auckland Univ Technol, Auckland, New Zealand
[3] Eindhoven Univ Technol, Eindhoven, Netherlands
[4] Univ Waikato, Hamilton, New Zealand
[5] IP Paris, Telecom Paris, LTCI, Paris, France
[6] Natl Inst Water & Atmospher Res, Auckland, New Zealand
关键词
D O I
10.1109/DSAA53316.2021.9564191
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data collected over time often exhibit changes in distribution, or concept drift, caused by changes in hidden context relevant to the classification task, e.g. weather conditions. Adaptive learning methods are able to retain performance in changing conditions by explicitly detecting concept drift and changing the classifier used to make predictions. However, in real-world conditions, existing methods often select classifiers which poorly represent current data due to adaptation errors, where change in context is misidentified. We propose the AiRStream system, which uses a novel repair algorithm to identify and correct adaptation errors. We identify errors by periodically testing the performance of inactive classifiers. If an error is identified, a backtracking procedure repairs training done under the misidentified context. AiRStream achieves higher accuracy compared to baseline methods and selects classifiers which better match changes in context. A case study on a real-world air quality inference task shows that AiRStream is able to build a robust model of environmental conditions, allowing the adaptions made to concept drift to be analysed and related to changes in weather.
引用
收藏
页数:2
相关论文
共 50 条
  • [1] Analyzing and repairing concept drift adaptation in data stream classification
    Ben Halstead
    Yun Sing Koh
    Patricia Riddle
    Russel Pears
    Mykola Pechenizkiy
    Albert Bifet
    Gustavo Olivares
    Guy Coulson
    [J]. Machine Learning, 2022, 111 : 3489 - 3523
  • [2] Analyzing and repairing concept drift adaptation in data stream classification
    Halstead, Ben
    Koh, Yun Sing
    Riddle, Patricia
    Pears, Russel
    Pechenizkiy, Mykola
    Bifet, Albert
    Olivares, Gustavo
    Coulson, Guy
    [J]. MACHINE LEARNING, 2022, 111 (10) : 3489 - 3523
  • [3] Uncertain Data Stream Classification with Concept Drift
    Lv Yanxia
    Wang Cuirong
    Wang Cong
    Liu Bingyu
    [J]. 2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016), 2016, : 265 - +
  • [4] Scalable concept drift adaptation for stream data mining
    Hu, Lisha
    Li, Wenxiu
    Lu, Yaru
    Hu, Chunyu
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (05) : 6725 - 6743
  • [5] Adaptive Classification Algorithm for Concept Drift Data Stream
    Cai, Huan
    Lu, Kezhong
    Wu, Qirong
    Wu, Dingming
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (03): : 633 - 646
  • [6] Anensemble method for data stream classification in the presence of concept drift
    Abbaszadeh, Omid
    Amiri, Ali
    Khanteymoori, Ali Reza
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2015, 16 (12) : 1059 - 1068
  • [7] Study on a classification model of data stream based on concept drift
    [J]. 1600, Science and Engineering Research Support Society (09):
  • [8] An ensemble method for data stream classification in the presence of concept drift
    Omid ABBASZADEH
    Ali AMIRI
    Ali Reza KHANTEYMOORI
    [J]. Frontiers of Information Technology & Electronic Engineering, 2015, 16 (12) : 1059 - 1068
  • [9] Feature Selection for Handling Concept Drift in the Data Stream Classification
    Turkov, Pavel
    Krasotkina, Olga
    Mottl, Vadim
    Sychugov, Alexey
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION (MLDM 2016), 2016, 9729 : 614 - 629
  • [10] An ensemble method for data stream classification in the presence of concept drift
    Omid Abbaszadeh
    Ali Amiri
    Ali Reza Khanteymoori
    [J]. Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 1059 - 1068