Analyzing and Repairing Concept Drift Adaptation in Data Stream Classification

被引:0
|
作者
Halstead, Ben [1 ]
Koh, Yun Sing [1 ]
Riddle, Patricia [1 ]
Pears, Russel [2 ]
Pechenizkiy, Mykola [3 ]
Bifet, Albert [4 ,5 ]
Olivares, Gustavo [6 ]
Coulson, Guy [6 ]
机构
[1] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
[2] Auckland Univ Technol, Auckland, New Zealand
[3] Eindhoven Univ Technol, Eindhoven, Netherlands
[4] Univ Waikato, Hamilton, New Zealand
[5] IP Paris, Telecom Paris, LTCI, Paris, France
[6] Natl Inst Water & Atmospher Res, Auckland, New Zealand
关键词
D O I
10.1109/DSAA53316.2021.9564191
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data collected over time often exhibit changes in distribution, or concept drift, caused by changes in hidden context relevant to the classification task, e.g. weather conditions. Adaptive learning methods are able to retain performance in changing conditions by explicitly detecting concept drift and changing the classifier used to make predictions. However, in real-world conditions, existing methods often select classifiers which poorly represent current data due to adaptation errors, where change in context is misidentified. We propose the AiRStream system, which uses a novel repair algorithm to identify and correct adaptation errors. We identify errors by periodically testing the performance of inactive classifiers. If an error is identified, a backtracking procedure repairs training done under the misidentified context. AiRStream achieves higher accuracy compared to baseline methods and selects classifiers which better match changes in context. A case study on a real-world air quality inference task shows that AiRStream is able to build a robust model of environmental conditions, allowing the adaptions made to concept drift to be analysed and related to changes in weather.
引用
收藏
页数:2
相关论文
共 50 条
  • [31] Classification of concept drift data streams
    Padmalatha, E.
    Reddy, C. R. K.
    Rani, B. Padmaja
    [J]. 2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA), 2014,
  • [32] Semi-supervised Classification of Concept Drift Data Stream Based on Local Component Replacement
    Qin, Keke
    Wen, Yimin
    [J]. ARTIFICIAL INTELLIGENCE (ICAI 2018), 2018, 888 : 98 - 112
  • [33] Detection of Concept Drift for Learning from Stream Data
    Lee, Jeonghoon
    Magoules, Frederic
    [J]. 2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 241 - 245
  • [34] Analyzing concept drift and shift from sample data
    Webb, Geoffrey I.
    Lee, Loong Kuan
    Goethals, Bart
    Petitjean, Francois
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (05) : 1179 - 1199
  • [35] PGNBC: Pearson Gaussian Naive Bayes classifier for data stream classification with recurring concept drift
    Babu, D. Kishore
    Ramadevi, Y.
    Ramana, K. V.
    [J]. INTELLIGENT DATA ANALYSIS, 2017, 21 (05) : 1173 - 1191
  • [36] Detecting concept drift using HEDDM in data stream
    Dongre, Snehlata S.
    Malik, Latesh G.
    Thomas, Achamma
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2019, 7 (2-3) : 164 - 179
  • [37] Detecting algorithm of concept drift from stream data
    Zhang, Jie
    Zhao, Feng
    [J]. Kongzhi yu Juece/Control and Decision, 2013, 28 (01): : 29 - 35
  • [38] Concept drift detection on stream data for revising DBSCAN
    Miyata, Yasushi
    Ishikawa, Hiroshi
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2021, 104 (01) : 87 - 94
  • [39] Analyzing concept drift and shift from sample data
    Geoffrey I. Webb
    Loong Kuan Lee
    Bart Goethals
    François Petitjean
    [J]. Data Mining and Knowledge Discovery, 2018, 32 : 1179 - 1199
  • [40] Efficient Handling of Concept Drift and Concept Evolution over Stream Data
    Haque, Ahsanul
    Khan, Latifur
    Baron, Michael
    Thuraisingham, Bhavani
    Aggarwal, Charu
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 481 - 492