Concept Neurons - Handling Drift Issues for Real-Time Industrial Data Mining

被引:12
|
作者
Moreira-Matias, Luis [1 ]
Gama, Joao [2 ,4 ]
Mendes-Moreira, Joao [2 ,3 ]
机构
[1] NEC Labs Europe, Kurfursten Anlage 36, D-69115 Heidelberg, Germany
[2] Univ Porto, Fac Econ, P-4200465 Oporto, Portugal
[3] LIAAD INESC TEC, P-4200465 Oporto, Portugal
[4] Univ Porto, DEI FEUP, P-4200465 Oporto, Portugal
关键词
Supervised learning; Online learning; Concept drift; Perceptron; Stochastic gradient descent; Regression; Residuals; Transportation;
D O I
10.1007/978-3-319-46131-1_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning from data streams is a challenge faced by data science professionals from multiple industries. Most of them struggle hardly on applying traditional Machine Learning algorithms to solve these problems. It happens so due to their high availability on ready-to-use software libraries on big data technologies (e.g. SparkML). Nevertheless, most of them cannot cope with the key characteristics of this type of data such as high arrival rate and/or non-stationary distributions. In this paper, we introduce a generic and yet simplistic framework to fill this gap denominated Concept Neurons. It leverages on a combination of continuous inspection schemas and residual-based updates over the model parameters and/or the model output. Such framework can empower the resistance of most of induction learning algorithms to concept drifts. Two distinct and hence closely related flavors are introduced to handle different drift types. Experimental results on successful distinct applications on different domains along transportation industry are presented to uncover the hidden potential of this methodology.
引用
收藏
页码:96 / 111
页数:16
相关论文
共 50 条
  • [1] Scalable real-time classification of data streams with concept drift
    Tennant, Mark
    Stahl, Frederic
    Rana, Omer
    Gomes, Joao Bartolo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 75 : 187 - 199
  • [2] Data stream mining: methods and challenges for handling concept drift
    Scott Wares
    John Isaacs
    Eyad Elyan
    SN Applied Sciences, 2019, 1
  • [3] Data stream mining: methods and challenges for handling concept drift
    Wares, Scott
    Isaacs, John
    Elyan, Eyad
    SN APPLIED SCIENCES, 2019, 1 (11):
  • [4] Fast Adaptive Real-Time Classification for Data Streams with Concept Drift
    Tennant, Mark
    Stahl, Frederic
    Gomes, Joao Bartolo
    INTERNET AND DISTRIBUTED COMPUTING SYSTEMS, IDCS 2015, 2015, 9258 : 265 - 272
  • [5] Real-Time Concept Drift Detection and Its Application to ECG Data
    Desale, Ketan Sanjay
    Shinde, Swati
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2021, 17 (10) : 160 - 170
  • [6] Real-time data mining
    不详
    EXPERT SYSTEMS, 1997, 14 (03) : 157 - 157
  • [7] Security Issues in Industrial Real-Time Systems
    Al-Jarad, Talhah
    Al Madani, Basem
    2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL V, 2010, : 357 - 361
  • [8] Real-time feature selection technique with concept drift detection using adaptive micro-clusters for data stream mining
    Hammoodi, Mahmood Shakir
    Stahl, Frederic
    Badii, Atta
    KNOWLEDGE-BASED SYSTEMS, 2018, 161 : 205 - 239
  • [9] Dependable real-time data mining
    Thuraisingham, B
    Khan, L
    Clifton, C
    Maurer, J
    Ceruti, M
    ISORC 2005: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON OBJECT-ORIENTED REAL-TIME DISTRIBUTED COMPUTING, PROCEEDINGS, 2005, : 158 - 165
  • [10] ISSUES IN REAL-TIME DATA MANAGEMENT
    GRAHAM, MH
    REAL-TIME SYSTEMS, 1992, 4 (03) : 185 - 202