Bilevel Online Deep Learning in Non-stationary Environment

被引:1
|
作者
Han, Ya-nan [1 ]
Liu, Jian-wei [1 ]
Xiao, Bing-biao [1 ]
Wang, Xin-Tan [1 ]
Luo, Xiong-lin [1 ]
机构
[1] China Univ Petr, Coll Informat Sci & Engn, Dept Automat, Beijing Campus CUP, Beijing, Peoples R China
关键词
Online Deep Learning; Bilevel optimization; Concept drift;
D O I
10.1007/978-3-030-86340-1_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed enormous progress of online learning. However, a major challenge on the road to artificial agents is concept drift, that is, the data probability distribution would change where the data instance arrives sequentially in a stream fashion, which would lead to catastrophic forgetting and degrade the performance of the model. In this paper, we proposed a new Bilevel Online Deep Learning (BODL) framework, which combine bilevel optimization strategy and online ensemble classifier. In BODL algorithm, we use an ensemble classifier, which use the output of different hidden layers in deep neural network to build multiple base classifiers, the important weights of the base classifiers are updated according to exponential gradient descent method in an online manner. Besides, we apply the similar constraint to overcome the convergence problem of online ensemble framework. Then an effective concept drift detection mechanism utilizing the error rate of classifier is designed to monitor the change of the data probability distribution. When the concept drift is detected, our BODL algorithm can adaptively update the model parameters via bilevel optimization and then circumvent the large drift and encourage positive transfer. Finally, the extensive experiments and ablation studies are conducted on various datasets and the competitive numerical results illustrate that our BODL algorithm is a promising approach.
引用
收藏
页码:347 / 358
页数:12
相关论文
共 50 条
  • [41] Continual Prototype Evolution: Learning Online from Non-Stationary Data Streams
    De lange, Matthias
    Tuytelaars, Tinne
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8230 - 8239
  • [42] Learning for non-stationary Dirichlet processes
    Quinn, A.
    Karny, M.
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2007, 21 (10) : 827 - 855
  • [43] Social Learning in non-stationary environments
    Boursier, Etienne
    Perchet, Vianney
    Scarsini, Marco
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [44] The complexity of non-stationary reinforcement learning
    Peng, Binghui
    Papadimitriou, Christos
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
  • [45] Improved Selection of Auxiliary Objectives using Reinforcement Learning in Non-Stationary Environment
    Petrova, Irina
    Buzdalova, Arina
    Buzdalov, Maxim
    2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2014, : 580 - 583
  • [46] Review on novelty detection in the non-stationary environment
    Supriya Agrahari
    Sakshi Srivastava
    Anil Kumar Singh
    Knowledge and Information Systems, 2024, 66 : 1549 - 1574
  • [47] Review on novelty detection in the non-stationary environment
    Agrahari, Supriya
    Srivastava, Sakshi
    Singh, Anil Kumar
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (03) : 1549 - 1574
  • [48] Granular description of data in a non-stationary environment
    Rami Al-Hmouz
    Witold Pedrycz
    Abdullah Saeed Balamash
    Ali Morfeq
    Soft Computing, 2018, 22 : 523 - 540
  • [49] Deep learning based missing data recovery of non-stationary wind velocity
    Lin, Qiushuang
    Bao, Xuming
    Li, Chunxiang
    JOURNAL OF WIND ENGINEERING AND INDUSTRIAL AERODYNAMICS, 2022, 224
  • [50] Granular description of data in a non-stationary environment
    Al-Hmouz, Rami
    Pedrycz, Witold
    Balamash, Abdullah Saeed
    Morfeq, Ali
    SOFT COMPUTING, 2018, 22 (02) : 523 - 540