Online Extreme Learning Machine with Hybrid Sampling Strategy for Sequential Imbalanced Data

被引:0
|
作者
Wentao Mao
Mengxue Jiang
Jinwan Wang
Yuan Li
机构
[1] Henan Normal University,School of Computer and Information Engineering
[2] Engineering Technology Research Center for Computing Intelligence and Data Mining,School of Mechanics and Civil and Architecture
[3] Northwestern Polytechnical University,undefined
来源
Cognitive Computation | 2017年 / 9卷
关键词
Online sequential extreme learning machine; Imbalance problem; Principal curve; Leave-one-out cross validation;
D O I
暂无
中图分类号
学科分类号
摘要
In real applications of cognitive computation, data with imbalanced classes are used to be collected sequentially. In this situation, some of current machine learning algorithms, e.g., support vector machine, will obtain weak classification performance, especially on minority class. To solve this problem, a new hybrid sampling online extreme learning machine (ELM) on sequential imbalanced data is proposed in this paper. The key idea is keeping the majority and minority classes balanced with similar sequential distribution characteristic of the original data. This method includes two stages. At the offline stage, we introduce the principal curve to build confidence regions of minority and majority classes respectively. Based on these two confidence zones, over-sampling of minority class and under-sampling of majority class are both conducted to generate new synthetic samples, and then, the initial ELM model is established. At the online stage, we first choose the most valuable ones from the synthetic samples of majority class in terms of sample importance. Afterwards, a new online fast leave-one-out cross validation (LOO CV) algorithm utilizing Cholesky decomposition is proposed to determine whether to update the ELM network weight at online stage or not. We also prove theoretically that the proposed method has upper bound of information loss. Experimental results on seven UCI datasets and one real-world air pollutant forecasting dataset show that, compared with ELM, OS-ELM, meta-cognitive OS-ELM, and OSELM with SMOTE strategy, the proposed method can simultaneously improve the classification performance of minority and majority classes in terms of accuracy, G-mean value, and ROC curve. As a conclusion, the proposed hybrid sampling online extreme learning machine can be effectively applied to the sequential data imbalance problem with better generalization performance and numerical stability.
引用
收藏
页码:780 / 800
页数:20
相关论文
共 50 条
  • [1] Online Extreme Learning Machine with Hybrid Sampling Strategy for Sequential Imbalanced Data
    Mao, Wentao
    Jiang, Mengxue
    Wang, Jinwan
    Li, Yuan
    COGNITIVE COMPUTATION, 2017, 9 (06) : 780 - 800
  • [2] Online Sequential Extreme Learning Machine with Under-Sampling and Over-Sampling for Imbalanced Big Data Classification
    Du, Jie
    Vong, Chi-Man
    Chang, Yajie
    Jiao, Yang
    PROCEEDINGS OF ELM-2016, 2018, 9 : 229 - 239
  • [3] Two-Stage Hybrid Extreme Learning Machine for Sequential Imbalanced Data
    Mao, Wentao
    Wang, Jinwan
    He, Ling
    Tian, Yangyang
    PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 : 423 - 433
  • [4] Imbalanced Data Fault Diagnosis Based on an Evolutionary Online Sequential Extreme Learning Machine
    Hao, Wei
    Liu, Feng
    SYMMETRY-BASEL, 2020, 12 (08):
  • [5] Online sequential prediction of imbalance data with two-stage hybrid strategy by extreme learning machine
    Mao, Wentao
    Wang, Jinwan
    He, Ling
    Tian, Yangyang
    NEUROCOMPUTING, 2017, 261 : 94 - 105
  • [6] Online Sequential Classification of Imbalanced Data by Combining Extreme Learning Machine and improved SMOTE Algorithm
    Mao, Wentao
    Wang, Jinwan
    Wang, Liyun
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [7] Online sequential class-specific extreme learning machine for binary imbalanced learning
    Shukla, Sanyam
    Raghuwanshi, Bhagat Singh
    NEURAL NETWORKS, 2019, 119 : 235 - 248
  • [8] Online sequential prediction of bearings imbalanced fault diagnosis by extreme learning machine
    Mao, Wentao
    He, Ling
    Yan, Yunju
    Wang, Jinwan
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2017, 83 : 450 - 473
  • [9] Imbalanced Learning for Air Pollution by Meta-Cognitive Online Sequential Extreme Learning Machine
    Chi-Man Vong
    Weng-Fai Ip
    Chi-Chong Chiu
    Pak-Kin Wong
    Cognitive Computation, 2015, 7 : 381 - 391
  • [10] Imbalanced Learning for Air Pollution by Meta-Cognitive Online Sequential Extreme Learning Machine
    Vong, Chi-Man
    Ip, Weng-Fai
    Chiu, Chi-Chong
    Wong, Pak-Kin
    COGNITIVE COMPUTATION, 2015, 7 (03) : 381 - 391