A robust multi-class AdaBoost algorithm for mislabeled noisy data

被引:61
|
作者
Sun, Bo [1 ]
Chen, Songcan [1 ]
Wang, Jiandong [1 ]
Chen, Haiyan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, 29 Yudao St, Nanjing 210016, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Ensemble learning; AdaBoost; Robustness; Multi-class classification; Mislabeled noise; CLASSIFICATION; SETS;
D O I
10.1016/j.knosys.2016.03.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
AdaBoost has been theoretically and empirically proved to be a very successful ensemble learning algorithm, which iteratively generates a set of diverse weak learners and combines their outputs using the weighted majority voting rule as the final decision. However, in some cases, AdaBoost leads to overfitting especially for mislabeled noisy training examples, resulting in both its degraded generalization performance and non-robustness. Recently, a representative approach named noise-detection based AdaBoost (ND_AdaBoost) has been proposed to improve the robustness of AdaBoost in the two-class classification scenario, however, in the multi-class scenario, this approach can hardly achieve satisfactory performance due to the following three reasons. (1) If we decompose a multi-class classification problem using such strategies as one-versus-all or one-versus-one, the obtained two-class problems usually have imbalanced training sets, which negatively influences the performance of ND_AdaBoost (2) If we directly apply ND_AdaBoost to the multi-class classification scenario, its two-class loss function is no longer applicable and its accuracy requirement for the (weak) base classifiers, i.e., greater than 0.5, is too strong to be almost satisfied. (3) ND_AdaBoost still has the tendency of overfitting as it increases the weights of correctly classified noisy examples, which could make it focus on learning these noisy examples in the subsequent iterations. To solve the dilemma, in this paper, we propose a robust multi-class AdaBoost algorithm (Rob_MulAda) whose key ingredients consist in a noise-detection based multi-class loss function and a new weight updating scheme. Experimental study indicates that our newly-proposed weight updating scheme is indeed more robust to mislabeled noises than that of ND_AdaBoost in both two -class and multi -class scenarios. In addition, through the comparison experiments, we also verify the effectiveness of Rob_MulAda and provide a suggestion in choosing the most appropriate noise-alleviating approach according to the concrete noise level in practical applications. Crown Copyright (C) 2016 Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:87 / 102
页数:16
相关论文
共 50 条
  • [1] Multi-class AdaBoost
    Zhu, Ji
    Zou, Hui
    Rosset, Saharon
    Hastie, Trevor
    STATISTICS AND ITS INTERFACE, 2009, 2 (03) : 349 - 360
  • [2] Multi-class Classifier-Based Adaboost Algorithm
    Kim, Tae-Hyun
    Park, Dong-Chul
    Woo, Dong-Min
    Jeong, Taikyeong
    Min, Soo-Young
    INTELLIGENT SCIENCE AND INTELLIGENT DATA ENGINEERING, ISCIDE 2011, 2012, 7202 : 122 - 127
  • [3] A Multi-Class Cost Sensitivity AdaBoost Algorithm Using Multi-Class Cost Exponential Loss Function
    Zhai X.
    Wang X.
    Li R.
    Jia Q.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2017, 51 (08): : 33 - 39
  • [4] Transformer Fault Diagnosis Based on Multi-Class AdaBoost Algorithm
    Li, Jifang
    Li, Genxu
    Hai, Chen
    Guo, Mengbo
    IEEE ACCESS, 2022, 10 : 1522 - 1532
  • [5] Multi-class WHMBoost: An ensemble algorithm for multi-class imbalanced data
    Zhao, Jiakun
    Jin, Ju
    Zhang, Yibo
    Zhang, Ruifeng
    Chen, Si
    INTELLIGENT DATA ANALYSIS, 2022, 26 (03) : 599 - 614
  • [6] BPSO-Adaboost-KNN ensemble learning algorithm for multi-class imbalanced data classification
    Guo Haixiang
    Li Yijing
    Li Yanan
    Liu Xiao
    Li Jinling
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 49 : 176 - 193
  • [7] A Study on Emotion Recognition Based on Hierarchical Adaboost Multi-class Algorithm
    Zhang, Song
    Hu, Bin
    Li, Tiantian
    Zheng, Xiangwei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT II, 2018, 11335 : 105 - 113
  • [8] Using robust features with Multi-Class SVMs to classify noisy sounds
    Rabaoui, Asma
    Kadri, Hachem
    Lachiri, Zied
    Ellouze, Noureddine
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 594 - +
  • [9] Semi-supervised multi-class Adaboost by exploiting unlabeled data
    Song, Enmin
    Huang, Dongshan
    Ma, Guangzhi
    Hung, Chih-Cheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (06) : 6720 - 6726
  • [10] A noise-detection based AdaBoost algorithm for mislabeled data
    Cao, Jingjing
    Kwong, Sam
    Wang, Ran
    PATTERN RECOGNITION, 2012, 45 (12) : 4451 - 4465