A multiclass classification using one-versus-all approach with the differential partition sampling ensemble

被引:19
|
作者
Gao, Xin [1 ]
He, Yang [1 ]
Zhang, Mi [2 ]
Diao, Xinping [2 ]
Jing, Xiao [1 ]
Ren, Bing [1 ]
Ji, Weijia [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Automat, Beijing, Peoples R China
[2] China Elect Power Res Inst, Beijing, Peoples R China
关键词
Multiclass classification; One-versus-all; Class imbalance; Differential partition sampling; Ensemble learning; DIRECTED ACYCLIC GRAPH; VS-ONE STRATEGY; FRAUD DETECTION; DECISION TREE; SELECTION; SMOTE; BINARIZATION;
D O I
10.1016/j.engappai.2020.104034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The One-versus-all(OVA) approach is one of the mainstream decomposition methods by which multiple binary classifiers are used to solve multiclass classification tasks. However, it exists the problems of serious class imbalance. This paper proposes a differential partition sampling ensemble method(DPSE) in the OVA framework. The number of majority samples and that of the minority samples in each binary training dataset are used as the upper and lower limits of the sampling interval respectively. Within this range, the construction process of the arithmetic sequence is simulated to generate the set containing multiple different sampling numbers with equal intervals. All samples are divided into safe examples, borderline examples, rare examples, and outliers according to the neighborhood information, then Random undersampling for safe samples(s-Random undersampling) and SMOTE for borderline examples and rare examples (br-SMOTE) are proposed based on the distribution characteristics of the classes. In each iteration, according to the number of differential sampling, the two methods are used to undersample or oversample the majority and minority in each binary training dataset to balance the number of positive and negative samples, which preserves the characteristic of the class structure as much as possible. Balanced training sets are used to train the binary classification model with multiple sub classifiers. The thorough experiments performed on 27 KEEL public multiclass datasets show that DPSE outperforms the typical methods in the OVA scheme, the One-versus-One scheme or direct way in classification performance.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Instance selection using one-versus-all and one-versus-one decomposition approaches in multiclass classification datasets
    Fang, Ching-Lin
    Wang, Ming-Chang
    Tsai, Chih-Fong
    Lin, Wei-Chao
    Liao, Pei-Qi
    [J]. EXPERT SYSTEMS, 2023, 40 (06)
  • [2] One-versus-one and one-versus-all multiclass SVM-RFE for gene selection in cancer classification
    Duan, Kai-Bo
    Rajapakse, Jagath C.
    Nguyen, Minh N.
    [J]. EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS, 2007, 4447 : 47 - +
  • [3] Combining One-Versus-One and One-Versus-All Strategies to Improve Multiclass SVM Classifier
    Chmielnicki, Wieslaw
    Stapor, Katarzyna
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 37 - 45
  • [4] Unified Classification and Rejection: A One-versus-all Framework
    Cheng, Zhen
    Zhang, Xu-Yao
    Liu, Cheng-Lin
    [J]. MACHINE INTELLIGENCE RESEARCH, 2024, 21 (05) : 870 - 887
  • [5] Multiclass from Binary: Expanding One-Versus-All, One-Versus-One and ECOC-Based Approaches
    Rocha, Anderson
    Goldenstein, Siome Klein
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (02) : 289 - 302
  • [6] Dialogue Act classification using RNNs introducing one-versus-all and attention mechanism
    Izumia H.
    Kato S.
    [J]. IEEJ Transactions on Electronics, Information and Systems, 2019, 139 (12) : 1407 - 1414
  • [7] One-against-all ensemble for multiclass pattern classification
    Oong, Tatt Hee
    Isa, Nor Ashidi Mat
    [J]. APPLIED SOFT COMPUTING, 2012, 12 (04) : 1303 - 1308
  • [8] Adapted One-versus-All Decision Trees for Data Stream Classification
    Hashemi, Sattar
    Yang, Ying
    Mirzamomen, Zahra
    Kangavari, Mohammadreza
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (05) : 624 - 637
  • [9] Generalized linear kernels for one-versus-all classification: Application to speaker recognition
    Hatch, Andrew O.
    Stolcke, Andreas
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 5443 - 5446
  • [10] Solving the slate tile classification problem using a DAGSVM multiclassification algorithm based on SVM binary classifiers with a one-versus-all approach
    Martinez, J.
    Iglesias, C.
    Matias, J. M.
    Taboada, J.
    Araujo, M.
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2014, 230 : 464 - 472