Multiclass imbalanced learning with one-versus-one decomposition and spectral clustering

Cited by: 39
Authors
Li, Qianmu [1, 2, 3, 4, 5, 6]
Song, Yanjun [1, 7]
Zhang, Jing [1]
Sheng, Victor S. [3]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Wuyi Univ, Intelligent Mfg Dept, Jiangmen 529020, Peoples R China
[3] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
[4] Nanjing XiaoZhuang Univ, Nanjing 211171, Peoples R China
[5] Jinling Inst Technol, Nanjing 211169, Peoples R China
[6] Jiangsu Zhongtian Technol Co Ltd, Nantong 226463, Peoples R China
[7] Nanjing Liancheng Technol Dev Co Ltd, Nanjing 210008, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Imbalanced learning; Multiclass classification; One-versus-one decomposition; Spectral clustering; DATA-SETS; CLASSIFICATION; SMOTE;
DOI
10.1016/j.eswa.2019.113152
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In many real-world applications, an algorithm needs to learn multiclass classification models from data with imbalanced class distributions. Multiclass imbalanced learning is currently receiving increased attention from researchers. In contrast to traditional imbalanced learning on binary datasets, multiclass imbalanced learning faces great challenges from the variety of changes in the class distributions as well as the inadequate performance of multiclass classification algorithms. In this paper, we propose a novel data preprocessing-based method to solve this problem. The proposed method combines a one-versus-one (OVO) decomposition of class pairs and a spectral clustering technique. This method first decomposes a multiclass dataset into several binary-class datasets. Then, it uses spectral clustering to divide the minority classes of binary-class subsets into subspaces and oversamples them according to the characteristics of the data. Sampling based on spectral clustering takes into account the distribution of the data and effectively avoids oversampling outliers. After the data approximately reaches the equilibrium point, multiclass classifiers can be trained from these rebalanced data. We compared the proposed method with five state-of-the-art multiclass imbalanced learning methods on seven multiclass datasets, using multiclass area under the ROC curve (MAUC), the precision of minor classes (P-min) and the average precision of all classes (P-avg) as the performance metrics. The experimental results show that our proposed method has the best overall performance. (C) 2019 Elsevier Ltd. All rights reserved.
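The pipeline outlined in the abstract (OVO decomposition, then clustering and oversampling the minority class of each binary subset) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The function names `ovo_pairs` and `oversample_minority` and the `cluster_fn` hook are hypothetical. The spectral clustering step is not implemented here and is left as a pluggable callable. Picking clusters uniformly at random and interpolating between two points of the same cluster are assumptions, since the abstract does not specify how samples are allocated or generated.

```python
import random
from collections import Counter
from itertools import combinations


def ovo_pairs(X, y):
    """Yield (class_a, class_b, X_pair, y_pair) for every unordered class pair."""
    classes = sorted(set(y))
    for a, b in combinations(classes, 2):
        rows = [(x, label) for x, label in zip(X, y) if label in (a, b)]
        yield a, b, [x for x, _ in rows], [label for _, label in rows]


def oversample_minority(X_pair, y_pair, cluster_fn, rng):
    """Grow the smaller class of one binary subset to the size of the larger.

    cluster_fn(points) -> list of cluster ids is a hypothetical hook; a
    spectral clustering routine would be plugged in at this point.
    Each synthetic point is interpolated between two members of one cluster
    (an assumption, chosen so that new points stay inside a cluster).
    """
    counts = Counter(y_pair)
    (_, n_major), (minor, n_minor) = counts.most_common(2)

    # Group the minority points by the cluster id assigned to each of them.
    minority = [x for x, label in zip(X_pair, y_pair) if label == minor]
    clusters = {}
    for x, cid in zip(minority, cluster_fn(minority)):
        clusters.setdefault(cid, []).append(x)
    groups = list(clusters.values())

    X_out, y_out = list(X_pair), list(y_pair)
    for _ in range(n_major - n_minor):
        group = rng.choice(groups)  # assumption: clusters picked uniformly
        p, q = rng.choice(group), rng.choice(group)
        t = rng.random()
        X_out.append([pi + t * (qi - pi) for pi, qi in zip(p, q)])
        y_out.append(minor)
    return X_out, y_out
```

Each rebalanced binary subset returned by `oversample_minority` would then feed the classifier training stage mentioned in the abstract. How the subsets are combined for training is not stated there, so it is not sketched here.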
Pages: 14
Related papers
50 in total
  • [1] Multi-class Imbalanced Learning with One-Versus-One Decomposition: An Empirical Study
    Song, Yanjun
    Zhang, Jing
    Yan, Han
    Li, Qianmu
    [J]. CLOUD COMPUTING AND SECURITY, PT III, 2018, 11065 : 617 - 628
  • [2] Instance selection using one-versus-all and one-versus-one decomposition approaches in multiclass classification datasets
    Fang, Ching-Lin
    Wang, Ming-Chang
    Tsai, Chih-Fong
    Lin, Wei-Chao
    Liao, Pei-Qi
    [J]. EXPERT SYSTEMS, 2023, 40 (06)
  • [3] Combining One-Versus-One and One-Versus-All Strategies to Improve Multiclass SVM Classifier
    Chmielnicki, Wieslaw
    Stapor, Katarzyna
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 37 - 45
  • [4] One-versus-one and one-versus-all multiclass SVM-RFE for gene selection in cancer classification
    Duan, Kai-Bo
    Rajapakse, Jagath C.
    Nguyen, Minh N.
    [J]. EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS, 2007, 4447 : 47 - +
  • [5] Multiclass from Binary: Expanding One-Versus-All, One-Versus-One and ECOC-Based Approaches
    Rocha, Anderson
    Goldenstein, Siome Klein
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (02) : 289 - 302
  • [6] Multiclass financial distress prediction based on one-versus-one decomposition integrated with improved decision-directed acyclic graph
    Sun, Jie
    Li, Jie
    Fujita, Hamido
    Ai, Wenguo
    [J]. JOURNAL OF FORECASTING, 2023, 42 (05) : 1167 - 1186
  • [7] Universum Selection for Boosting the Performance of Multiclass Support Vector Machines Based on One-versus-One Strategy
    Songsiri, Patoomsiri
    Cherkassky, Vladimir
    Kijsirikul, Boonserm
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 159 : 9 - 19
  • [8] Dynamic affinity-based classification of multi-class imbalanced data with one-versus-one decomposition: a fuzzy rough set approach
    Vluymans, Sarah
    Fernandez, Alberto
    Saeys, Yvan
    Cornelis, Chris
    Herrera, Francisco
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 56 (01) : 55 - 84
  • [9] Exploring the effectiveness of dynamic ensemble selection in the one-versus-one scheme
    Zhang, Zhong-Liang
    Luo, Xing-Gang
    Garcia, Salvador
    Tang, Jia-Fu
    Herrera, Francisco
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 125 : 53 - 63