An empirical study of binary classifier fusion methods for multiclass classification

被引:60
|
作者
Garcia-Pedrajas, Nicolas [1 ]
Ortiz-Boyer, Domingo [1 ]
机构
[1] Univ Cordoba, Dept Comp & Numer Anal, E-14071 Cordoba, Spain
关键词
Multiclass problems; Class binarization; Output coding; Classification; CORRECTING OUTPUT CODES; DEPENDENT DESIGN; ERROR; DIVERSITY; ENSEMBLES; TESTS;
D O I
10.1016/j.inffus.2010.06.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the most important topics in information fusion is the combination of individual classifiers in multi-classifier systems. We have two different tasks in this area: one is the training and construction of ensembles of classifiers, with each one being able to solve the multiclass problem; the other task is the fusion of binary classifiers, with each one solving a different two-class problem to construct a multiclass classifier. This paper is devoted to the study of several aspects on the fusion process of binary classifiers to obtain a multiclass classifier. In the general case of a classification problem with more than two classes, we are faced with the issue that many algorithms either work better with two-class problems or are specifically designed for two-class problems. In such cases, a binarization method that maps the multiclass problem into several two-class problems must be used. In this task, information fusion plays a central role because of the combination of the prediction of the different binary classifiers into a multiclass classifier. Several issues regarding the way binary learners are trained and combined are raised by this task. Issues such as individual accuracy, diversity, and independence are common to other information fusion tasks such as the construction of ensembles of classifiers. This paper presents a study of the different class binarization methods for the various standard multiclass classification problems that have been proposed while addressing aspects not considered in previous works. We are especially concerned with many of the general assumptions in the field that have not been fully assessed by experimentation. We test the different methods in a large set of real-world problems from the UCI Machine Learning Repository, and we use six different base learners. Our results corroborate some of the previous results present in the literature. Furthermore, we present new results regarding the influence of the base learner on the performance of each method. We also show new results on the behavior of binary testing error and the independence of binary classifiers depending on the coding strategy. Finally, we study the behavior of the methods when the number of classes is high and in the presence of noise. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:111 / 130
页数:20
相关论文
共 50 条
  • [21] Filter Selection Methods for Multiclass Classification
    Cascaro, Rhodessa J.
    Gerardo, Bobby D.
    Medina, Ruji P.
    2019 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND BIG DATA (ICCBD 2019), 2019, : 27 - 31
  • [22] Nonparametric plug-in classifier for multiclass classification of SDE paths
    Denis, Christophe
    Dion-Blanc, Charlotte
    Ella-Mintsa, Eddy
    Tran, Viet Chi
    SCANDINAVIAN JOURNAL OF STATISTICS, 2024, 51 (03) : 1103 - 1160
  • [23] Binary imbalanced big data classification based on fuzzy data reduction and classifier fusion
    Junhai Zhai
    Mohan Wang
    Sufang Zhang
    Soft Computing, 2022, 26 : 2781 - 2792
  • [24] Binary imbalanced big data classification based on fuzzy data reduction and classifier fusion
    Zhai, Junhai
    Wang, Mohan
    Zhang, Sufang
    SOFT COMPUTING, 2022, 26 (06) : 2781 - 2792
  • [25] An empirical evaluation of the classification error of two thresholding methods for Fisher's classifier
    Rueda, L
    Ngom, A
    IC-AI '04 & MLMTA'04 , VOL 1 AND 2, PROCEEDINGS, 2004, : 837 - 842
  • [26] Novel multiclass SVM-based binary decision tree classifier
    Osman, Hossam
    2007 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1-3, 2007, : 540 - 543
  • [27] Binary Shapelet Transform for Multiclass Time Series Classification
    Bostrom, Aaron
    Bagnall, Anthony
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, 2015, 9263 : 257 - 269
  • [28] Binary Shapelet Transform for Multiclass Time Series Classification
    Bostrom, Aaron
    Bagnall, Anthony
    TRANSACTIONS ON LARGE-SCALE DATA- AND KNOWLEDGE-CENTERED SYSTEMS XXXII, 2017, 10420 : 24 - 46
  • [29] Classifier fusion for VoIP attacks classification
    Safarik, Jakub
    Rezac, Filip
    SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXVI, 2017, 10200
  • [30] Improving Genetic Programming Classification For Binary And Multiclass Datasets
    Al-Madi, Nailah
    Ludwig, Simone A.
    2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING (CIDM), 2013, : 166 - 173