An empirical study of binary classifier fusion methods for multiclass classification

被引:60
|
作者
Garcia-Pedrajas, Nicolas [1 ]
Ortiz-Boyer, Domingo [1 ]
机构
[1] Univ Cordoba, Dept Comp & Numer Anal, E-14071 Cordoba, Spain
关键词
Multiclass problems; Class binarization; Output coding; Classification; CORRECTING OUTPUT CODES; DEPENDENT DESIGN; ERROR; DIVERSITY; ENSEMBLES; TESTS;
D O I
10.1016/j.inffus.2010.06.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the most important topics in information fusion is the combination of individual classifiers in multi-classifier systems. We have two different tasks in this area: one is the training and construction of ensembles of classifiers, with each one being able to solve the multiclass problem; the other task is the fusion of binary classifiers, with each one solving a different two-class problem to construct a multiclass classifier. This paper is devoted to the study of several aspects on the fusion process of binary classifiers to obtain a multiclass classifier. In the general case of a classification problem with more than two classes, we are faced with the issue that many algorithms either work better with two-class problems or are specifically designed for two-class problems. In such cases, a binarization method that maps the multiclass problem into several two-class problems must be used. In this task, information fusion plays a central role because of the combination of the prediction of the different binary classifiers into a multiclass classifier. Several issues regarding the way binary learners are trained and combined are raised by this task. Issues such as individual accuracy, diversity, and independence are common to other information fusion tasks such as the construction of ensembles of classifiers. This paper presents a study of the different class binarization methods for the various standard multiclass classification problems that have been proposed while addressing aspects not considered in previous works. We are especially concerned with many of the general assumptions in the field that have not been fully assessed by experimentation. We test the different methods in a large set of real-world problems from the UCI Machine Learning Repository, and we use six different base learners. Our results corroborate some of the previous results present in the literature. Furthermore, we present new results regarding the influence of the base learner on the performance of each method. We also show new results on the behavior of binary testing error and the independence of binary classifiers depending on the coding strategy. Finally, we study the behavior of the methods when the number of classes is high and in the presence of noise. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:111 / 130
页数:20
相关论文
共 50 条
  • [41] Multiclass SMS Message Categorization: Beyond Spam Binary Classification
    Dewi, Fatia Kusuma
    Fadhlurrahman, Mgs M. Rizqi
    Rahmanianto, Mohamad Dwiyan
    Mahendra, Rahmad
    2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2017, : 210 - 215
  • [42] Reducing multiclass cancer classification to binary by output coding and SVM
    Shen, L
    Tan, EC
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2006, 30 (01) : 63 - 71
  • [43] Binary Imbalanced Data Classification Based on Modified D2GAN Oversampling and Classifier Fusion
    Zhai, Junhai
    Qi, Jiaxing
    Zhang, Sufang
    IEEE ACCESS, 2020, 8 (169456-169469) : 169456 - 169469
  • [44] Twin SVM for conditional probability estimation in binary and multiclass classification
    Shao, Yuan -Hai
    Lv, Xiao-Jing
    Huang, Ling-Wei
    Bai, Lan
    PATTERN RECOGNITION, 2023, 136
  • [45] On Binary Reduction of Large-Scale Multiclass Classification Problems
    Joshi, Bikash
    Amini, Massih-Reza
    Partalas, Ioannis
    Ralaivola, Liva
    Usunier, Nicolas
    Gaussier, Eric
    ADVANCES IN INTELLIGENT DATA ANALYSIS XIV, 2015, 9385 : 132 - 144
  • [46] Detecting Smart Contract Vulnerabilities with Combined Binary and Multiclass Classification
    Mezina, Anzhelika
    Ometov, Aleksandr
    CRYPTOGRAPHY, 2023, 7 (03)
  • [47] Information Theoretic Learning and local modeling for binary and multiclass classification
    Porto-Díaz, Iago
    Martínez-Rego, David
    Alonso-Betanzos, Amparo
    Fontenla-Romero, Oscar
    Progress in Artificial Intelligence, 2012, 1 (04) : 315 - 328
  • [48] A fusion neural network classifier for image classification
    Kang, Sanggil
    Park, Sungjoon
    PATTERN RECOGNITION LETTERS, 2009, 30 (09) : 789 - 793
  • [49] A Comparative Study of Binary Classification Methods for Pulsar Detection
    Priyanka, V
    Anil, B. S.
    Dinakar, B. R.
    2018 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT - 2018), 2018, : 345 - 349
  • [50] Applying the Multiclass Classification Methods for the Classification of Online Social Network Friends
    Sever, Nikolina
    Humski, Luka
    Ilic, Juraj
    Skocir, Zoran
    Pintar, Damir
    Vranic, Mihaela
    2017 25TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM), 2017, : 67 - 72