Risk upper bounds for general ensemble methods with an application to multiclass classification

Cited by: 5
Authors
Laviolette, Francois [1]
Morvant, Emilie [2]
Ralaivola, Liva [3]
Roy, Jean-Francis [1,4]
Affiliations
[1] Univ Laval, Dept Informat & Genie Logiciel, Quebec City, PQ G1K 7P4, Canada
[2] Univ Lyon, UJM St Etienne, CNRS, IOGS,Lab Hubert Curien UMR 5516, F-42023 St Etienne, France
[3] Aix Marseille Univ, CNRS, Cent Marseille, LIF,QARMA, Marseille, France
[4] Coveo Solut Inc, Quebec City, PQ, Canada
Keywords
Majority vote; Ensemble methods; PAC-Bayesian Theory; Multiclass classification; Multilabel Prediction; PAC-BAYESIAN ANALYSIS
DOI
10.1016/j.neucom.2016.09.016
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
This paper generalizes a pivotal result from the PAC-Bayesian literature, the C-bound, primarily designed for binary classification, to the general case of ensemble methods of voters with arbitrary outputs. We provide a generic version of the C-bound, an upper bound over the risk of models expressed as a weighted majority vote that is based on the first and second statistical moments of the vote's margin. On the one hand, this bound may advantageously be applied on more complex outputs than mere binary outputs, such as multiclass labels and multilabel, and on the other hand, it allows us to consider margin relaxations. We provide a specialization of the bound to multiclass classification together with empirical evidence that the presented theoretical result is tightly bound to the risk of the majority vote classifier. We also give insights as to how the proposed bound may be of use to characterize the risk of multilabel predictors.
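For orientation, the binary-classification C-bound that the abstract builds on is commonly stated as below. This is a sketch in assumed notation, not text from this record: Q is the weighting over voters h, D the data distribution, B_Q the Q-weighted majority vote, M_Q the margin, and labels are taken in {-1, +1}. The generalized and multiclass versions in the paper itself may be written differently.

```latex
% Assumed notation (not from this record): y \in \{-1,+1\}, voters h with outputs in [-1,1].
% Margin of the Q-weighted vote on an example (x, y):
\[
  M_Q(x, y) \;=\; y \,\mathbb{E}_{h \sim Q}\, h(x)
\]
% First and second moments of the margin under D:
\[
  \mu_1 \;=\; \mathbb{E}_{(x,y) \sim D}\, M_Q(x, y),
  \qquad
  \mu_2 \;=\; \mathbb{E}_{(x,y) \sim D}\, \bigl(M_Q(x, y)\bigr)^2
\]
% C-bound: if \mu_1 > 0, the risk of the majority vote satisfies
\[
  R_D(B_Q) \;=\; \Pr_{(x,y) \sim D}\bigl(M_Q(x, y) \le 0\bigr)
  \;\le\; 1 - \frac{\mu_1^{2}}{\mu_2}
\]
```

The inequality follows from the one-sided Chebyshev (Cantelli) inequality applied to the margin, which is why only its first two moments appear.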
Pages: 15-25
Number of pages: 11
Related papers
50 items in total
  • [1] Blind Multiclass Ensemble Classification
    Traganitis, Panagiotis A.
    Pages-Zamora, Alba
    Giannakis, Georgios B.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (18) : 4737 - 4752
  • [2] A hybrid ensemble for classification in multiclass datasets: An application to oilseed disease dataset
    Chaudhary, Archana
    Kolhe, Savita
    Kamal, Raj
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2016, 124 : 65 - 72
  • [3] Towards unbalanced multiclass intrusion detection with hybrid sampling methods and ensemble classification
    Le, Thi-Thu-Huong
    Shin, Yeongjae
    Kim, Myeongkil
    Kim, Howon
    APPLIED SOFT COMPUTING, 2024, 157
  • [4] On the consistency of multiclass classification methods
    Tewari, Ambuj
    Bartlett, Peter L.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 1007 - 1025
  • [5] Multiclass classification methods in ecology
    Bourel, M.
    Segura, A. M.
    ECOLOGICAL INDICATORS, 2018, 85 : 1012 - 1021
  • [6] On the consistency of multiclass classification methods
    Tewari, A
    Bartlett, PL
    LEARNING THEORY, PROCEEDINGS, 2005, 3559 : 143 - 157
  • [7] New Bounds on the Accuracy of Majority Voting for Multiclass Classification
    Aeeneh, Sina
    Zlatanov, Nikola
    Yu, Jiangshan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [8] Ensemble approaches of support vector machines for multiclass classification
    Min, Jun-Ki
    Hong, Jin-Hyuk
    Cho, Sung-Bae
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2007, 4815 : 1 - 10
  • [9] Strength of ensemble learning in multiclass classification of rockburst intensity
    Zhang, Junfei
    Wang, Yuhang
    Sun, Yuantian
    Li, Guichen
    INTERNATIONAL JOURNAL FOR NUMERICAL AND ANALYTICAL METHODS IN GEOMECHANICS, 2020, 44 (13) : 1833 - 1853
  • [10] A Cluster-Based Semisupervised Ensemble for Multiclass Classification
    Soares, Rodrigo G. F.
    Chen, Huanhuan
    Yao, Xin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2017, 1 (06) : 408 - 420