Risk upper bounds for general ensemble methods with an application to multiclass classification

Cited by: 5
Authors
Laviolette, Francois [1]
Morvant, Emilie [2]
Ralaivola, Liva [3]
Roy, Jean-Francis [1, 4]
Affiliations
[1] Univ Laval, Dept Informat & Genie Logiciel, Quebec City, PQ G1K 7P4, Canada
[2] Univ Lyon, UJM St Etienne, CNRS, IOGS,Lab Hubert Curien UMR 5516, F-42023 St Etienne, France
[3] Aix Marseille Univ, CNRS, Cent Marseille, LIF,QARMA, Marseille, France
[4] Coveo Solut Inc, Quebec City, PQ, Canada
Keywords
Majority vote; Ensemble methods; PAC-Bayesian theory; Multiclass classification; Multilabel prediction; PAC-Bayesian analysis
DOI
10.1016/j.neucom.2016.09.016
Chinese Library Classification (CLC) number
TP18 [Theory of artificial intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper generalizes a pivotal result from the PAC-Bayesian literature, the C-bound, which was primarily designed for binary classification, to the general case of ensemble methods of voters with arbitrary outputs. We provide a generic version of the C-bound, an upper bound on the risk of models expressed as a weighted majority vote that is based on the first and second statistical moments of the vote's margin. On the one hand, this bound may advantageously be applied to outputs more complex than mere binary outputs, such as multiclass labels and multilabel outputs; on the other hand, it allows us to consider margin relaxations. We provide a specialization of the bound to multiclass classification, together with empirical evidence that the presented theoretical result is tightly bound to the risk of the majority vote classifier. We also give insights into how the proposed bound may be used to characterize the risk of multilabel predictors.
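As an illustration of the quantity the abstract refers to, the following is a minimal sketch of the classical binary C-bound computed on a sample, not the generalized bound of the paper. It assumes voters with outputs in {-1, +1}, nonnegative weights summing to 1, and the usual form 1 - (first moment)^2 / (second moment) of the margin, which requires a positive mean margin. The function names `vote_margins` and `empirical_c_bound` and the toy data are hypothetical.

```python
def vote_margins(votes, weights, labels):
    """Margin of the weighted vote on each example: y * sum_i w_i * h_i(x).

    votes:   one row per example, one entry in {-1, +1} per voter (assumed)
    weights: nonnegative voter weights summing to 1 (assumed)
    labels:  true labels in {-1, +1} (assumed)
    """
    margins = []
    for row, y in zip(votes, labels):
        margins.append(y * sum(w * v for w, v in zip(weights, row)))
    return margins


def empirical_c_bound(margins):
    """Binary C-bound on a sample: 1 - mu1**2 / mu2.

    mu1 and mu2 are the first and second empirical moments of the margin.
    The bound is only defined when the mean margin mu1 is positive.
    """
    n = len(margins)
    mu1 = sum(margins) / n
    mu2 = sum(m * m for m in margins) / n
    if mu1 <= 0:
        raise ValueError("the C-bound requires a positive mean margin")
    return 1.0 - mu1 ** 2 / mu2


# Toy data (hypothetical): 4 examples, 3 uniformly weighted voters.
votes = [[1, 1, -1], [1, -1, -1], [1, 1, 1], [-1, 1, 1]]
weights = [1 / 3, 1 / 3, 1 / 3]
labels = [1, 1, 1, 1]

margins = vote_margins(votes, weights, labels)
bound = empirical_c_bound(margins)
# Majority vote errs when the margin is not strictly positive.
risk = sum(1 for m in margins if m <= 0) / len(margins)
```

On this toy sample the majority vote's error rate does not exceed the computed bound, as the Cantelli-Chebyshev argument behind the binary C-bound guarantees for any margin distribution with a positive mean.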
Pages: 15-25
Number of pages: 11
Related papers
50 in total
  • [41] Ensemble neural networks with novel gene-subsets for multiclass cancer classification
    Hong, Jin-Hyuk
    Cho, Sung-Bae
    NEURAL INFORMATION PROCESSING, PART II, 2008, 4985 : 856 - 865
  • [42] MULTICLASS SVM WITH HIERARCHICAL INTERACTION: APPLICATION TO FACE CLASSIFICATION
    Jiu, Mingyuan
    Pustelnik, Nelly
    Qi, Lin
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018
  • [43] Ensemble methods with outliers for phonocardiogram classification
    Homsi, Masun Nabhan
    Warrick, Philip
    PHYSIOLOGICAL MEASUREMENT, 2017, 38 (08) : 1631 - 1644
  • [44] An improved model accuracy for forecasting risk measures: application of ensemble methods
    Makatjane, Katleho
    Mmelesi, Kesaobaka
    JOURNAL OF APPLIED ECONOMICS, 2024, 27 (01)
  • [45] Transductive methods for distributed ensemble classification
    Miller, David J.
    Pal, Siddharth
    2006 40TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS, VOLS 1-4, 2006, : 1605 - 1610
  • [46] Application of Machine Learning on Brain Cancer Multiclass Classification
    Panca, V.
    Rustam, Z.
    INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2016 (ISCPMS 2016), 2017, 1862
  • [47] Application of Imbalanced Data Classification Quality Metrics as Weighting Methods of the Ensemble Data Stream Classification Algorithms
    Wegier, Weronika
    Ksieniewicz, Pawel
    ENTROPY, 2020, 22 (08)
  • [48] Practical Ensemble Classification Error Bounds for Different Operating Points
    Varshney, Kush R.
    Prenger, Ryan J.
    Marlatt, Tracy L.
    Chen, Barry Y.
    Hanley, William G.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (11) : 2590 - 2601
  • [49] Critical Assessment of the Biomarker Discovery and Classification Methods for Multiclass Metabolomics
    Yang, Qingxia
    Gong, Yaguo
    Zhu, Feng
    ANALYTICAL CHEMISTRY, 2023, 95 (13) : 5542 - 5552
  • [50] Estimating vulnerability metrics with word embedding and multiclass classification methods
    Hakan Kekül
    Burhan Ergen
    Halil Arslan
    International Journal of Information Security, 2024, 23 : 247 - 270