Minimax semi-supervised set-valued approach to multi-class classification

被引:0
|
作者
Chzhen, Evgenii [1 ]
Denis, Christophe [2 ]
Hebiri, Mohamed [2 ]
机构
[1] Univ Paris Saclay, Lab Math Orsay, ICNRS, INRIA, F-91405 Orsay, France
[2] Univ Gustave Eiffel, Lab Anal & Math Appl, F-77454 Marne La Vallee 2, France
关键词
Multi-class classification; set-valued classification; minimax optimality; semi-supervised classification; DENSITY CONTOUR; OPTIMAL RATES; BOUNDS;
D O I
10.3150/20-BEJ1313
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study supervised and semi-supervised algorithms in the set-valued classification framework with controlled expected size. While the former methods can use only n labeled samples, the latter are able to make use of N additional unlabeled data. We obtain semi-supervised minimax rates of convergence under the a-margin assumption and a beta-Holder condition on the conditional distribution of labels. Our analysis implies that if no further assumption is made, there is no supervised method that outperforms the semi-supervised estimator proposed in this work- the best achievable rate for any supervised method is O (n(-1/2)), even if the margin assumption is extremely favorable; on the contrary, the developed semi-supervised estimator can achieve faster O((n/log n)(-(1+alpha)beta/(2 beta+d))) rate of convergence provided that sufficiently many unlabeled samples are available. We also show that under additional smoothness assumption, supervised methods are able to achieve faster rates and the unlabeled sample cannot improve the rate of convergence. Finally, a numerical study supports our theory and emphasizes the relevance of the assumptions we required from an empirical perspective.
引用
收藏
页码:2389 / 2412
页数:24
相关论文
共 50 条
  • [1] Semi-Supervised Boosting for Multi-Class Classification
    Valizadegan, Hamed
    Jin, Rong
    Jain, Anil K.
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 522 - 537
  • [2] Efficient set-valued prediction in multi-class classification
    Thomas Mortier
    Marek Wydmuch
    Krzysztof Dembczyński
    Eyke Hüllermeier
    Willem Waegeman
    [J]. Data Mining and Knowledge Discovery, 2021, 35 : 1435 - 1469
  • [3] Efficient set-valued prediction in multi-class classification
    Mortier, Thomas
    Wydmuch, Marek
    Dembczynski, Krzysztof
    Huellermeier, Eyke
    Waegeman, Willem
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 35 (04) : 1435 - 1469
  • [4] Improving Multi-class Classification for Endomicroscopic Images by Semi-supervised Learning
    Wu, Hang
    Tong, Li
    Wang, May D.
    [J]. 2017 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI), 2017, : 5 - 8
  • [5] A Multi-class Semi-Supervised Classification Algorithm Based on Evidence Theory
    一种基于证据理论的多类半监督分类算法
    [J]. 2018, Chinese Institute of Electronics (46):
  • [6] Semi-supervised Batch Mode Active Learning for Multi-class Classification
    Lv, Jujian
    Zhao, Huimin
    Chen, Rongjun
    Zhan, Jin
    Li, Jianhong
    Lin, Kaihan
    Li, Canyao
    [J]. ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, 2020, 11691 : 117 - 127
  • [7] Regularized Multi-Class Semi-Supervised Boosting
    Saffari, Amir
    Leistner, Christian
    Bischof, Horst
    [J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 967 - 974
  • [8] Dynamic label propagation for semi-supervised multi-class multi-label classification
    Wang, Bo
    Tsotsos, John
    [J]. PATTERN RECOGNITION, 2016, 52 : 75 - 84
  • [9] Dynamic Label Propagation for Semi-supervised Multi-class Multi-label Classification
    Wang, Bo
    Tu, Zhuowen
    Tsotsos, John K.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 425 - 432
  • [10] Laplacian adaptive weighted discriminant analysis for semi-supervised multi-class classification
    Xue, Yangtao
    Zhang, Li
    Yin, Hongwei
    [J]. NEUROCOMPUTING, 2024, 584