Gradient-based Algorithms for Machine Teaching

被引:6
|
作者
Wang, Pei [1 ]
Nagrecha, Kabir [1 ]
Vasconcelos, Nuno [1 ]
机构
[1] Univ Calif San Diego, San Diego, CA 92093 USA
关键词
LEARNING-ABILITY;
D O I
10.1109/CVPR46437.2021.00144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of machine teaching is considered. A new formulation is proposed under the assumption of an optimal student, where optimality is defined in the usual machine learning sense of empirical risk minimization. This is a sensible assumption for machine learning students and for human students in crowdsourcing platforms, who tend to perform at least as well as machine learning systems. It is shown that, if allowed unbounded effort, the optimal student always learns the optimal predictor for a classification task. Hence, the role of the optimal teacher is to select the teaching set that minimizes student effort. This is formulated as a problem of functional optimization where, at each teaching iteration, the teacher seeks to align the steepest descent directions of the risk of (1) the teaching set and (2) entire example population. The optimal teacher, denoted MaxGrad, is then shown to maximize the gradient of the risk on the set of new examples selected per iteration. MaxGrad teaching algorithms are finally provided for both binary and multiclass tasks, and shown to have some similarities with boosting algorithms. Experimental evaluations demonstrate the effectiveness of MaxGrad, which outperforms previous algorithms on the classification task, for both machine learning and human students from MTurk, by a substantial margin.
引用
收藏
页码:1387 / 1396
页数:10
相关论文
共 50 条
  • [1] Auxiliary gradient-based sampling algorithms
    Titsias, Michalis K.
    Papaspiliopoulos, Omiros
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2018, 80 (04) : 749 - 767
  • [2] Yield prediction for crops by gradient-based algorithms
    Mahesh, Pavithra
    Soundrapandiyan, Rajkumar
    [J]. PLOS ONE, 2024, 19 (08):
  • [3] Gradient-based genetic algorithms in image registration
    Maslov, IV
    Gertner, I
    [J]. AUTOMATIC TARGET RECOGNITION XI, 2001, 4379 : 509 - 520
  • [4] Orthogonal wavelets and stochastic gradient-based algorithms
    Attallah, S
    Najim, M
    [J]. PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS, 1996, : 405 - 408
  • [5] Robust multiscale algorithms for gradient-based motion estimation
    Lu, Qing-Hua
    Zhang, Xian-Min
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2007, 17 (06) : 333 - 340
  • [6] Selective coefficient update of gradient-based adaptive algorithms
    Aboulnasr, T
    Mayyas, K
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1929 - 1932
  • [7] Empirical Characterization of Discretization Error in Gradient-based Algorithms
    Bachrach, Jonathan
    Beal, Jacob
    Horowitz, Joshua
    Qumsiyeh, Dany
    [J]. SASO 2008: SECOND IEEE INTERNATIONAL CONFERENCE ON SELF-ADAPTIVE AND SELF-ORGANIZING SYSTEMS, PROCEEDINGS, 2008, : 203 - 212
  • [8] Hybridization of the multi-objective evolutionary algorithms and the gradient-based algorithms
    Hu, XL
    Huang, ZC
    Wang, ZF
    [J]. CEC: 2003 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-4, PROCEEDINGS, 2003, : 870 - 877
  • [9] Gradient-based optimization algorithms for networks of reconfigurable sensors
    de Groot, T. H.
    Krasnov, O. A.
    Yarovoy, A. G.
    [J]. CONTROL ENGINEERING PRACTICE, 2014, 29 : 74 - 85
  • [10] Comprehensive analysis of gradient-based hyperparameter optimization algorithms
    Bakhteev, O. Y.
    Strijov, V. V.
    [J]. ANNALS OF OPERATIONS RESEARCH, 2020, 289 (01) : 51 - 65