Dropout Distillation

被引:0
|
作者
Bulo, Samuel Rota [1 ]
Porzi, Lorenzo [1 ]
Kontschieder, Peter [2 ,3 ]
机构
[1] FBK Irst, Trento, Italy
[2] Mapillary, Graz, Austria
[3] Microsoft Res, Cambridge, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dropout is a popular stochastic regularization technique for deep neural networks that works by randomly dropping (i.e. zeroing) units from the network during training. This randomization process allows to implicitly train an ensemble of exponentially many networks sharing the same parametrization, which should be averaged at test time to deliver the final prediction. A typical workaround for this intractable averaging operation consists in scaling the layers undergoing dropout randomization. This simple rule called "standard dropout" is efficient, but might degrade the accuracy of the prediction. In this work we introduce a novel approach, coined "dropout distillation", that allows us to train a predictor in a way to better approximate the intractable, but preferable, averaging process, while keeping under control its computational efficiency. We are thus able to construct models that are as efficient as standard dropout, or even more efficient, while being more accurate. Experiments on standard benchmark datasets demonstrate the validity of our method, yielding consistent improvements over conventional dropout.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] DROPOUT DEBATE
    MCGUIRE, HH
    WOODBURY, RA
    COHEN, S
    ANTONIUS, JI
    STRAUSS, MB
    NEW ENGLAND JOURNAL OF MEDICINE, 1969, 281 (12): : 681 - &
  • [22] Continuous Dropout
    Shen, Xu
    Tian, Xinmei
    Liu, Tongliang
    Xu, Fang
    Tao, Dacheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) : 3926 - 3937
  • [23] Wasserstein dropout
    Sicking, Joachim
    Akila, Maram
    Pintz, Maximilian
    Wirtz, Tim
    Wrobel, Stefan
    Fischer, Asja
    MACHINE LEARNING, 2024, 113 (05) : 3161 - 3204
  • [24] THE ELUSIVE DROPOUT
    LANSON, E
    VOCATIONAL GUIDANCE QUARTERLY, 1961, 9 (03): : 167 - 168
  • [25] CHANNEL DROPOUT
    BURSTEIN, H
    AUDIO, 1984, 68 (04): : 18 - 18
  • [26] FASEB dropout
    Russo, E
    SCIENTIST, 2000, 14 (01): : 25 - 25
  • [27] Dropout Attacks
    Yuan, Andrew
    Oprea, Alina
    Tan, Cheng
    45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024, 2024, : 1255 - 1269
  • [28] BECOMING A DROPOUT
    MOTZ, AB
    PHYLON, 1969, 30 (02) : 125 - 138
  • [29] DOCTOR OR DROPOUT
    STRAUSS, MB
    NEW ENGLAND JOURNAL OF MEDICINE, 1969, 280 (25): : 1417 - &
  • [30] Dropout prevention
    不详
    MANUFACTURING ENGINEERING, 2007, 139 (03): : 27 - 27