Dropout Distillation

被引:0
|
作者
Bulo, Samuel Rota [1 ]
Porzi, Lorenzo [1 ]
Kontschieder, Peter [2 ,3 ]
机构
[1] FBK Irst, Trento, Italy
[2] Mapillary, Graz, Austria
[3] Microsoft Res, Cambridge, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dropout is a popular stochastic regularization technique for deep neural networks that works by randomly dropping (i.e. zeroing) units from the network during training. This randomization process allows to implicitly train an ensemble of exponentially many networks sharing the same parametrization, which should be averaged at test time to deliver the final prediction. A typical workaround for this intractable averaging operation consists in scaling the layers undergoing dropout randomization. This simple rule called "standard dropout" is efficient, but might degrade the accuracy of the prediction. In this work we introduce a novel approach, coined "dropout distillation", that allows us to train a predictor in a way to better approximate the intractable, but preferable, averaging process, while keeping under control its computational efficiency. We are thus able to construct models that are as efficient as standard dropout, or even more efficient, while being more accurate. Experiments on standard benchmark datasets demonstrate the validity of our method, yielding consistent improvements over conventional dropout.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] DROPOUT DYNAMICS
    TUEL, JK
    CALIFORNIA JOURNAL OF EDUCATIONAL RESEARCH, 1966, 17 (01): : 5 - 11
  • [42] Wasserstein dropout
    Joachim Sicking
    Maram Akila
    Maximilian Pintz
    Tim Wirtz
    Stefan Wrobel
    Asja Fischer
    Machine Learning, 2024, 113 : 3161 - 3204
  • [43] Hybrid dropout
    Park, Chongsun
    Lee, MyeongGyu
    KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (06) : 899 - 908
  • [44] SORORITY DROPOUT
    ROSE, HA
    ELTON, CF
    JOURNAL OF COLLEGE STUDENT PERSONNEL, 1971, 12 (06): : 460 - 463
  • [45] Low-income Latinos and dropout: Strategies to prevent dropout
    Schwarzbaum, SE
    JOURNAL OF MULTICULTURAL COUNSELING AND DEVELOPMENT, 2004, 32 : 296 - 306
  • [46] Dropout rates and reasons for dropout among patients receiving clozapine
    Grover, Sandeep
    Mishra, Eepsita
    Chakrabarti, Subho
    INDIAN JOURNAL OF PSYCHIATRY, 2023, 65 (06) : 680 - 686
  • [47] Developing a definition of early ECT dropout and exploring correlates of dropout
    Mahgoub, Yassir
    Hamlin, Dallas
    Francis, Andrew
    GENERAL HOSPITAL PSYCHIATRY, 2023, 85 : 247 - 248
  • [48] Dropout in adult education as a phenomenon of fit - an integrative model proposal for the genesis of dropout in adult education based on dropout experiences
    Thalhmmer, Veronika
    Hoffman, Stefanie
    von Hippel, Alga
    Schmidt-Hertha, Bernhard
    EUROPEAN JOURNAL FOR RESEARCH ON THE EDUCATION AND LEARNING OF ADULTS, 2022, 13 (03): : 231 - 246
  • [49] Advanced Dropout: A Model-Free Methodology for Bayesian Dropout Optimization
    Xie, Jiyang
    Ma, Zhanyu
    Lei, Jianjun
    Zhang, Guoqiang
    Xue, Jing-Hao
    Tan, Zheng-Hua
    Guo, Jun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 4605 - 4625
  • [50] Checkerboard Dropout: A Structured Dropout With Checkerboard Pattern for Convolutional Neural Networks
    Nguyen, Khanh-Binh
    Choi, Jaehyuk
    Yang, Joon-Sung
    IEEE ACCESS, 2022, 10 : 76044 - 76054