Sampling matters: SGD smoothing through importance sampling

被引:0
|
作者
Zancato, Luca [1 ]
Chiuso, Alessandro [1 ]
机构
[1] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
关键词
D O I
10.1109/CDC51059.2022.9992486
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many authors have suggested studying the loss landscape of Deep Neural Networks as a tool to understand their generalisation capabilities, the performance of optimisation algorithms as well as to tailor acceleration schemes. Nonetheless, a complete understanding of the entire learning process is still missing. This is mainly due to the complexity of the loss landscape, characterised by a rich structure with many local minima and saddle points. The purpose of this paper is to study the stochastic nature of Stochastic Gradient Descent and its link with the loss function as well as to propose a data sampling scheme which favours smoothing of the loss landscape to accelerate convergence speed. We validate our sampling scheme on AlexNet and ResNet applied to the CIFAR-10 dataset.
引用
收藏
页码:3359 / 3364
页数:6
相关论文
共 50 条
  • [1] IS-ASGD: Accelerating Asynchronous SGD using Importance Sampling
    Wang, Fei
    Gao, Xiaofeng
    Ye, Jun
    Chen, Guihai
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [2] Another look at rejection sampling through importance sampling
    Chen, YG
    [J]. STATISTICS & PROBABILITY LETTERS, 2005, 72 (04) : 277 - 283
  • [3] Fixed-lag smoothing using sequential importance sampling
    Clapp, TC
    Godsill, SJ
    [J]. BAYESIAN STATISTICS 6, 1999, : 743 - 752
  • [4] SGD with Coordinate Sampling: Theory and Practice
    Leluc, Remi
    Portier, Francois
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [5] SGD with Coordinate Sampling: Theory and Practice
    Leluc, Rémi
    Portier, François
    [J]. Journal of Machine Learning Research, 2022, 23
  • [6] SAMPLING AND SMOOTHING OF SPECTRA
    WILLSON, PD
    EDWARDS, TH
    [J]. APPLIED SPECTROSCOPY REVIEWS, 1976, 12 (01) : 1 - 81
  • [7] Adaptive Sampling for SGD by Exploiting Side Information
    Gopal, Siddharth
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [8] Accelerating Stratified Sampling SGD by Reconstructing Strata
    Liu, Weijie
    Qian, Hui
    Zhang, Chao
    Shen, Zebang
    Xie, Jiahao
    Zheng, Nenggan
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2725 - 2731
  • [9] The importance of importance sampling
    Beichl, I
    Sullivan, F
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 1999, 1 (02) : 71 - 73
  • [10] Sampling design matters
    Beam, CA
    [J]. MEDICAL DECISION MAKING, 1996, 16 (02) : 197 - 197