Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

Cited by: 7
Authors
Ponti, Moacir A. [1 ]
dos Santos, Fernando P. [1 ]
Ribeiro, Leo S. F. [1 ]
Cavallari, Gabriel B. [1 ]
Affiliations
[1] ICMC Univ Sao Paulo USP, Sao Carlos, SP, Brazil
Source
2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021) | 2021
Funding
São Paulo Research Foundation, Brazil
DOI
10.1109/SIBGRAPI54419.2021.00011
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Training deep neural networks can be challenging on real-world data. Using models as black boxes, even with transfer learning, can result in poor generalization or inconclusive results when it comes to small datasets or specific applications. This tutorial covers the basic steps as well as more recent options to improve models, in particular, but not restricted to, supervised learning. It can be particularly useful for datasets that are not as well prepared as those in challenges, and also under scarce annotation and/or small data. We describe basic procedures such as data preparation, optimization and transfer learning, but also recent architectural choices such as the use of transformer modules, alternative convolutional layers, activation functions, and width/depth, as well as training procedures including curriculum, contrastive and self-supervised learning.
Pages: 9-16
Number of pages: 8