Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

被引：7

作者：

Ponti, Moacir A. ^{[1
]}

dos Santos, Fernando P. ^{[1
]}

Ribeiro, Leo S. F. ^{[1
]}

Cavallari, Gabriel B. ^{[1
]}

机构：

[1] ICMC Univ Sao Paulo USP, Sao Carlos, SP, Brazil

来源：

2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021) | 2021年

基金：

巴西圣保罗研究基金会;

关键词：

D O I：

10.1109/SIBGRAPI54419.2021.00011

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Training deep neural networks may be challenging in real world data. Using models as black-boxes, even with transfer learning, can result in poor generalization or inconclusive results when it comes to small datasets or specific applications. This tutorial covers the basic steps as well as more recent options to improve models, in particular, but not restricted to, supervised learning. It can be particularly useful in datasets that are not as well-prepared as those in challenges, and also under scarce annotation and/or small data. We describe basic procedures as data preparation, optimization and transfer learning, but also recent architectural choices such as use of transformer modules, alternative convolutional layers, activation functions, wide/depth, as well as training procedures including curriculum, contrastive and self-supervised learning.

引用

页码：9 / 16

页数：8

共 23 条

[21] SAR Image Despeckling by Deep Neural Networks: from a Pre-Trained Model to an End-to-End Training Strategy
Dalsasso, Emanuele
Yang, Xiangli
Denis, Loic
Tupin, Florence
Yang, Wen
REMOTE SENSING, 2020, 12 (16)
[22] Multi-timescale Feature-extraction Architecture of Deep Neural Networks for Acoustic Model Training from Raw Speech Signal
Takeda, Ryu
Nakadai, Kazuhiro
Komatani, Kazunori
2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 2503 - 2510
[23] 3D-RADNet: Extracting labels from DICOM metadata for training general medical domain deep 3D convolution neural networks
Du, Richard
Vardhanabhuti, Varut
MEDICAL IMAGING WITH DEEP LEARNING, VOL 121, 2020, 121 : 174 - 192

← 1 2 3 →