Challenges for the Repeatability of Deep Learning Models

被引：35

作者：

Alahmari, Saeed S. ^{[1
,2
]}

Goldgof, Dmitry B. ^{[1
]}

Mouton, Peter R. ^{[1
,3
]}

Hall, Lawrence O. ^{[1
]}

机构：

[1] Univ S Florida, Dept Comp Sci & Engn, Tampa, FL 33620 USA

[2] Najran Univ, Dept Comp Sci, Najran 664624207, Saudi Arabia

[3] SRC Biosci, Tampa, FL 33606 USA

来源：

IEEE ACCESS | 2020年 / 8卷

基金：

美国国家科学基金会;

关键词：

Deep learning; Training; Libraries; Computer architecture; Software; Computational modeling; Microprocessors; Pytorch; torch; Keras; TensorFlow; reproducibility; reproducible; repeatability; replicability; replicable deep learning models; deterministic models; determinism; ARBITRARY PARTICLES; UNBIASED ESTIMATION; NUMBER; PROVENANCE;

D O I：

10.1109/ACCESS.2020.3039833

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep learning training typically starts with a random sampling initialization approach to set the weights of trainable layers. Therefore, different and/or uncontrolled weight initialization prevents learning the same model multiple times. Consequently, such models yield different results during testing. However, even with the exact same initialization for the weights, a lack of repeatability, replicability, and reproducibility may still be observed during deep learning for many reasons such as software versions, implementation variations, and hardware differences. In this article, we study repeatability when training deep learning models for segmentation and classification tasks using U-Net and LeNet-5 architectures in two development environments Pytorch and Keras (with TensorFlow backend). We show that even with the available control of randomization in Keras and TensorFlow, there are uncontrolled randomizations. We also show repeatable results for the same deep learning architectures using the Pytorch deep learning library. Finally, we discuss variations in the implementation of the weight initialization algorithm across deep learning libraries as a source of uncontrolled error in deep learning results.

引用

页码：211860 / 211868

页数：9

共 50 条

[31] Deep Learning in Cybersecurity: Challenges and Approaches
Imamverdiyev, Yadigar N.
Abdullayeva, Fargana J.
[J]. INTERNATIONAL JOURNAL OF CYBER WARFARE AND TERRORISM, 2020, 10 (02) : 82 - 105
[32] Understanding deep learning - challenges and prospects
Adnan, Niha
Umer, Fahad
[J]. JOURNAL OF THE PAKISTAN MEDICAL ASSOCIATION, 2022, 72 (02) : S66 - S70
[33] Deep Learning in Neuroimaging: Promises and challenges
Yan, Weizheng
Qu, Gang
Hu, Wenxing
Abrol, Anees
Cai, Biao
Qiao, Chen
Plis, Sergey M.
Wang, Yu-Ping
Sui, Jing
Calhoun, Vince D.
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2022, 39 (02) : 87 - 98
[34] Software Engineering Challenges of Deep Learning
Arpteg, Anders
Brinne, Bjorn
Crnkovic-Friis, Luka
Bosch, Jan
[J]. 44TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2018), 2018, : 50 - 59
[35] Challenges in Deep Learning for Multimodal Applications
Ghosh, Sayan
[J]. ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, : 611 - 615
[36] Deep Learning in Automotive: Challenges and Opportunities
Falcini, Fabio
Lami, Giuseppe
[J]. SOFTWARE PROCESS IMPROVEMENT AND CAPABILITY DETERMINATION, SPICE 2017, 2017, 770 : 279 - 288
[37] Challenges concerning deep learning in SPOCs
Filius, Renee M.
de Kleijn, Renske A. M.
Uijl, Sabine G.
Prins, Frans J.
van Rijen, Harold V. M.
Grobbee, Diederick E.
[J]. INTERNATIONAL JOURNAL OF TECHNOLOGY ENHANCED LEARNING, 2018, 10 (1-2) : 111 - 127
[38] List of Deep Learning Models
Mosavi, Amir
Ardabili, Sina
Varkonyi-Koczy, Annamaria R.
[J]. ENGINEERING FOR SUSTAINABLE FUTURE, 2020, 101 : 202 - 214
[39] Learning Deep Generative Models
Salakhutdinov, Ruslan
[J]. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 2, 2015, 2 : 361 - 385
[40] Learning the Space of Deep Models
Berardi, Gianluca
De Luigi, Luca
Salti, Samuele
Di Stefano, Luigi
[J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2482 - 2488

← 1 2 3 4 5 →