Challenges for the Repeatability of Deep Learning Models

被引:35
|
作者
Alahmari, Saeed S. [1 ,2 ]
Goldgof, Dmitry B. [1 ]
Mouton, Peter R. [1 ,3 ]
Hall, Lawrence O. [1 ]
机构
[1] Univ S Florida, Dept Comp Sci & Engn, Tampa, FL 33620 USA
[2] Najran Univ, Dept Comp Sci, Najran 664624207, Saudi Arabia
[3] SRC Biosci, Tampa, FL 33606 USA
基金
美国国家科学基金会;
关键词
Deep learning; Training; Libraries; Computer architecture; Software; Computational modeling; Microprocessors; Pytorch; torch; Keras; TensorFlow; reproducibility; reproducible; repeatability; replicability; replicable deep learning models; deterministic models; determinism; ARBITRARY PARTICLES; UNBIASED ESTIMATION; NUMBER; PROVENANCE;
D O I
10.1109/ACCESS.2020.3039833
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning training typically starts with a random sampling initialization approach to set the weights of trainable layers. Therefore, different and/or uncontrolled weight initialization prevents learning the same model multiple times. Consequently, such models yield different results during testing. However, even with the exact same initialization for the weights, a lack of repeatability, replicability, and reproducibility may still be observed during deep learning for many reasons such as software versions, implementation variations, and hardware differences. In this article, we study repeatability when training deep learning models for segmentation and classification tasks using U-Net and LeNet-5 architectures in two development environments Pytorch and Keras (with TensorFlow backend). We show that even with the available control of randomization in Keras and TensorFlow, there are uncontrolled randomizations. We also show repeatable results for the same deep learning architectures using the Pytorch deep learning library. Finally, we discuss variations in the implementation of the weight initialization algorithm across deep learning libraries as a source of uncontrolled error in deep learning results.
引用
收藏
页码:211860 / 211868
页数:9
相关论文
共 50 条
  • [31] Deep Learning in Cybersecurity: Challenges and Approaches
    Imamverdiyev, Yadigar N.
    Abdullayeva, Fargana J.
    [J]. INTERNATIONAL JOURNAL OF CYBER WARFARE AND TERRORISM, 2020, 10 (02) : 82 - 105
  • [32] Understanding deep learning - challenges and prospects
    Adnan, Niha
    Umer, Fahad
    [J]. JOURNAL OF THE PAKISTAN MEDICAL ASSOCIATION, 2022, 72 (02) : S66 - S70
  • [33] Deep Learning in Neuroimaging: Promises and challenges
    Yan, Weizheng
    Qu, Gang
    Hu, Wenxing
    Abrol, Anees
    Cai, Biao
    Qiao, Chen
    Plis, Sergey M.
    Wang, Yu-Ping
    Sui, Jing
    Calhoun, Vince D.
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2022, 39 (02) : 87 - 98
  • [34] Software Engineering Challenges of Deep Learning
    Arpteg, Anders
    Brinne, Bjorn
    Crnkovic-Friis, Luka
    Bosch, Jan
    [J]. 44TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2018), 2018, : 50 - 59
  • [35] Challenges in Deep Learning for Multimodal Applications
    Ghosh, Sayan
    [J]. ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, : 611 - 615
  • [36] Deep Learning in Automotive: Challenges and Opportunities
    Falcini, Fabio
    Lami, Giuseppe
    [J]. SOFTWARE PROCESS IMPROVEMENT AND CAPABILITY DETERMINATION, SPICE 2017, 2017, 770 : 279 - 288
  • [37] Challenges concerning deep learning in SPOCs
    Filius, Renee M.
    de Kleijn, Renske A. M.
    Uijl, Sabine G.
    Prins, Frans J.
    van Rijen, Harold V. M.
    Grobbee, Diederick E.
    [J]. INTERNATIONAL JOURNAL OF TECHNOLOGY ENHANCED LEARNING, 2018, 10 (1-2) : 111 - 127
  • [38] List of Deep Learning Models
    Mosavi, Amir
    Ardabili, Sina
    Varkonyi-Koczy, Annamaria R.
    [J]. ENGINEERING FOR SUSTAINABLE FUTURE, 2020, 101 : 202 - 214
  • [39] Learning Deep Generative Models
    Salakhutdinov, Ruslan
    [J]. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 2, 2015, 2 : 361 - 385
  • [40] Learning the Space of Deep Models
    Berardi, Gianluca
    De Luigi, Luca
    Salti, Samuele
    Di Stefano, Luigi
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2482 - 2488