Challenges for the Repeatability of Deep Learning Models

被引:35
|
作者
Alahmari, Saeed S. [1 ,2 ]
Goldgof, Dmitry B. [1 ]
Mouton, Peter R. [1 ,3 ]
Hall, Lawrence O. [1 ]
机构
[1] Univ S Florida, Dept Comp Sci & Engn, Tampa, FL 33620 USA
[2] Najran Univ, Dept Comp Sci, Najran 664624207, Saudi Arabia
[3] SRC Biosci, Tampa, FL 33606 USA
基金
美国国家科学基金会;
关键词
Deep learning; Training; Libraries; Computer architecture; Software; Computational modeling; Microprocessors; Pytorch; torch; Keras; TensorFlow; reproducibility; reproducible; repeatability; replicability; replicable deep learning models; deterministic models; determinism; ARBITRARY PARTICLES; UNBIASED ESTIMATION; NUMBER; PROVENANCE;
D O I
10.1109/ACCESS.2020.3039833
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning training typically starts with a random sampling initialization approach to set the weights of trainable layers. Therefore, different and/or uncontrolled weight initialization prevents learning the same model multiple times. Consequently, such models yield different results during testing. However, even with the exact same initialization for the weights, a lack of repeatability, replicability, and reproducibility may still be observed during deep learning for many reasons such as software versions, implementation variations, and hardware differences. In this article, we study repeatability when training deep learning models for segmentation and classification tasks using U-Net and LeNet-5 architectures in two development environments Pytorch and Keras (with TensorFlow backend). We show that even with the available control of randomization in Keras and TensorFlow, there are uncontrolled randomizations. We also show repeatable results for the same deep learning architectures using the Pytorch deep learning library. Finally, we discuss variations in the implementation of the weight initialization algorithm across deep learning libraries as a source of uncontrolled error in deep learning results.
引用
收藏
页码:211860 / 211868
页数:9
相关论文
共 50 条
  • [21] Advances and Challenges of Deep Learning
    Wang, Shui-Hua
    Zhang, Yu-Dong
    [J]. Recent Patents on Engineering, 2023, 17 (04):
  • [22] Challenges of Deep Learning in Cancers
    Tebbe, Elliot A.
    Simone, Melissa
    Greene, Madelyne Z.
    [J]. INTERNATIONAL JOURNAL OF MENTAL HEALTH NURSING, 2023, 32 (04) : 1148 - 1159
  • [23] Challenges of Deep Learning in Cancers
    Zhang, Yudong
    Hong, Jin
    [J]. TECHNOLOGY IN CANCER RESEARCH & TREATMENT, 2023, 22
  • [24] Deep Q-Learning in Robotics: Improvement of Accuracy and Repeatability
    Sumanas, Marius
    Petronis, Algirdas
    Bucinskas, Vytautas
    Dzedzickis, Andrius
    Virzonis, Darius
    Morkvenaite-Vilkonciene, Inga
    [J]. SENSORS, 2022, 22 (10)
  • [25] Automatic Image Annotation Based on Deep Learning Models: A Systematic Review and Future Challenges
    Adnan, Myasar Mundher
    Rahim, Mohd Shafry Mohd
    Rehman, Amjad
    Mehmood, Zahid
    Saba, Tanzila
    Naqvi, Rizwan Ali
    [J]. IEEE ACCESS, 2021, 9 : 50253 - 50264
  • [26] Deep learning models for traffic flow prediction in autonomous vehicles: A review, solutions, and challenges
    Miglani, Arzoo
    Kumar, Neeraj
    [J]. VEHICULAR COMMUNICATIONS, 2019, 20
  • [27] Challenges in Building of Deep Learning Models for Glioblastoma Segmentation: Evidence from Clinical Data
    Kurmukov, Anvar
    Dalechina, Aleksandra
    Saparov, Talgat
    Belyaev, Mikhail
    Zolotova, Svetlana
    Golanov, Andrey
    Nikolaeva, Anna
    [J]. PUBLIC HEALTH AND INFORMATICS, PROCEEDINGS OF MIE 2021, 2021, 281 : 298 - 302
  • [28] On the challenges of global entity-aware deep learning models for groundwater level prediction
    Heudorfer, Benedikt
    Liesch, Tanja
    Broda, Stefan
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2024, 28 (03) : 525 - 543
  • [29] Deep Learning for Earthquake Disaster Assessment: Objects, Data, Models, Stages, Challenges, and Opportunities
    Jia, Jing
    Ye, Wenjie
    [J]. REMOTE SENSING, 2023, 15 (16)
  • [30] Data Management Challenges for Deep Learning
    Raj, Aiswarya
    Bosch, Jan
    Olsson, Helena Holmstrom
    Arpteg, Anders
    Brinne, Bjorn
    [J]. 2019 45TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2019), 2019, : 140 - 147