An Analysis of the Interaction Between Transfer Learning Protocols in Deep Neural Networks

被引:1
|
作者
Plested, Jo [1 ]
Gedeon, Tom [1 ]
机构
[1] Australian Natl Univ, Res Sch Comp Sci, Canberra, ACT, Australia
关键词
Transfer learning; Convolutional neural networks;
D O I
10.1007/978-3-030-36708-4_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We extend work on the transferability of features in deep neural networks to explore the interaction between training hyperparameters, optimal number of layers to transfer and the size of a target dataset. We show that using the commonly adopted transfer learning protocols results in increased overfitting and significantly decreased accuracy compared to optimal protocols, particularly for very small target datasets. We demonstrate that there is a relationship between fine-tuning hyperparameters used and the optimal number of layers to transfer. Our research shows that if this relationship is not taken into account, the optimal number of layers to transfer to the target dataset will likely be estimated incorrectly. Best practice transfer learning protocols cannot be predicted from existing research that has analysed transfer learning under very specific conditions that are not universally applicable. Extrapolating transfer learning training settings from previous findings can in fact be counterintuitive, particularly in the case of smaller datasets. We present optimal transfer learning protocols for various target dataset sizes from very small to large when source and target datasets and tasks are similar. Our results show that using these settings results in a large increase in accuracy when compared to commonly used transfer learning protocols. These results are most significant with very small target datasets. We observed an increase in accuracy of 47.8% on our smallest dataset which comprised of only 10 training examples per class. These findings are important as they are likely to improve outcomes from past, current and future research in transfer learning. We expect that researchers will want to re-examine their experiments to incorporate our findings and to check the robustness of their existing results.
引用
收藏
页码:312 / 323
页数:12
相关论文
共 50 条
  • [21] A Kernel Analysis of Feature Learning in Deep Neural Networks
    Canatar, Abdulkadir
    Pehlevan, Cengiz
    2022 58TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2022,
  • [22] Medical Image Analysis using Deep Convolutional Neural Networks: CNN Architectures and Transfer Learning
    Dutta, Pronnoy
    Upadhyay, Pradumn
    De, Madhurima
    Khalkar, R. G.
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020), 2020, : 175 - 180
  • [23] A Survey on Attacks and Their Countermeasures in Deep Learning: Applications in Deep Neural Networks, Federated, Transfer, and Deep Reinforcement Learning
    Ali, Haider
    Chen, Dian
    Harrington, Matthew
    Salazar, Nathaniel
    Al Ameedi, Mohannad
    Khan, Ahmad Faraz
    Butt, Ali R.
    Cho, Jin-Hee
    IEEE ACCESS, 2023, 11 : 120095 - 120130
  • [24] Computationally Efficient Training of Deep Neural Networks via Transfer Learning
    Oyen, Diane
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2019, 2019, 10996
  • [25] Deep Neural Networks with Transfer Learning Model for Brain Tumors Classification
    Bulla, Premamayudu
    Anantha, Lakshmipathi
    Peram, Subbarao
    TRAITEMENT DU SIGNAL, 2020, 37 (04) : 593 - 601
  • [26] Transfer learning approach in deep neural networks for uterine fibroid detection
    Sundar, Sumod
    Sumathy, S.
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2022, 25 (01) : 52 - 63
  • [27] Filter Pruning for Efficient Transfer Learning in Deep Convolutional Neural Networks
    Reinhold, Caique
    Roisenberg, Mauro
    ARTIFICIAL INTELLIGENCEAND SOFT COMPUTING, PT I, 2019, 11508 : 191 - 202
  • [28] Deep neural networks and transfer learning applied to multimedia web mining
    Lopez-Sanchez, Daniel
    Gonzalez Arrieta, Angelica
    Corchado, Juan M.
    DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2018, 620 : 124 - 131
  • [29] Transfer Learning from Deep Neural Networks for Predicting Student Performance
    Tsiakmaki, Maria
    Kostopoulos, Georgios
    Kotsiantis, Sotiris
    Ragos, Omiros
    APPLIED SCIENCES-BASEL, 2020, 10 (06):
  • [30] Deep neural networks with transfer learning model for brain tumors classification
    Bulla P.
    Anantha L.
    Peram S.
    Bulla, Premamayudu (drbpm_it@vignan.ac.in), 1600, International Information and Engineering Technology Association (37): : 593 - 601