Layer Removal for Transfer Learning with Deep Convolutional Neural Networks

被引:2
|
作者
Zhi, Weiming [1 ]
Chen, Zhenghao [2 ]
Yueng, Henry Wing Fung [2 ]
Lu, Zhicheng [2 ]
Zandavi, Seid Miad [2 ]
Chung, Yuk Ying [2 ]
机构
[1] Univ Auckland, Dept Engn Sci, Auckland 1010, New Zealand
[2] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
关键词
Convolutional neural networks; Transfer learning; Deep learning;
D O I
10.1007/978-3-319-70096-0_48
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is usually difficult to find datasets of sufficient size to train Deep Convolutional Neural Networks (DCNNs) from scratch. In practice, a neural network is often pre-trained on a very large source dataset. Then, a target dataset is transferred onto the neural network. This approach is a form of transfer learning, and allows very deep networks to achieve outstanding performance even when a small target dataset is available. It is thought that the bottom layers of the pre-trained network contain general information, which are applicable to different datasets and tasks, while the upper layers of the pre-trained network contain abstract information relevant to a specific dataset and task. While studies have been conducted on the fine-tuning of these layers, the removal of these layers have not yet been considered. This paper explores the effect of removing the upper convolutional layers of a pre-trained network. We empirically investigated whether removing upper layers of a deep pre-trained network can improve performance for transfer learning. We found that removing upper pre-trained layers gives a significant boost in performance, but the ideal number of layers to remove depends on the dataset. We suggest removing pre-trained convolutional layers when applying transfer learning on off-the-shelf pre-trained DCNNs. The ideal number of layers to remove will depend on the dataset, and remain as a parameter to be tuned.
引用
收藏
页码:460 / 469
页数:10
相关论文
共 50 条
  • [21] A Transfer Learning Approach for Diabetic Retinopathy Classification Using Deep Convolutional Neural Networks
    Krishnan, Arvind Sai
    Clive, Derik R.
    Bhat, Vilas
    Ramteke, Pravin Bhaskar
    Koolagudi, Shashidhar G.
    [J]. IEEE INDICON: 15TH IEEE INDIA COUNCIL INTERNATIONAL CONFERENCE, 2018,
  • [22] Deep Transfer Learning Based on Convolutional Neural Networks for Intelligent Fault Diagnosis of Spacecraft
    Xiang, Gang
    Chen, Wenjing
    Peng, Yu
    Wang, Yuanjin
    Qu, Chen
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5522 - 5526
  • [23] Breast cancer masses classification using deep convolutional neural networks and transfer learning
    Hassan, Shayma'a A.
    Sayed, Mohammed S.
    Abdalla, Mahmoud, I
    Rashwan, Mohsen A.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (41-42) : 30735 - 30768
  • [24] Breast cancer masses classification using deep convolutional neural networks and transfer learning
    Shayma’a A. Hassan
    Mohammed S. Sayed
    Mahmoud I Abdalla
    Mohsen A. Rashwan
    [J]. Multimedia Tools and Applications, 2020, 79 : 30735 - 30768
  • [25] Intelligent Identification of Jute Pests Based on Transfer Learning and Deep Convolutional Neural Networks
    Md Sakib Ullah Sourav
    Huidong Wang
    [J]. Neural Processing Letters, 2023, 55 : 2193 - 2210
  • [26] Classification of Diabetic Retinopathy Disease with Transfer Learning using Deep Convolutional Neural Networks
    Somasundaram, Krishnamoorthy
    Sivakumar, Paulraj
    Suresh, Durairaj
    [J]. ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2021, 21 (03) : 49 - 56
  • [27] Intelligent Identification of Jute Pests Based on Transfer Learning and Deep Convolutional Neural Networks
    Sourav, Md Sakib Ullah
    Wang, Huidong
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (03) : 2193 - 2210
  • [28] Automated tea quality identification based on deep convolutional neural networks and transfer learning
    Zhang, Cheng
    Wang, Jin
    Lu, Guodong
    Fei, Shaomei
    Zheng, Tao
    Huang, Bincheng
    [J]. JOURNAL OF FOOD PROCESS ENGINEERING, 2023, 46 (04)
  • [29] Sparse Deep Transfer Learning for Convolutional Neural Network
    Liu, Jiaming
    Wang, Yali
    Qiao, Yu
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2245 - 2251
  • [30] Deep Learning Convolutional Neural Networks for Radio Identification
    Riyaz, Shamnaz
    Sankhe, Kunal
    Ioannidis, Stratis
    Chowdhury, Kaushik
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (09) : 146 - 152