Parameter Transfer Unit for Deep Neural Networks

Cited by: 12
Authors
Zhang, Yinghua [1]
Zhang, Yu [1]
Yang, Qiang [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
Keywords
Transfer learning; Deep neural networks
DOI
10.1007/978-3-030-16145-3_7
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Parameters in deep neural networks trained on large-scale databases can generalize across multiple domains, a property referred to as "transferability". Unfortunately, transferability is usually defined in terms of discrete states, and it differs across domains and network architectures. Existing works usually apply parameter sharing or fine-tuning heuristically, and there is no principled approach to learning a parameter transfer strategy. To address this gap, this paper proposes a Parameter Transfer Unit (PTU). The PTU learns a fine-grained nonlinear combination of activations from both the source domain network and the target domain network, and subsumes hand-crafted discrete transfer states. In the PTU, transferability is controlled by two gates, which are artificial neurons and can be learned from data. The PTU is a general and flexible module that can be used in both CNNs and RNNs. It can also be integrated with other transfer learning methods in a plug-and-play manner. Experiments are conducted with various network architectures and multiple transfer domain pairs. The results demonstrate the effectiveness of the PTU, which outperforms heuristic parameter sharing and fine-tuning in most settings.
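The abstract gives no equations, so the following is only an illustrative sketch of what a two-gate, learned combination of source and target activations could look like. It is not the paper's formulation: the GRU-style gating, the function `gated_transfer_unit`, the helper `random_params`, and the parameter names `W_f`, `W_g`, `W_c` are all assumptions made for this example.

```python
import math
import random


def sigmoid(x):
    """Logistic function, used here to keep gate values in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def affine(weights, bias, inputs):
    """Compute weights @ inputs + bias for a list-of-rows weight matrix."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, bias)]


def gated_transfer_unit(a_src, a_tgt, params):
    """Hypothetical two-gate combination of source and target activations.

    a_src, a_tgt: equal-length lists of activations from the same layer of
    the source-domain and target-domain networks (an assumed setup).
    """
    both = a_src + a_tgt  # list concatenation of the two activation vectors
    # Gate 1 (assumed): how much of each source activation enters the candidate.
    f = [sigmoid(z) for z in affine(params["W_f"], params["b_f"], both)]
    # Gate 2 (assumed): how much the candidate replaces the target activation.
    g = [sigmoid(z) for z in affine(params["W_g"], params["b_g"], both)]
    gated_src = [fi * s for fi, s in zip(f, a_src)]
    # Nonlinear candidate built from the gated source and the target activations.
    cand = [math.tanh(z)
            for z in affine(params["W_c"], params["b_c"], gated_src + a_tgt)]
    return [gi * c + (1.0 - gi) * t for gi, c, t in zip(g, cand, a_tgt)]


def random_params(dim, seed=0):
    """Build small random parameters (illustrative only, not trained)."""
    rng = random.Random(seed)

    def mat():
        return [[rng.uniform(-0.5, 0.5) for _ in range(2 * dim)]
                for _ in range(dim)]

    return {"W_f": mat(), "b_f": [0.0] * dim,
            "W_g": mat(), "b_g": [0.0] * dim,
            "W_c": mat(), "b_c": [0.0] * dim}


params = random_params(3)
a_src = [0.2, -0.4, 0.9]
a_tgt = [0.5, 0.1, -0.3]
mixed = gated_transfer_unit(a_src, a_tgt, params)

# Saturating the second gate towards 0 makes the unit pass the target
# activations through almost unchanged, i.e. a discrete "no transfer" state.
params["b_g"] = [-30.0] * 3
no_transfer = gated_transfer_unit(a_src, a_tgt, params)
```

In this sketch, pushing a gate towards 0 or 1 recovers an all-or-nothing choice, while intermediate gate values give a per-unit blend. That is one way to read the abstract's claim that a learned combination subsumes hand-crafted discrete transfer states.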
Pages: 82-95
Number of pages: 14