KTAN: Knowledge Transfer Adversarial Network

被引:11
|
作者
Liu, Peiye [1 ]
Liu, Wu [2 ]
Ma, Huadong [1 ]
Jiang, Zhewei [3 ]
Seok, Mingoo [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] JD AI Res, Beijing, Peoples R China
[3] Columbia Univ, New York, NY USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Knowledge transfer; distillation; adversarial learning;
D O I
10.1109/ijcnn48605.2020.9207235
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge distillation was pioneered to transfer the generalization ability of a large teacher deep network to a light-weight student network. The student network can retain the high quality of the teacher network, yet exhibiting low computational complexity and storage requirement, which is attractive for deploying a deep convolution neural network on a resource-constrained mobile device. However, most of the existing methods focus on transferring the probability distribution of a softmax layer in a teacher network and neglect the intermediate representations. However, we find that the intermediate representation is critical for a student network to better understand the transferred generalization as compared to the probability distribution only. In this paper, therefore, we propose such a knowledge transfer adversarial network method which holistically considers both intermediate representations and probability distributions of a teacher network. To transfer the knowledge of intermediate representations, we set high-level teacher feature maps as a target, toward which the method trains student feature maps. Furthermore, to support various structures of a student network, we arrange a novel teacher-to-student layer. Finally, the proposed method employs an adversarial learning process. Specifically, it includes a discriminator network to fully exploit the spatial correlation of feature maps during the training process of a student network. The experimental results demonstrate that the proposed method can significantly improve the performance of a student network on two important vision tasks, image classification and object detection.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] An Adversarial Transfer Network for Knowledge Representation Learning
    Wang, Huijuan
    Li, Shuangyin
    Pan, Rong
    [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 1749 - 1760
  • [2] Improving Heterogeneous Network Knowledge Transfer Based on the Principle of Generative Adversarial
    Lei, Feifei
    Cheng, Jieren
    Yang, Yue
    Tang, Xiangyan
    Sheng, Victor S.
    Huang, Chunzao
    [J]. ELECTRONICS, 2021, 10 (13)
  • [3] Generative Adversarial Network-Based Transfer Reinforcement Learning for Routing With Prior Knowledge
    Dong, Tianjian
    Qi, Qi
    Wang, Jingyu
    Liu, Alex X.
    Sun, Haifeng
    Zhuang, Zirui
    Liao, Jianxin
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (02): : 1673 - 1689
  • [4] MKTN: Adversarial-Based Multifarious Knowledge Transfer Network from Complementary Teachers
    Zhang, Xiaobing
    Chang, Heyu
    Hao, Yaohui
    Chang, Dexian
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [5] MKTN: Adversarial-Based Multifarious Knowledge Transfer Network from Complementary Teachers
    Xiaobing Zhang
    Heyu Chang
    Yaohui Hao
    Dexian Chang
    [J]. International Journal of Computational Intelligence Systems, 17
  • [6] Knowledge Transfer with Weighted Adversarial Network for Cold-Start Store Site Recommendation
    Liu, Yan
    Guo, Bin
    Zhang, Daqing
    Zeghlache, Djamal
    Chen, Jingmin
    Hu, Ke
    Zhang, Sizhe
    Zhou, Dan
    Yu, Zhiwen
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (03)
  • [7] Adversarial Knowledge Transfer from Unlabeled Data
    Gupta, Akash
    Panda, Rameswar
    Paul, Sujoy
    Zhang, Jianming
    Roy-Chowdhury, Amit K.
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2175 - 2183
  • [8] A Knowledge Selective Adversarial Network for Link Prediction in Knowledge Graph
    Hu, Kairong
    Liu, Hai
    Hao, Tianyong
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 171 - 183
  • [9] Hierarchical Knowledge Squeezed Adversarial Network Compression
    Li, Peng
    Shu, Changyong
    Xie, Yuan
    Qu, Yanyun
    Kong, Hui
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11370 - 11377
  • [10] Adversarial Separation Network for Text Style Transfer
    Yang, Haitong
    Zhou, Guangyou
    He, Tingting
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (02)