Filter Grafting for Deep Neural Networks

被引:27
|
作者
Meng, Fanxu [1 ,2 ]
Cheng, Hao [2 ]
Li, Ke [2 ]
Xu, Zhixin [1 ]
Ji, Rongrong [3 ,4 ]
Sun, Xing [2 ]
Lu, Guangming [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen, Peoples R China
[2] Tencent Youtu Lab, Shanghai, Peoples R China
[3] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Fujian, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Guangdong, Peoples R China
关键词
D O I
10.1109/CVPR42600.2020.00663
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new learning paradigm called filter grafting, which aims to improve the representation capability of Deep Neural Networks (DNNs). The motivation is that DNNs have unimportant (invalid) filters (e.g., l(1) norm close to 0). These filters limit the potential of DNNs since they are identified as having little effect on the network. While filter pruning removes these invalid filters for efficiency consideration, filter grafting re-activates them from an accuracy boosting perspective. The activation is processed by grafting external information (weights) into invalid filters. To better perform the grafting process, we develop an entropy-based criterion to measure the information of filters and an adaptive weighting strategy for balancing the grafted information among networks. After the grafting operation, the network has very few invalid filters compared with its untouched state, enpowering the model with more representation capacity. We also perform extensive experiments on the classification and recognition tasks to show the superiority of our method. For example, the grafted MobileNetV2 outperforms the non-grafted MobileNetV2 by about 7 percent on CIFAR-100 dataset. Code is available at https.//github.com/fxmeng/filter-grafting.git.
引用
收藏
页码:6598 / 6606
页数:9
相关论文
共 50 条
  • [1] Holistic Filter Pruning for Efficient Deep Neural Networks
    Enderich, Lukas
    Timm, Fabian
    Burgard, Wolfram
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2595 - 2604
  • [2] Spatially-Adaptive Filter Units for Deep Neural Networks
    Tabernik, Domen
    Kristan, Matej
    Leonardis, Ales
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9388 - 9396
  • [3] Filter Pruning via Feature Discrimination in Deep Neural Networks
    He, Zhiqiang
    Qian, Yaguan
    Wang, Yuqi
    Wang, Bin
    Guan, Xiaohui
    Gu, Zhaoquan
    Ling, Xiang
    Zeng, Shaoning
    Wang, Haijiang
    Zhou, Wujie
    [J]. COMPUTER VISION, ECCV 2022, PT XXI, 2022, 13681 : 245 - 261
  • [4] Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
    He, Yang
    Kang, Guoliang
    Dong, Xuanyi
    Fu, Yanwei
    Yang, Yi
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2234 - 2240
  • [5] Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks
    He, Yang
    Dong, Xuanyi
    Kang, Guoliang
    Fu, Yanwei
    Yan, Chenggang
    Yang, Yi
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (08) : 3594 - 3604
  • [6] Filter-Wise Quantization of Deep Neural Networks for IoT Devices
    Kim, Hoseung
    Jo, Geunhye
    Lee, Hayun
    Shin, Dongkun
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
  • [7] Learning Filter Pruning Criteria for Deep Convolutional Neural Networks Acceleration
    He, Yang
    Ding, Yuhang
    Liu, Ping
    Zhu, Linchao
    Zhang, Hanwang
    Yang, Yi
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2006 - 2015
  • [8] On Filter Generalization for Music Bandwidth Extension Using Deep Neural Networks
    Sulun, Serkan
    Davies, Matthew E. P.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (01) : 132 - 142
  • [9] SFP: Similarity-based filter pruning for deep neural networks
    Li, Guoqing
    Li, Rengang
    Li, Tuo
    Shen, Chaoyao
    Zou, Xiaofeng
    Wang, Jiuyang
    Wang, Changhong
    Li, Nanjun
    [J]. INFORMATION SCIENCES, 2025, 689
  • [10] PatterNet: Explore and Exploit Filter Patterns for Efficient Deep Neural Networks
    Khaleghi, Behnam
    Mallappa, Uday
    Yaldiz, Duygu
    Yang, Haichao
    Shah, Monil
    Kang, Jaeyoung
    Rosing, Tajana
    [J]. PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 223 - 228