SAViT: Structure-Aware Vision Transformer Pruning via Collaborative Optimization

被引:0
|
作者
Zheng, Chuanyang [1 ]
Li, Zheyang [1 ,2 ]
Zhang, Kai [1 ]
Yang, Zhi [1 ]
Tan, Wenming [1 ]
Xiao, Jun [2 ]
Ren, Ye [1 ]
Pu, Shiliang [1 ]
机构
[1] Hikvis Res Inst, Hangzhou, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision Transformers (ViTs) yield impressive performance across various vision tasks. However, heavy computation and memory footprint make them inaccessible for edge devices. Previous works apply importance criteria determined independently by each individual component to prune ViTs. Considering that heterogeneous components in ViTs play distinct roles, these approaches lead to suboptimal performance. In this paper, we introduce joint importance, which integrates essential structural-aware interactions between components for the first time, to perform collaborative pruning. Based on the theoretical analysis, we construct a Taylor-based approximation to evaluate the joint importance. This guides pruning toward a more balanced reduction across all components. To further reduce the algorithm complexity, we incorporate the interactions into the optimization function under some mild assumptions. Moreover, the proposed method can be seamlessly applied to various tasks including object detection. Extensive experiments demonstrate the effectiveness of our method. Notably, the proposed approach outperforms the existing state-of-the-art approaches on ImageNet, increasing accuracy by 0.7% over the DeiT-Base baseline while saving 50% FLOPs. On COCO, we are the first to show that 70% FLOPs of Faster R-CNN with ViT backbone can be removed with only 0.3% mAP drop. The code is available at https://github.com/hikvision-research/SAViT.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Parallel Pencil Drawing Stylization via Structure-Aware Optimization
    Chen, Zhihua
    Jin, Yuxi
    Sheng, Bin
    Li, Ping
    Sun, Hanqiu
    [J]. PROCEEDINGS OF THE 31ST INTERNATIONAL CONFERENCE ON COMPUTER ANIMATION AND SOCIAL AGENTS (CASA 2016), 2015, : 32 - 37
  • [2] Structure-Aware Transformer for Graph Representation Learning
    Chen, Dexiong
    O'Bray, Leslie
    Borgwardt, Karsten
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [3] Table Fact Verification with Structure-Aware Transformer
    Zhang, Hongzhi
    Wang, Yingyao
    Wang, Sirui
    Cao, Xuezhi
    Zhang, Fuzheng
    Wang, Zhongyuan
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1624 - 1629
  • [4] RNA design via structure-aware multifrontier ensemble optimization
    Zhou, Tianshuo
    Dai, Ning
    Li, Sizhen
    Ward, Max
    Mathews, David H.
    Huang, Liang
    [J]. BIOINFORMATICS, 2023, 39 : i563 - i571
  • [5] StructCoder: Structure-Aware Transformer for Code Generation
    Tipirneni, Sindhu
    Zhu, Ming
    Reddy, Chandan K.
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (03)
  • [6] RNA design via structure-aware multifrontier ensemble optimization
    Zhou, Tianshuo
    Dai, Ning
    Li, Sizhen
    Ward, Max
    Mathews, David H.
    Huang, Liang
    [J]. BIOINFORMATICS, 2023, 39 : I563 - I571
  • [7] Global Vision Transformer Pruning with Hessian-Aware Saliency
    Yang, Huanrui
    Yin, Hongxu
    Shen, Maying
    Molchanov, Pavlo
    Li, Hai
    Kautz, Jan
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18547 - 18557
  • [8] Structure-Aware Collaborative Representation for Hyperspectral Image Classification
    Li, Wei
    Zhang, Yuxiang
    Liu, Na
    Du, Qian
    Tao, Ran
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (09): : 7246 - 7261
  • [9] Chemsearch: collaborative compound libraries with structure-aware browsing
    Gaffney, Stephen G.
    Smaga, Sarah
    Schepartz, Alanna
    Townsend, Jeffrey P.
    [J]. BIOINFORMATICS ADVANCES, 2021, 1 (01):
  • [10] Structure-aware geometric optimization of hexahedral mesh
    Wang, Rui
    Zheng, Zhihao
    Yu, Weishan
    Shao, Yanli
    Gao, Shuming
    [J]. COMPUTER-AIDED DESIGN, 2021, 138