Skip-Convolutions for Efficient Video Processing

Citations: 28
Authors
Habibian, Amirhossein [1]
Abati, Davide [1]
Cohen, Taco S. [1]
Bejnordi, Babak Ehteshami [1]
Affiliations
[1] Qualcomm AI Res, San Diego, CA 92121 USA
DOI
10.1109/CVPR46437.2021.00272
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose Skip-Convolutions to leverage the large amount of redundancy in video streams and save computation. Each video is represented as a series of changes across frames and network activations, denoted as residuals. We reformulate standard convolution to be efficiently computed on residual frames: each layer is coupled with a binary gate deciding whether a residual is important to the model prediction, e.g. foreground regions, or can be safely skipped, e.g. background regions. These gates can either be implemented as an efficient network trained jointly with the convolution kernels, or can simply skip the residuals based on their magnitude. Gating functions can also incorporate block-wise sparsity structures, as required for efficient implementation on hardware platforms. By replacing all convolutions with Skip-Convolutions in two state-of-the-art architectures, namely EfficientDet and HRNet, we reduce their computational cost consistently by a factor of 3 to 4x for two different tasks, without any accuracy drop. Extensive comparisons with existing model compression methods, as well as image and video efficiency methods, demonstrate that Skip-Convolutions set a new state of the art by effectively exploiting the temporal redundancies in videos.
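The residual reformulation described in the abstract rests on the linearity of convolution: the output for the current frame equals the output for the previous frame plus the convolution of the frame difference. The following is a minimal NumPy sketch of that idea with a magnitude-based gate. It is not the authors' implementation. The function names `conv2d_same` and `skip_conv_step`, the threshold `eps`, the single-channel setting, and the choice to gate the input residual element-wise are all assumptions made for illustration. The sketch also masks a dense convolution, so it shows the arithmetic only and does not itself save any computation.

```python
import numpy as np

def conv2d_same(x, k):
    """Naive single-channel 2D cross-correlation with zero padding (odd kernel sizes)."""
    kh, kw = k.shape
    ph, pw = kh // 2, kw // 2
    xp = np.pad(x, ((ph, ph), (pw, pw)))  # zero padding keeps the output size equal to the input size
    out = np.zeros_like(x, dtype=float)
    for i in range(x.shape[0]):
        for j in range(x.shape[1]):
            out[i, j] = np.sum(xp[i:i + kh, j:j + kw] * k)
    return out

def skip_conv_step(x_t, x_prev, z_prev, k, eps):
    """One hypothetical skip-convolution step on a residual frame.

    x_t, x_prev: current and previous input frames
    z_prev:      convolution output already computed for x_prev
    eps:         magnitude threshold (assumed name) below which a residual is skipped
    """
    r = x_t - x_prev                          # residual frame
    gate = np.abs(r) > eps                    # binary gate: keep only significant changes
    z_t = z_prev + conv2d_same(r * gate, k)   # linearity: conv(x_t) = conv(x_prev) + conv(r)
    return z_t, gate

# Usage: only a small foreground patch changes between two frames.
rng = np.random.default_rng(0)
x0 = rng.normal(size=(6, 6))
x1 = x0.copy()
x1[2:4, 2:4] += 1.0
k = rng.normal(size=(3, 3))

z0 = conv2d_same(x0, k)
z1, gate = skip_conv_step(x1, x0, z0, k, eps=0.0)
```

With `eps=0.0` the gate passes every nonzero residual, so `z1` matches a full convolution of `x1` up to floating-point error. Raising `eps` trades accuracy for sparsity, which is the trade-off the gating functions in the abstract are meant to control.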
Pages: 2694-2703
Page count: 10