Skip-Convolutions for Efficient Video Processing

被引:28
|
作者
Habibian, Amirhossein [1 ]
Abati, Davide [1 ]
Cohen, Taco S. [1 ]
Bejnordi, Babak Ehteshami [1 ]
机构
[1] Qualcomm AI Res, San Diego, CA 92121 USA
关键词
D O I
10.1109/CVPR46437.2021.00272
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose Skip-Convolutions to leverage the large amount of redundancies in video streams and save computations. Each video is represented as a series of changes across frames and network activations, denoted as residuals. We reformulate standard convolution to be efficiently computed on residual frames: each layer is coupled with a binary gate deciding whether a residual is important to the model prediction, e.g. foreground regions, or it can be safely skipped, e.g. background regions. These gates can either be implemented as an efficient network trained jointly with convolution kernels, or can simply skip the residuals based on their magnitude. Gating functions can also incorporate block-wise sparsity structures, as required for efficient implementation on hardware platforms. By replacing all convolutions with Skip-Convolutions in two state-of-the-art architectures, namely EfficientDet and HRNet, we reduce their computational cost consistently by a factor of 3 similar to 4x for two different tasks, without any accuracy drop. Extensive comparisons with existing model compression, as well as image and video efficiency methods demonstrate that Skip-Convolutions set a new state-of-the-art by effectively exploiting the temporal redundancies in videos.
引用
收藏
页码:2694 / 2703
页数:10
相关论文
共 50 条
  • [1] Dissected 3D CNNs: Temporal skip connections for efficient online video processing
    Koepueklue, Okan
    Hoermann, Stefan
    Herzog, Fabian
    Cevikalp, Hakan
    Rigoll, Gerhard
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 215
  • [2] AFFINE SKIP AND DIRECT MODES FOR EFFICIENT VIDEO CODING
    Huang, Han
    Woods, John W.
    Zhao, Yao
    Bai, Huihui
    2012 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2012,
  • [3] Efficient processing of compressed video
    Wee, SJ
    Apostolopoulos, AG
    CONFERENCE RECORD OF THE THIRTY-SECOND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 855 - 859
  • [4] Efficient SKIP Mode Detection for Coarse Grain Quality Scalable Video Coding
    Shen, Liquan
    Sun, Yiwen
    Liu, Zhi
    Zhang, Zhaoyang
    IEEE SIGNAL PROCESSING LETTERS, 2010, 17 (10) : 887 - 890
  • [5] PASS: Patch Automatic Skip Scheme for Efficient On-Device Video Perception
    Zhou, Qihua
    Guo, Song
    Pan, Jun
    Liang, Jiacheng
    Guo, Jingcai
    Xu, Zhenda
    Zhou, Jingren
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 3938 - 3954
  • [6] DISTRIBUTED VIDEO CODING WITH ZERO MOTION SKIP AND EFFICIENT DCT COEFFICIENT ENCODING
    Hua, Guogang
    Chen, Chang Wen
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 777 - +
  • [7] Delta Distillation for Efficient Video Processing
    Habibian, Amirhossein
    Ben Yahia, Haitam
    Abati, Davide
    Gavves, Efstratios
    Porikli, Fatih
    COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 213 - 229
  • [8] Efficient intramode SKIP detection algorithm for H.264/AVC video encoder
    Kim, Byung-Gyu
    Kim, Jong-Ho
    OPTICAL ENGINEERING, 2006, 45 (09)
  • [9] An Efficient Intra Skip Decision Algorithm for H.264/AVC Video Coding
    Wang, Ying-Hong
    Cheng, Kuo-Hsiang
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2014, 17 (03): : 329 - 339
  • [10] Efficient two-stage early SKIP mode termination for depth video coding
    Zeng, Huanqiang
    Wang, Yongtao
    Wei, Zhe
    Cai, Canhui
    COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (04) : 1344 - 1352