Skip-Convolutions for Efficient Video Processing

被引:28
|
作者
Habibian, Amirhossein [1 ]
Abati, Davide [1 ]
Cohen, Taco S. [1 ]
Bejnordi, Babak Ehteshami [1 ]
机构
[1] Qualcomm AI Res, San Diego, CA 92121 USA
关键词
D O I
10.1109/CVPR46437.2021.00272
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose Skip-Convolutions to leverage the large amount of redundancies in video streams and save computations. Each video is represented as a series of changes across frames and network activations, denoted as residuals. We reformulate standard convolution to be efficiently computed on residual frames: each layer is coupled with a binary gate deciding whether a residual is important to the model prediction, e.g. foreground regions, or it can be safely skipped, e.g. background regions. These gates can either be implemented as an efficient network trained jointly with convolution kernels, or can simply skip the residuals based on their magnitude. Gating functions can also incorporate block-wise sparsity structures, as required for efficient implementation on hardware platforms. By replacing all convolutions with Skip-Convolutions in two state-of-the-art architectures, namely EfficientDet and HRNet, we reduce their computational cost consistently by a factor of 3 similar to 4x for two different tasks, without any accuracy drop. Extensive comparisons with existing model compression, as well as image and video efficiency methods demonstrate that Skip-Convolutions set a new state-of-the-art by effectively exploiting the temporal redundancies in videos.
引用
收藏
页码:2694 / 2703
页数:10
相关论文
共 50 条
  • [21] Aggregation Skip Graph: A Skip Graph Extension for Efficient Aggregation Query
    Abe, Kota
    Abe, Toshiyuki
    Ueda, Tatsuya
    Ishibashi, Hayato
    Matsuura, Toshio
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN P2P SYSTEMS (AP2PS 2010), 2010, : 93 - 99
  • [22] Early Video Pioneer: An Interview with Skip Blumberg
    La Rosa, Melanie
    JOURNAL OF FILM AND VIDEO, 2012, 64 (1-2) : 30 - 41
  • [23] PASS: Patch Automatic Skip Scheme for Efficient Real-Time Video Perception on Edge Devices
    Zhou, Qihua
    Guo, Song
    Pan, Jun
    Liang, Jiacheng
    Xu, Zhenda
    Zhou, Jingren
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3787 - 3795
  • [24] Affine SKIP and MERGE Modes for Video Coding
    Chen, Huanbang
    Liang, Fan
    Lin, Sixin
    2015 IEEE 17TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2015,
  • [25] Introducing skip mode in distributed video coding
    Mys, Stefaan
    Slowack, Jurgen
    Skorupa, Jozef
    Lambert, Peter
    Van de Walle, Rik
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2009, 24 (03) : 200 - 213
  • [26] EFFICIENT DEALIASED CONVOLUTIONS WITHOUT PADDING
    Bowman, John C.
    Roberts, Malcolm
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2011, 33 (01): : 386 - 406
  • [27] A note on efficient density estimators of convolutions
    Bandyopadhyay, Soutir
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (11) : 3056 - 3060
  • [28] Can Dilated Convolutions Capture Ultrasound Video Dynamics?
    Maraci, Mohammad Ali
    Xie, Weidi
    Noble, J. Alison
    MACHINE LEARNING IN MEDICAL IMAGING: 9TH INTERNATIONAL WORKSHOP, MLMI 2018, 2018, 11046 : 116 - 124
  • [29] ConvPoint: Continuous convolutions for point cloud processing
    Boulch, Alexandre
    COMPUTERS & GRAPHICS-UK, 2020, 88 : 24 - 34
  • [30] Efficient Post-Video Processing for Thin Display Devices
    Jeong, Jin-Hwan
    Kim, Hag-Young
    2010 DIGEST OF TECHNICAL PAPERS INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS ICCE, 2010,