Skip-Convolutions for Efficient Video Processing

被引:28
|
作者
Habibian, Amirhossein [1 ]
Abati, Davide [1 ]
Cohen, Taco S. [1 ]
Bejnordi, Babak Ehteshami [1 ]
机构
[1] Qualcomm AI Res, San Diego, CA 92121 USA
关键词
D O I
10.1109/CVPR46437.2021.00272
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose Skip-Convolutions to leverage the large amount of redundancies in video streams and save computations. Each video is represented as a series of changes across frames and network activations, denoted as residuals. We reformulate standard convolution to be efficiently computed on residual frames: each layer is coupled with a binary gate deciding whether a residual is important to the model prediction, e.g. foreground regions, or it can be safely skipped, e.g. background regions. These gates can either be implemented as an efficient network trained jointly with convolution kernels, or can simply skip the residuals based on their magnitude. Gating functions can also incorporate block-wise sparsity structures, as required for efficient implementation on hardware platforms. By replacing all convolutions with Skip-Convolutions in two state-of-the-art architectures, namely EfficientDet and HRNet, we reduce their computational cost consistently by a factor of 3 similar to 4x for two different tasks, without any accuracy drop. Extensive comparisons with existing model compression, as well as image and video efficiency methods demonstrate that Skip-Convolutions set a new state-of-the-art by effectively exploiting the temporal redundancies in videos.
引用
收藏
页码:2694 / 2703
页数:10
相关论文
共 50 条
  • [31] ANT: Adapt Network Across Time for Efficient Video Processing
    Liang, Feng
    Chin, Ting-Wu
    Zhou, Yang
    Marculescu, Diana
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2602 - 2607
  • [32] Efficient Online Surveillance Video Processing Based on Spark Framework
    Zhang, Haitao
    Yan, Jin
    Kou, Yue
    BIG DATA COMPUTING AND COMMUNICATIONS, (BIGCOM 2016), 2016, 9784 : 309 - 318
  • [33] Robust and efficient post-processing for video object detection
    Sabater, Alberto
    Montesano, Luis
    Murillo, Ana C.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10536 - 10542
  • [34] Efficient Post-Video Processing for Thin Display Devices
    Jeong, Jin-Hwan
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (02) : 1097 - 1101
  • [35] Efficient Computational Cost Saving in Video Processing for QoE Estimation
    Llorente, Alvaro
    Perez, Javier Guinea
    Rodrigo, Juan Antonio
    Jimenez, David
    Menendez, Jose Manuel
    IEEE ACCESS, 2024, 12 : 34846 - 34862
  • [36] TCRSCANet: Harnessing Temporal Convolutions and Recurrent Skip Component for Enhanced RUL Estimation in Mechanical Systems
    Abdul Wahid
    John G. Breslin
    Muhammad Ali Intizar
    Human-Centric Intelligent Systems, 2024, 4 (1): : 1 - 24
  • [37] Network Video Frame-Skip Modeling and Simulation
    Cempron, Jonathan Paul C.
    Ilao, Joel P.
    2021 27TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE (M2VIP), 2021,
  • [38] AGGREGATED DILATED CONVOLUTIONS FOR EFFICIENT MOTION DEBLURRING
    Miao, Hong
    Zhang, Wenqiang
    Bai, Jiansong
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [39] CondConv: Conditionally Parameterized Convolutions for Efficient Inference
    Yang, Brandon
    Bender, Gabriel
    Le, Quoc V.
    Ngiam, Jiquan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [40] Structured Convolutions for Efficient Neural Network Design
    Bhalgat, Yash
    Zhang, Yizhe
    Lin, Jamie Menjay
    Porikli, Fatih
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33