Skip-Convolutions for Efficient Video Processing

被引：28

作者：

Habibian, Amirhossein ^{[1
]}

Abati, Davide ^{[1
]}

Cohen, Taco S. ^{[1
]}

Bejnordi, Babak Ehteshami ^{[1
]}

机构：

[1] Qualcomm AI Res, San Diego, CA 92121 USA

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

关键词：

D O I：

10.1109/CVPR46437.2021.00272

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose Skip-Convolutions to leverage the large amount of redundancies in video streams and save computations. Each video is represented as a series of changes across frames and network activations, denoted as residuals. We reformulate standard convolution to be efficiently computed on residual frames: each layer is coupled with a binary gate deciding whether a residual is important to the model prediction, e.g. foreground regions, or it can be safely skipped, e.g. background regions. These gates can either be implemented as an efficient network trained jointly with convolution kernels, or can simply skip the residuals based on their magnitude. Gating functions can also incorporate block-wise sparsity structures, as required for efficient implementation on hardware platforms. By replacing all convolutions with Skip-Convolutions in two state-of-the-art architectures, namely EfficientDet and HRNet, we reduce their computational cost consistently by a factor of 3 similar to 4x for two different tasks, without any accuracy drop. Extensive comparisons with existing model compression, as well as image and video efficiency methods demonstrate that Skip-Convolutions set a new state-of-the-art by effectively exploiting the temporal redundancies in videos.

引用

页码：2694 / 2703

页数：10

共 50 条

[31] ANT: Adapt Network Across Time for Efficient Video Processing
Liang, Feng
Chin, Ting-Wu
Zhou, Yang
Marculescu, Diana
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2602 - 2607
[32] Efficient Online Surveillance Video Processing Based on Spark Framework
Zhang, Haitao
Yan, Jin
Kou, Yue
BIG DATA COMPUTING AND COMMUNICATIONS, (BIGCOM 2016), 2016, 9784 : 309 - 318
[33] Robust and efficient post-processing for video object detection
Sabater, Alberto
Montesano, Luis
Murillo, Ana C.
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10536 - 10542
[34] Efficient Post-Video Processing for Thin Display Devices
Jeong, Jin-Hwan
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (02) : 1097 - 1101
[35] Efficient Computational Cost Saving in Video Processing for QoE Estimation
Llorente, Alvaro
Perez, Javier Guinea
Rodrigo, Juan Antonio
Jimenez, David
Menendez, Jose Manuel
IEEE ACCESS, 2024, 12 : 34846 - 34862
[36] TCRSCANet: Harnessing Temporal Convolutions and Recurrent Skip Component for Enhanced RUL Estimation in Mechanical Systems
Abdul Wahid
John G. Breslin
Muhammad Ali Intizar
Human-Centric Intelligent Systems, 2024, 4 (1): : 1 - 24
[37] Network Video Frame-Skip Modeling and Simulation
Cempron, Jonathan Paul C.
Ilao, Joel P.
2021 27TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE (M2VIP), 2021,
[38] AGGREGATED DILATED CONVOLUTIONS FOR EFFICIENT MOTION DEBLURRING
Miao, Hong
Zhang, Wenqiang
Bai, Jiansong
2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
[39] CondConv: Conditionally Parameterized Convolutions for Efficient Inference
Yang, Brandon
Bender, Gabriel
Le, Quoc V.
Ngiam, Jiquan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[40] Structured Convolutions for Efficient Neural Network Design
Bhalgat, Yash
Zhang, Yizhe
Lin, Jamie Menjay
Porikli, Fatih
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33

← 1 2 3 4 5 →