Fast Object Detection in Compressed Video

被引:34
|
作者
Wang, Shiyao [1 ,2 ,3 ]
Lu, Hongchao [1 ]
Deng, Zhidong [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, THUAI,BNRist,Ctr Intelligent Connected Vehicles &, Beijing, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Tsinghua Univ, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
SEARCH;
D O I
10.1109/ICCV.2019.00720
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection in videos has drawn increasing attention since it is more practical in real scenarios. Most of the deep learning methods use CNNs to process each decoded frame in a video stream individually. However, the free of charge yet valuable motion information already embedded in the video compression format is usually overlooked. In this paper, we propose a fast object detection method by taking advantage of this with a novel Motion aided Memory Network (MMNet). The MMNet has two major advantages: 1) It significantly accelerates the procedure of feature extraction for compressed videos. It only need to run a complete recognition network for I-frames, i.e. a few reference frames in a video, and it produces the features for the following P frames (predictive frames) with a light weight memory network, which runs fast; 2) Unlike existing methods that establish an additional network to model motion of frames, we take full advantage of both motion vectors and residual errors that are freely available in video streams. To our best knowledge, the MMNet is the first work that investigates a deep convolutional detector on compressed videos. Our method is evaluated on the large-scale ImageNet VID dataset, and the results show that it is 3x times faster than single image detector R-FCN and 10x times faster than high-performance detector MANet at a minor accuracy loss.
引用
收藏
页码:7103 / 7112
页数:10
相关论文
共 50 条
  • [1] A fast and accurate video object detection and segmentation nethod in the compressed domain
    Wang, ZH
    Liu, GZ
    Liu, L
    [J]. PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 1209 - 1212
  • [2] Object detection and localization in compressed video
    Creusere, CD
    Dahman, G
    [J]. CONFERENCE RECORD OF THE THIRTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1 AND 2, 2001, : 93 - 97
  • [3] Multiple Moving Object Detection for Fast Video Content Description in Compressed Domain
    Francesca Manerba
    Jenny Benois-Pineau
    Riccardo Leonardi
    Boris Mansencal
    [J]. EURASIP Journal on Advances in Signal Processing, 2008
  • [4] Multiple moving object detection for fast video content description in compressed domain
    Manerba, Francesca
    Benois-Pineau, Jenny
    Leonardi, Riccardo
    Mansencal, Boris
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2008, 2008 (1)
  • [5] Moving object detection in wavelet compressed video
    Töreyin, BU
    Çetin, AE
    Aksay, A
    Akhan, MB
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2005, 20 (03) : 255 - 264
  • [6] Fast object detection in compressed JPEG Images
    Deguerre, Benjamin
    Chatelain, Clement
    Gasso, Gilles
    [J]. 2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 333 - 338
  • [7] FAST OBJECT DETECTION IN H264/AVC AND HEVC COMPRESSED DOMAINS FOR VIDEO SURVEILLANCE
    Jaballah, Sami
    Larabi, Mohamed-Chaker
    [J]. 2019 8TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP 2019), 2019, : 123 - 128
  • [8] Object detection and classification from compressed video streams
    Joshi, Suvarna
    Ojo, Stephen
    Yadav, Sangeeta
    Gulia, Preeti
    Gill, Nasib Singh
    Alsberi, Hassan
    Rizwan, Ali
    Hassan, Mohamed M.
    [J]. EXPERT SYSTEMS, 2023,
  • [9] Fast Object Detection in HEVC Intra Compressed Domain
    Chen, Liuhong
    Sun, Heming
    Katto, Jiro
    Zeng, Xiaoyang
    Fan, Yibo
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 756 - 760
  • [10] Fast object detection and segmentation in MPEG compressed domain
    Sukmarg, O
    Rao, KR
    [J]. IEEE 2000 TENCON PROCEEDINGS, VOLS I-III: INTELLIGENT SYSTEMS AND TECHNOLOGIES FOR THE NEW MILLENNIUM, 2000, : B364 - B368