Fast Object Detection in Compressed Video

被引:34
|
作者
Wang, Shiyao [1 ,2 ,3 ]
Lu, Hongchao [1 ]
Deng, Zhidong [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, THUAI,BNRist,Ctr Intelligent Connected Vehicles &, Beijing, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Tsinghua Univ, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
SEARCH;
D O I
10.1109/ICCV.2019.00720
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection in videos has drawn increasing attention since it is more practical in real scenarios. Most of the deep learning methods use CNNs to process each decoded frame in a video stream individually. However, the free of charge yet valuable motion information already embedded in the video compression format is usually overlooked. In this paper, we propose a fast object detection method by taking advantage of this with a novel Motion aided Memory Network (MMNet). The MMNet has two major advantages: 1) It significantly accelerates the procedure of feature extraction for compressed videos. It only need to run a complete recognition network for I-frames, i.e. a few reference frames in a video, and it produces the features for the following P frames (predictive frames) with a light weight memory network, which runs fast; 2) Unlike existing methods that establish an additional network to model motion of frames, we take full advantage of both motion vectors and residual errors that are freely available in video streams. To our best knowledge, the MMNet is the first work that investigates a deep convolutional detector on compressed videos. Our method is evaluated on the large-scale ImageNet VID dataset, and the results show that it is 3x times faster than single image detector R-FCN and 10x times faster than high-performance detector MANet at a minor accuracy loss.
引用
收藏
页码:7103 / 7112
页数:10
相关论文
共 50 条
  • [41] Small and fast moving object detection and tracking in sports video sequences
    Zaveri, MA
    Merchant, SN
    Desai, UB
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1539 - 1542
  • [42] A Change Information Based Fast Algorithm for Video Object Detection and Tracking
    Subudhi, Badri Narayan
    Nanda, Pradipta Kumar
    Ghosh, Ashish
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (07) : 993 - 1004
  • [43] Feature reduction and hierarchy of classifiers for fast object detection in video images
    Heisele, B
    Serre, T
    Mukherjee, S
    Poggio, T
    [J]. 2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2001, : 18 - 24
  • [44] Surveillance video synopsis in the compressed domain for fast video browsing
    Wang, Shi-zheng
    Wang, Zhong-yuan
    Hu, Rui-min
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2013, 24 (08) : 1431 - 1442
  • [45] Techniques for fast partitioning of compressed and uncompressed video
    Adjeroh, DA
    Lee, MC
    Orji, CU
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 1997, 4 (02) : 225 - 243
  • [46] A fast and robust shot detection method in HEVC/H.265 compressed video
    Rayatifard, M.
    Mehrabi, M.
    Ghanbari, M.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 41193 - 41211
  • [47] A fast encryption algorithm for compressed MPEG video
    Zhang, CN
    Wu, YW
    [J]. PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 1333 - 1336
  • [48] Fast segmentation method for MPEG compressed video
    Jin, Hong
    Zhou, Yuan-Hua
    Wang, Dong-Ping
    [J]. Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2001, 35 (09): : 1408 - 1411
  • [49] Techniques for Fast Partitioning of Compressed and Uncompressed Video
    Donald A. Adjeroh
    M.C. Lee
    Cyril U. Orji
    [J]. Multimedia Tools and Applications, 1997, 4 : 225 - 243
  • [50] A fast and robust shot detection method in HEVC/H.265 compressed video
    M. Rayatifard
    M. Mehrabi
    M. Ghanbari
    [J]. Multimedia Tools and Applications, 2024, 83 : 41193 - 41211